; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G191470 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G191470
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionCathepsin B-like
Genome locationCiama_Chr10:25774904..25781098
RNA-Seq ExpressionCaUC10G191470
SyntenyCaUC10G191470
Gene Ontology termsGO:0050790 - regulation of catalytic activity (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR012599 - Peptidase C1A, propeptide
IPR025660 - Cysteine peptidase, histidine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141146.1 cathepsin B-like protease 3 isoform X2 [Cucumis sativus]1.9e-17987.93Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK
        MASSH Y SLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK
Subjt:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK

Query:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
        LPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
Subjt:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC

Query:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGV+AYR+K DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY

Query:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        W            DGYFKIRRGTNECGIEEDVVAGLPS +NIAREAAI
Subjt:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_008465335.1 PREDICTED: cathepsin B-like isoform X1 [Cucumis melo]2.8e-17888.25Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHH-QVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASS LY SLSLLFLAAVCTFHH QV+AEEQVLKFKLDADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAVCTFHH-QVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        +LPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED
        CSHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGVNAYRIK DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED
Subjt:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED

Query:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        YW            DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_008465336.1 PREDICTED: cathepsin B-like isoform X2 [Cucumis melo]1.1e-17988.51Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK
        MASS LY SLSLLFLAAVCTFHHQV+AEEQVLKFKLDADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL+
Subjt:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK

Query:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
        LPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
Subjt:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC

Query:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGVNAYRIK DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY

Query:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        W            DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_038903448.1 cathepsin B-like protease 2 isoform X1 [Benincasa hispida]7.4e-17988.54Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHH-QVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASSHLY SLSLLFLAAVCTFHH QVYAEEQVL+FK +ADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAVCTFHH-QVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        KLPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED
        CSHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGVNAYRIK DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED
Subjt:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED

Query:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        YW            DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_038903449.1 cathepsin B-like protease 2 isoform X2 [Benincasa hispida]3.0e-18088.79Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK
        MASSHLY SLSLLFLAAVCTFHHQVYAEEQVL+FK +ADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK
Subjt:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK

Query:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
        LPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
Subjt:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC

Query:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGVNAYRIK DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY

Query:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        W            DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

TrEMBL top hitse value%identityAlignment
A0A0A0LFN4 Pept_C1 domain-containing protein2.3e-17887.68Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHH-QVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASSH Y SLSLLFLAAVCTFHH QVYAEEQVLKFKLDADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAVCTFHH-QVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        KLPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED
        CSHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGV+AYR+K DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGED
Subjt:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED

Query:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        YW            DGYFKIRRGTNECGIEEDVVAGLPS +NIAREAAI
Subjt:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

A0A1S3CNJ5 cathepsin B-like isoform X11.4e-17888.25Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHH-QVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASS LY SLSLLFLAAVCTFHH QV+AEEQVLKFKLDADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAVCTFHH-QVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        +LPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
Subjt:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED
        CSHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGVNAYRIK DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED
Subjt:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED

Query:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        YW            DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

A0A1S3CNM3 cathepsin B-like isoform X25.5e-18088.51Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK
        MASS LY SLSLLFLAAVCTFHHQV+AEEQVLKFKLDADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL+
Subjt:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK

Query:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
        LPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
Subjt:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC

Query:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGVNAYRIK DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY

Query:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        W            DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

A0A5A7U7U4 Cathepsin B-like isoform X25.5e-18088.51Show/hide
Query:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK
        MASS LY SLSLLFLAAVCTFHHQV+AEEQVLKFKLDADILQESIV+HVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL+
Subjt:  MASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLK

Query:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
        LPKSFDAREAWPQCISIGTIL                  DRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC
Subjt:  LPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC

Query:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEPAYPTP+CVRHCVDKNQIWRKTKHYGVNAYRIK DP DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDY

Query:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        W            DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  W------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

A0A6J1F2K3 cathepsin B-like protease 24.8e-16882.52Show/hide
Query:  MASSHLYFSLSLLFLAAVC-TFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL
        MASS+ + SLSLLFL A C   HHQVYAEEQVLKFKL+ADILQESIV+HVNEHP AGWKA MNP FSNYSVSQFK++LGVKQTPEKDLKSTPVLSHPKSL
Subjt:  MASSHLYFSLSLLFLAAVC-TFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSL

Query:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG
        KLPKSFDAREAWPQCI+IGTIL                  DRFCIH++MNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDT G
Subjt:  KLPKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTG

Query:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED
        CSHPGCEPAY TPKCVRHCVDKNQIWRK+KHYGVNAYRIK DPYDIMAEVYKNGPVEV FTVYEDFAHYKSGVYK+I GD +GGHAVKLIGWGT+DDGED
Subjt:  CSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGED

Query:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        YW            DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREA+I
Subjt:  YW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

SwissProt top hitse value%identityAlignment
F4HVZ1 Cathepsin B-like protease 11.7e-11757.79Show/hide
Query:  LSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDARE
        L+ +FL    +F+ Q  A E + K KL + ILQ  IV+ VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P++ H  SLKLPK FDAR 
Subjt:  LSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDARE

Query:  AWPQCISIGTIL--------------------------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE
        AW  C SI  IL                                      DRFCI +++N+SLS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+
Subjt:  AWPQCISIGTIL--------------------------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE

Query:  QCDPYFDTTGCSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLI
        +CDPYFD TGCSHPGCEP YPTPKC R CV +NQ+W ++KHYGV AYRI  DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITG  +GGHAVKLI
Subjt:  QCDPYFDTTGCSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLI

Query:  GWGTTDDGEDYW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNI
        GWGT+DDGEDYW            DGYFKIRRGTNECGIE+ VVAGLPS +N+
Subjt:  GWGTTDDGEDYW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNI

Q4R5M2 Cathepsin B1.3e-5840.69Show/hide
Query:  LQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISI------------------G
        L + +V +VN+     W+A  N  F N  VS  K L    LG  + P++       +   + LKLP+SFDARE WPQC +I                   
Subjt:  LQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISI------------------G

Query:  TILDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSH------PGCEPAYPTPKCVRHC-VD
         I DR CIH + ++S+ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C     TPKC + C   
Subjt:  TILDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSH------PGCEPAYPTPKCVRHC-VD

Query:  KNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW------------DGYFKIR
         +  +++ KHYG N+Y +     DIMAE+YKNGPVE +F+VY DF  YKSGVY+++TG++MGGHA++++GWG  ++G  YW            +G+FKI 
Subjt:  KNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW------------DGYFKIR

Query:  RGTNECGIEEDVVAGLP
        RG + CGIE +VVAG+P
Subjt:  RGTNECGIEEDVVAGLP

Q5R6D1 Cathepsin B2.3e-5840.69Show/hide
Query:  LQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISI------------------G
        L + +V +VN+     W+A  N  F N  VS  K L    LG  + P++       +   + LKLP+SFDARE WPQC +I                   
Subjt:  LQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISI------------------G

Query:  TILDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSH------PGCEPAYPTPKCVRHC-VD
         I DR CIH + ++S+ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C     TPKC + C   
Subjt:  TILDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYFDTTGCSH------PGCEPAYPTPKCVRHC-VD

Query:  KNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW------------DGYFKIR
         +  +++ KHYG N+Y +     DIMAE+YKNGPVE +F+VY DF  YKSGVY+++TG++MGGHA++++GWG  ++G  YW            +G+FKI 
Subjt:  KNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW------------DGYFKIR

Query:  RGTNECGIEEDVVAGLP
        RG + CGIE +VVAG+P
Subjt:  RGTNECGIEEDVVAGLP

Q93VC9 Cathepsin B-like protease 27.5e-12663.27Show/hide
Query:  SSHLYFSLSLLFLAAVCTFH-HQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKL
        S+ ++F L LL    + +F+  Q  A E + K KL + ILQ  IV+ VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P++SH  SLKL
Subjt:  SSHLYFSLSLLFLAAVCTFH-HQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKL

Query:  PKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS
        PK FDAR AW QC SIG IL                  DRFCI ++MN+SLSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTE+CDPYFD TGCS
Subjt:  PKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS

Query:  HPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW
        HPGCEPAYPTPKC R CV  NQ+WR++KHYGV+AY++++ P DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+DDGEDYW
Subjt:  HPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW

Query:  ------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
                    DGYFKIRRGTNECGIE  VVAGLPS RN+ +
Subjt:  ------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR

Q94K85 Cathepsin B-like protease 31.1e-12463.86Show/hide
Query:  LFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWP
        L L  +  F  +    E + K KLD+ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SLKLPK+FDAR AWP
Subjt:  LFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWP

Query:  QCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTP
        QC SIG IL                  DRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD TGCSHPGCEPAYPTP
Subjt:  QCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTP

Query:  KCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW-----------
        KC R CV  N++W ++KHY V+ Y +K++P DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYW           
Subjt:  KCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW-----------

Query:  -DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
         DGYF IRRGTNECGIE++ VAGLPS +N+ R
Subjt:  -DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR

Arabidopsis top hitse value%identityAlignment
AT1G02300.1 Cysteine proteinases superfamily protein1.2e-11857.79Show/hide
Query:  LSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDARE
        L+ +FL    +F+ Q  A E + K KL + ILQ  IV+ VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P++ H  SLKLPK FDAR 
Subjt:  LSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDARE

Query:  AWPQCISIGTIL--------------------------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE
        AW  C SI  IL                                      DRFCI +++N+SLS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+
Subjt:  AWPQCISIGTIL--------------------------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE

Query:  QCDPYFDTTGCSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLI
        +CDPYFD TGCSHPGCEP YPTPKC R CV +NQ+W ++KHYGV AYRI  DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITG  +GGHAVKLI
Subjt:  QCDPYFDTTGCSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLI

Query:  GWGTTDDGEDYW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNI
        GWGT+DDGEDYW            DGYFKIRRGTNECGIE+ VVAGLPS +N+
Subjt:  GWGTTDDGEDYW------------DGYFKIRRGTNECGIEEDVVAGLPSPRNI

AT1G02305.1 Cysteine proteinases superfamily protein5.3e-12763.27Show/hide
Query:  SSHLYFSLSLLFLAAVCTFH-HQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKL
        S+ ++F L LL    + +F+  Q  A E + K KL + ILQ  IV+ VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P++SH  SLKL
Subjt:  SSHLYFSLSLLFLAAVCTFH-HQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKL

Query:  PKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS
        PK FDAR AW QC SIG IL                  DRFCI ++MN+SLSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTE+CDPYFD TGCS
Subjt:  PKSFDAREAWPQCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCS

Query:  HPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW
        HPGCEPAYPTPKC R CV  NQ+WR++KHYGV+AY++++ P DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+DDGEDYW
Subjt:  HPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW

Query:  ------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
                    DGYFKIRRGTNECGIE  VVAGLPS RN+ +
Subjt:  ------------DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR

AT3G45310.1 Cysteine proteinases superfamily protein5.3e-1030.37Show/hide
Query:  FDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEQCDPYFDTTGCSHPGCE-PAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPY
        F   ISLS   L+ C G     GC GG P  A+ Y     G+ TE+  PY    G    GC+  A      VR  V+          + V   R      
Subjt:  FDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEQCDPYFDTTGCSHPGCE-PAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPY

Query:  DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMG------GHAVKLIGWGTTDDGEDYW------------DGYFKIRRGTNECGI
                  PV V+F V  +F  YK GV+   T +  G       HAV  +G+G  DD   YW            +GYFK+  G N CG+
Subjt:  DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMG------GHAVKLIGWGTTDDGEDYW------------DGYFKIRRGTNECGI

AT4G01610.1 Cysteine proteinases superfamily protein7.7e-12663.86Show/hide
Query:  LFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWP
        L L  +  F  +    E + K KLD+ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SLKLPK+FDAR AWP
Subjt:  LFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWP

Query:  QCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTP
        QC SIG IL                  DRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD TGCSHPGCEPAYPTP
Subjt:  QCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTP

Query:  KCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW-----------
        KC R CV  N++W ++KHY V+ Y +K++P DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYW           
Subjt:  KCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW-----------

Query:  -DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
         DGYF IRRGTNECGIE++ VAGLPS +N+ R
Subjt:  -DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR

AT4G01610.2 Cysteine proteinases superfamily protein7.7e-12663.86Show/hide
Query:  LFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWP
        L L  +  F  +    E + K KLD+ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SLKLPK+FDAR AWP
Subjt:  LFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWP

Query:  QCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTP
        QC SIG IL                  DRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFD TGCSHPGCEPAYPTP
Subjt:  QCISIGTIL------------------DRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTP

Query:  KCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW-----------
        KC R CV  N++W ++KHY V+ Y +K++P DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYW           
Subjt:  KCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYW-----------

Query:  -DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
         DGYF IRRGTNECGIE++ VAGLPS +N+ R
Subjt:  -DGYFKIRRGTNECGIEEDVVAGLPSPRNIAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCAGCATGGCGTACGAACATGGCTGACGTTAACTCGGACGTCGAATCAAAGGGAAAGTTCGGATTGGTCCTTTTTCCCAAATCTCAACTCTCTATTCTTTATAC
GGAACTTATTATCTCTTCCCATTTTCCCCATCCGCTCTCTCTTCCTCCTCCATCTCCATCGCCTTTCAAGGAAATGGCATCATCTCACTTGTATTTTTCCCTTTCCTTGC
TATTTTTGGCTGCCGTCTGCACCTTCCATCATCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAACTCGACGCTGATATTCTTCAGGAGTCTATCGTTCAGCATGTA
AATGAACACCCACAGGCTGGCTGGAAAGCTACCATGAACCCACGGTTTTCAAACTATTCTGTTAGCCAATTCAAGTACCTGCTTGGTGTCAAACAAACTCCTGAAAAGGA
TTTGAAAAGTACCCCTGTTTTATCCCATCCCAAGTCTTTAAAGTTGCCAAAAAGCTTTGATGCAAGAGAAGCTTGGCCTCAGTGTATCAGCATTGGAACCATTCTAGATC
GATTCTGCATCCATTTTGACATGAACATTTCTCTGTCTGTTAATGATCTTTTGGCATGCTGTGGCTTCATGTGTGGTGACGGCTGTGATGGGGGTTACCCAATTTCTGCA
TGGCGATATTTTGTTCGCCATGGAGTTGTTACTGAACAGTGTGATCCATATTTTGACACTACTGGTTGTTCCCATCCTGGTTGTGAACCTGCATATCCTACTCCTAAATG
TGTCAGGCATTGCGTAGATAAGAACCAGATTTGGAGAAAAACAAAGCACTATGGTGTTAATGCTTATAGGATTAAAACAGATCCCTATGATATCATGGCAGAAGTTTATA
AGAATGGACCAGTCGAGGTTTCCTTCACGGTGTATGAGGATTTTGCTCACTATAAATCTGGGGTTTACAAATATATTACTGGCGATGTAATGGGAGGGCATGCTGTAAAG
CTTATTGGATGGGGAACAACGGATGATGGAGAGGATTATTGGGATGGCTACTTCAAGATAAGAAGAGGAACGAATGAGTGTGGGATTGAGGAAGATGTTGTTGCTGGTTT
GCCCTCACCTAGAAATATTGCCAGGGAGGCTGCCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCAGCATGGCGTACGAACATGGCTGACGTTAACTCGGACGTCGAATCAAAGGGAAAGTTCGGATTGGTCCTTTTTCCCAAATCTCAACTCTCTATTCTTTATAC
GGAACTTATTATCTCTTCCCATTTTCCCCATCCGCTCTCTCTTCCTCCTCCATCTCCATCGCCTTTCAAGGAAATGGCATCATCTCACTTGTATTTTTCCCTTTCCTTGC
TATTTTTGGCTGCCGTCTGCACCTTCCATCATCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAACTCGACGCTGATATTCTTCAGGAGTCTATCGTTCAGCATGTA
AATGAACACCCACAGGCTGGCTGGAAAGCTACCATGAACCCACGGTTTTCAAACTATTCTGTTAGCCAATTCAAGTACCTGCTTGGTGTCAAACAAACTCCTGAAAAGGA
TTTGAAAAGTACCCCTGTTTTATCCCATCCCAAGTCTTTAAAGTTGCCAAAAAGCTTTGATGCAAGAGAAGCTTGGCCTCAGTGTATCAGCATTGGAACCATTCTAGATC
GATTCTGCATCCATTTTGACATGAACATTTCTCTGTCTGTTAATGATCTTTTGGCATGCTGTGGCTTCATGTGTGGTGACGGCTGTGATGGGGGTTACCCAATTTCTGCA
TGGCGATATTTTGTTCGCCATGGAGTTGTTACTGAACAGTGTGATCCATATTTTGACACTACTGGTTGTTCCCATCCTGGTTGTGAACCTGCATATCCTACTCCTAAATG
TGTCAGGCATTGCGTAGATAAGAACCAGATTTGGAGAAAAACAAAGCACTATGGTGTTAATGCTTATAGGATTAAAACAGATCCCTATGATATCATGGCAGAAGTTTATA
AGAATGGACCAGTCGAGGTTTCCTTCACGGTGTATGAGGATTTTGCTCACTATAAATCTGGGGTTTACAAATATATTACTGGCGATGTAATGGGAGGGCATGCTGTAAAG
CTTATTGGATGGGGAACAACGGATGATGGAGAGGATTATTGGGATGGCTACTTCAAGATAAGAAGAGGAACGAATGAGTGTGGGATTGAGGAAGATGTTGTTGCTGGTTT
GCCCTCACCTAGAAATATTGCCAGGGAGGCTGCCATATGATCCAGATGCAGCTGTTTCACAATCAAGCTTTGCTCAACCAAAGATATGTTTATGTGTCTTGCATTTGGGT
TAGAACTATTTATGGATATGAAGTTGGTTAATGCTTTGCTGAATGTTTCCAAGTATCTGTGTCTTAAACAATTAGTATGATGTCTGAAAAAAGATACTAATCAAATAATG
TGTTAATATTCTTGGAGAATGATGGCCCTTATAATACTTTCCCCTCCAGTATTGTTCCTCTGAAGAGAGAATATTACACTTGAACAAATTGGATTGGGAGGATGTCCATT
TTTCTTGTTTGGAATTTTCTTTTACAACTTAAAAGCTGA
Protein sequenceShow/hide protein sequence
MAAAWRTNMADVNSDVESKGKFGLVLFPKSQLSILYTELIISSHFPHPLSLPPPSPSPFKEMASSHLYFSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVQHV
NEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISIGTILDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISA
WRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPKCVRHCVDKNQIWRKTKHYGVNAYRIKTDPYDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKYITGDVMGGHAVK
LIGWGTTDDGEDYWDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI