CuGenDBv2

Lsi04G007770 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G007770
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Protein ASPARTIC PROTEASE IN GUARD CELL 2-like
Genome location: chr04:7303620..7306493
RNA-Seq Expression: Lsi04G007770
Synteny: Lsi04G007770
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
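
The genome location field uses a `seqid:start..end` notation. As a minimal sketch of how that field can be read programmatically, the Python below splits it into its parts and computes the span length. The names `Locus` and `parse_locus` are hypothetical, and the length assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval as written in the 'Genome location' field."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' string into a Locus."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_locus("chr04:7303620..7306493")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene model spans 2874 bp on chr04.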


Homology
GenBank top hits (e value, % identity, alignment)
KAA0034086.1 protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis melo var. makuwa] (e value: 1.9e-159; % identity: 69.51)
Query:  MFLKQ-PSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKR
        MF KQ  SSLL FL  L   AA  AARPS  TKFQYLNVKATKLDFNDGQILHTLNFSD  RQVS HKS N+TFKLNL+HRDKLSHVHGHR GFN+R+KR
Subjt:  MFLKQ-PSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKR

Query:  DAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGARRGVTDATNNPTRF----------LTPPTPPPLPVSLVAPPSA---------------
        DAIRVATLVRRLSHG  AV D ++K        I  GS   ++  V D+ ++               + P   P   S  A  S                
Subjt:  DAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGARRGVTDATNNPTRF----------LTPPTPPPLPVSLVAPPSA---------------

Query:  ---------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGA
                               L    V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGA
Subjt:  ---------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGA

Query:  TWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP
        TWISLIRNPRAPSFYYIGLAGIGVGGVRVS+PEETFQLTEFGTNGVVMDTGTAVTRLP +AYVA RDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP
Subjt:  TWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP

Query:  TVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        TVSFYFSDGP LTLPAKNFLIPV+GGGTFCLAFAPS SGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
Subjt:  TVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

XP_004147103.1 protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus] (e value: 4.1e-159; % identity: 67.08)
Query:  MFLKQPSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKRD
        MF KQ  S L FL  +   AAA AAR S  TKFQYLNVKATKLDFNDGQILH LNFSDG RQVS +KSDNNTFKLNL+HRDKLSHVHGHR GFN+R+KRD
Subjt:  MFLKQPSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKRD

Query:  AIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSA-
        AIRVATLVRRLSHG  A + D   +      +++SG E G+     R GV     N    +                    + P   P   S  A  S  
Subjt:  AIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSA-

Query:  -----------------------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST
                                             L    V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST
Subjt:  -----------------------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST

Query:  GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFD
        G LEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTE+GTNGVVMDTGTAVTR P  AYVAFRDSFTAQTSNLPRAPGVSIFD
Subjt:  GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFD

Query:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        TCYDLNGFESVRVPTVSFYFSDGPVLTLPA+NFLIPV+GGGTFCLAFAPS SGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
Subjt:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

XP_008445900.1 PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis melo] (e value: 3.2e-159; % identity: 67.77)
Query:  MFLKQ-PSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKR
        MF KQ  SSLL FL  L   AA  AARPS  TKFQYLNVKATKLDFNDGQILHTLNFSD  RQVS HKS N+TFKLNL+HRDKLSHVHGHR GFN+R+KR
Subjt:  MFLKQ-PSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKR

Query:  DAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSA
        DAIRVATLVRRLSHG  AV D ++K       +++SG E G+     R GV     +    +                    + P   P   S  A  S 
Subjt:  DAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSA

Query:  ------------------------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS
                                              L    V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS
Subjt:  ------------------------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS

Query:  TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIF
        TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVS+PEETFQLTEFGTNGVVMDTGTAVTRLP +AYVA RDSFTAQTSNLPRAPGVSIF
Subjt:  TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIF

Query:  DTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        DTCYDLNGFESVRVPTVSFYFSDGP LTLPAKNFLIPV+GGGTFCLAFAPS SGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
Subjt:  DTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

XP_023547721.1 protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita pepo subsp. pepo] (e value: 4.4e-145; % identity: 62.47)
Query:  MFLKQPSS-LLSFLHFLFFSAAAVA-ARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIK
        MFLK PSS LL  LH L F AAA A ARPSP   F YLNVK TKL+ N   IL TLNFSD   QV++ K  N T KLNL+HRDKL HVH HRH FNERIK
Subjt:  MFLKQPSS-LLSFLHFLFFSAAAVA-ARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIK

Query:  RDAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPS
        RDAIRVATL+RRLSH   A   D   +      +++SG E G+     R GV     +    +                    + P   P   S  A  S
Subjt:  RDAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPS

Query:  ---------------------------------ATALRT---PAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTG
                                           AL T     V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSR TG
Subjt:  ---------------------------------ATALRT---PAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTG

Query:  STGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSI
        STGTLEFGRGA+PVGATWISLIRNPRAPSFYYIGLAG+GVGGVRV +PEETFQL+E+GTNGVVMDTGTAVTRLP  AY AFRD+FTAQT+NLPRA GVSI
Subjt:  STGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSI

Query:  FDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        FDTC+DLNGFES+RVPTVSFYFSDGPVLTLPAKNFLIPVNG GTFCLAFAPS SGLSIIGNIQQEGIQIS DGANGFVGFGPN+C
Subjt:  FDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

XP_038893071.1 protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Benincasa hispida] (e value: 6.4e-168; % identity: 70.39)
Query:  MFLKQPSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKRD
        MFLKQPSSLL FL FLFFSAA  A R SP TKFQYLNVKATKLDFND QILHTLNFS   RQVS HKSDNNTFKLNL+HRDKLSHVHGHRHGFNERIKRD
Subjt:  MFLKQPSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKRD

Query:  AIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSAT
        AIRVATLVRRLSHGGGAV D+KYK      + ++SG E G+     R GV     +    +                    + P   P   S  A  S  
Subjt:  AIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSAT

Query:  A------------------------------------LRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST
        +                                    L    V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST
Subjt:  A------------------------------------LRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST

Query:  GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFD
        GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTE+GTNGVVMDTGTAVTRLP  AYVA RDSFTAQTSNLPRAPGVSIFD
Subjt:  GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFD

Query:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPV+GGGTFCLAFAPS SGLSIIGNIQQEGIQISFDGANGFVGFGPN+C
Subjt:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KPR4 Peptidase A1 domain-containing protein (e value: 2.0e-159; % identity: 67.08)
Query:  MFLKQPSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKRD
        MF KQ  S L FL  +   AAA AAR S  TKFQYLNVKATKLDFNDGQILH LNFSDG RQVS +KSDNNTFKLNL+HRDKLSHVHGHR GFN+R+KRD
Subjt:  MFLKQPSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKRD

Query:  AIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSA-
        AIRVATLVRRLSHG  A + D   +      +++SG E G+     R GV     N    +                    + P   P   S  A  S  
Subjt:  AIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSA-

Query:  -----------------------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST
                                             L    V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST
Subjt:  -----------------------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST

Query:  GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFD
        G LEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTE+GTNGVVMDTGTAVTR P  AYVAFRDSFTAQTSNLPRAPGVSIFD
Subjt:  GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFD

Query:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        TCYDLNGFESVRVPTVSFYFSDGPVLTLPA+NFLIPV+GGGTFCLAFAPS SGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
Subjt:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

A0A1S3BEG7 protein ASPARTIC PROTEASE IN GUARD CELL 2-like (e value: 1.5e-159; % identity: 67.77)
Query:  MFLKQ-PSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKR
        MF KQ  SSLL FL  L   AA  AARPS  TKFQYLNVKATKLDFNDGQILHTLNFSD  RQVS HKS N+TFKLNL+HRDKLSHVHGHR GFN+R+KR
Subjt:  MFLKQ-PSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKR

Query:  DAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSA
        DAIRVATLVRRLSHG  AV D ++K       +++SG E G+     R GV     +    +                    + P   P   S  A  S 
Subjt:  DAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAPPSA

Query:  ------------------------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS
                                              L    V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS
Subjt:  ------------------------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS

Query:  TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIF
        TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVS+PEETFQLTEFGTNGVVMDTGTAVTRLP +AYVA RDSFTAQTSNLPRAPGVSIF
Subjt:  TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIF

Query:  DTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        DTCYDLNGFESVRVPTVSFYFSDGP LTLPAKNFLIPV+GGGTFCLAFAPS SGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
Subjt:  DTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

A0A5A7SVY2 Protein ASPARTIC PROTEASE IN GUARD CELL 2-like (e value: 9.0e-160; % identity: 69.51)
Query:  MFLKQ-PSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKR
        MF KQ  SSLL FL  L   AA  AARPS  TKFQYLNVKATKLDFNDGQILHTLNFSD  RQVS HKS N+TFKLNL+HRDKLSHVHGHR GFN+R+KR
Subjt:  MFLKQ-PSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKR

Query:  DAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGARRGVTDATNNPTRF----------LTPPTPPPLPVSLVAPPSA---------------
        DAIRVATLVRRLSHG  AV D ++K        I  GS   ++  V D+ ++               + P   P   S  A  S                
Subjt:  DAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGARRGVTDATNNPTRF----------LTPPTPPPLPVSLVAPPSA---------------

Query:  ---------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGA
                               L    V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGA
Subjt:  ---------------------TALRTPAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGA

Query:  TWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP
        TWISLIRNPRAPSFYYIGLAGIGVGGVRVS+PEETFQLTEFGTNGVVMDTGTAVTRLP +AYVA RDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP
Subjt:  TWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP

Query:  TVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        TVSFYFSDGP LTLPAKNFLIPV+GGGTFCLAFAPS SGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
Subjt:  TVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

A0A6J1GXH5 protein ASPARTIC PROTEASE IN GUARD CELL 2-like (e value: 1.3e-142; % identity: 61.63)
Query:  MFLKQPSS-LLSFLHFLFFSAAAVA--ARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHG----HRHGF
        MFLK PSS LL  LH L F AAA A  ARPSP   F YLNVK TKL+ N   IL TLNFS    QV++ K  N T KLNL+HRDKL HVH     HRH F
Subjt:  MFLKQPSS-LLSFLHFLFFSAAAVA--ARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHG----HRHGF

Query:  NERIKRDAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNP------------------TRFLTPPTPPPLPVSLVA
        NERIKRDAIRVATLVRRLSH   AV D+ YK      + ++SG E G+     R GV     +                   T+      P   P    +
Subjt:  NERIKRDAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNP------------------TRFLTPPTPPPLPVSLVA

Query:  PPSATALRTPAVTLDDVDM---RCR--------------------------------------------AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLV
            +   T    LD+ D    RCR                                            AAGLLGLGGGSMS IGQLGGQTGGAFSYCLV
Subjt:  PPSATALRTPAVTLDDVDM---RCR--------------------------------------------AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLV

Query:  SRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRA
        SR TGSTGTLEFGRGA+PVGATWISLIRNPRAPSFYYIGLAG+GVGGVRV +PEETFQL+E+GTNGVVMDTGTAVTRLP  AY AFRD+F AQT+NLPRA
Subjt:  SRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRA

Query:  PGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
         GVSIFDTC+DLNGFES+RVPTVSFYFSDGPVLTLPAKNFLIPVNG GTFCLAFAPS SGLSIIGNIQQEGIQIS DGANGFVGFGPN+C
Subjt:  PGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

A0A6J1KA06 protein ASPARTIC PROTEASE IN GUARD CELL 2-like (e value: 2.4e-144; % identity: 63.04)
Query:  MFLKQPSS-LLSFLHFLFFSAAAVA-ARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHG--HRHGFNER
        MFLK PSS LL  LH LF +AAA A ARPS    F YLNVK TKL+ N  QIL TLNFSD   QV++ K  N T KLNL+HRDKL HVH   HRH FNER
Subjt:  MFLKQPSS-LLSFLHFLFFSAAAVA-ARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHG--HRHGFNER

Query:  IKRDAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAP
        IKRDAIRVATLVRRLSH   AV D+ YK        ++SG E G+     R GV     +    +                    + P   P   S  A 
Subjt:  IKRDAIRVATLVRRLSHGGGAVLDDKYKEWKRGVENILSGSELGA-----RRGVTDATNNPTRFL--------------------TPPTPPPLPVSLVAP

Query:  PS---------------------------------ATALRT---PAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRG
         S                                   AL T     V + DV + C          AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSR 
Subjt:  PS---------------------------------ATALRT---PAVTLDDVDMRC---------RAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRG

Query:  TGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGV
        TGSTGTLEFGRGA+PVGATWISLIRNPRAPSFYYIGLAG+GVGGVRV +PEETFQL+E+GTNGVVMDTGTAVTRLP  AY AFRD+FTAQT+NLPRA GV
Subjt:  TGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGV

Query:  SIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
        SIFDTC+DLNGFES+RVPTVSFYFSDGPVLTLPAKNFLIPVNG GTFCLAFAPS SGLSIIGNIQQEGIQIS DGANGFVGFGPN+C
Subjt:  SIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC

SwissProt top hits (e value, % identity, alignment)
Q766C2 Aspartic proteinase nepenthesin-2 (e value: 2.4e-32; % identity: 42.22)
Query:  AGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGA--LPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVV
        AGL+G+G G +S   QLG    G FSYC+ S G+ S  TL  G  A  +P G+   +LI +   P++YYI L GI VGG  + +P  TFQL + GT G++
Subjt:  AGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGA--LPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVV

Query:  MDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRA-PGVSIFDTCYDL-NGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAF-APSRSGLSIIG
        +D+GT +T LP  AY A   +FT Q  NLP      S   TC+   +   +V+VP +S  F DG VL L  +N LI     G  CLA  + S+ G+SI G
Subjt:  MDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRA-PGVSIFDTCYDL-NGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAF-APSRSGLSIIG

Query:  NIQQEGIQISFDGANGFVGFGPNIC
        NIQQ+  Q+ +D  N  V F P  C
Subjt:  NIQQEGIQISFDGANGFVGFGPNIC

Q8S9J6 Aspartyl protease family protein At5g10770 (e value: 6.0e-36; % identity: 41.44)
Query:  AGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMD
        AGLLGLG   +SF  Q        FSYCL S     TG L FG   +     +  +       SFY + +  I VGG ++ +P      T F T G ++D
Subjt:  AGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMD

Query:  TGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSR--SGLSIIGNIQ
        +GT +TRLP  AY A R SF A+ S  P   GVSI DTC+DL+GF++V +P V+F FS G V+ L +K  +  V      CLAFA +   S  +I GN+Q
Subjt:  TGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSR--SGLSIIGNIQ

Query:  QEGIQISFDGANGFVGFGPNIC
        Q+ +++ +DGA G VGF PN C
Subjt:  QEGIQISFDGANGFVGFGPNIC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2 (e value: 1.7e-99; % identity: 78.28)
Query:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM
        AAGLLG+GGGSMSF+GQL GQTGGAF YCLVSRGT STG+L FGR ALPVGA+W+ L+RNPRAPSFYY+GL G+GVGGVR+ +P+  F LTE G  GVVM
Subjt:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM

Query:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQ
        DTGTAVTRLP  AYVAFRD F +QT+NLPRA GVSIFDTCYDL+GF SVRVPTVSFYF++GPVLTLPA+NFL+PV+  GT+C AFA S +GLSIIGNIQQ
Subjt:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQ

Query:  EGIQISFDGANGFVGFGPNIC
        EGIQ+SFDGANGFVGFGPN+C
Subjt:  EGIQISFDGANGFVGFGPNIC

Q9LNJ3 Aspartyl protease family protein 2 (e value: 2.3e-51; % identity: 49.33)
Query:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS-TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRV-SVPEETFQLTEFGTNGV
        AAGLLGLG G +SF GQ G +    FSYCLV R   S   ++ FG  A+   A +  L+ NP+  +FYY+GL GI VGG RV  V    F+L + G  GV
Subjt:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS-TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRV-SVPEETFQLTEFGTNGV

Query:  VMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNI
        ++D+GT+VTRL   AY+A RD+F      L RAP  S+FDTC+DL+    V+VPTV  +F  G  ++LPA N+LIPV+  G FC AFA +  GLSIIGNI
Subjt:  VMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNI

Query:  QQEGIQISFDGANGFVGFGPNIC
        QQ+G ++ +D A+  VGF P  C
Subjt:  QQEGIQISFDGANGFVGFGPNIC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1 (e value: 3.2e-53; % identity: 47.75)
Query:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM
        AAGLLGLGGG +S   Q+      +FSYCLV R +G + +L+F    L  G     L+RN +  +FYY+GL+G  VGG +V +P+  F +   G+ GV++
Subjt:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM

Query:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPR-APGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQ
        D GTAVTRL   AY + RD+F   T NL + +  +S+FDTCYD +   +V+VPTV+F+F+ G  L LPAKN+LIPV+  GTFC AFAP+ S LSIIGN+Q
Subjt:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPR-APGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQ

Query:  QEGIQISFDGANGFVGFGPNIC
        Q+G +I++D +   +G   N C
Subjt:  QEGIQISFDGANGFVGFGPNIC

Arabidopsis top hits (e value, % identity, alignment)
AT1G01300.1 Eukaryotic aspartyl protease family protein (e value: 1.6e-52; % identity: 49.33)
Query:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS-TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRV-SVPEETFQLTEFGTNGV
        AAGLLGLG G +SF GQ G +    FSYCLV R   S   ++ FG  A+   A +  L+ NP+  +FYY+GL GI VGG RV  V    F+L + G  GV
Subjt:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS-TGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRV-SVPEETFQLTEFGTNGV

Query:  VMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNI
        ++D+GT+VTRL   AY+A RD+F      L RAP  S+FDTC+DL+    V+VPTV  +F  G  ++LPA N+LIPV+  G FC AFA +  GLSIIGNI
Subjt:  VMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNI

Query:  QQEGIQISFDGANGFVGFGPNIC
        QQ+G ++ +D A+  VGF P  C
Subjt:  QQEGIQISFDGANGFVGFGPNIC

AT1G25510.1 Eukaryotic aspartyl protease family protein (e value: 1.7e-57; % identity: 49.77)
Query:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM
        AAGLLGLGGG ++   QL      +FSYCLV R + S  T++FG    P  A    L+RN +  +FYY+GL GI VGG  + +P+ +F++ E G+ G+++
Subjt:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM

Query:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQ
        D+GTAVTRL    Y + RDSF   T +L +A GV++FDTCY+L+   +V VPTV+F+F  G +L LPAKN++IPV+  GTFCLAFAP+ S L+IIGN+QQ
Subjt:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQ

Query:  EGIQISFDGANGFVGFGPNIC
        +G +++FD AN  +GF  N C
Subjt:  EGIQISFDGANGFVGFGPNIC

AT3G18490.1 Eukaryotic aspartyl protease family protein (e value: 2.3e-54; % identity: 47.75)
Query:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM
        AAGLLGLGGG +S   Q+      +FSYCLV R +G + +L+F    L  G     L+RN +  +FYY+GL+G  VGG +V +P+  F +   G+ GV++
Subjt:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM

Query:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPR-APGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQ
        D GTAVTRL   AY + RD+F   T NL + +  +S+FDTCYD +   +V+VPTV+F+F+ G  L LPAKN+LIPV+  GTFC AFAP+ S LSIIGN+Q
Subjt:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPR-APGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQ

Query:  QEGIQISFDGANGFVGFGPNIC
        Q+G +I++D +   +G   N C
Subjt:  QEGIQISFDGANGFVGFGPNIC

AT3G20015.1 Eukaryotic aspartyl protease family protein (e value: 1.2e-100; % identity: 78.28)
Query:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM
        AAGLLG+GGGSMSF+GQL GQTGGAF YCLVSRGT STG+L FGR ALPVGA+W+ L+RNPRAPSFYY+GL G+GVGGVR+ +P+  F LTE G  GVVM
Subjt:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVM

Query:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQ
        DTGTAVTRLP  AYVAFRD F +QT+NLPRA GVSIFDTCYDL+GF SVRVPTVSFYF++GPVLTLPA+NFL+PV+  GT+C AFA S +GLSIIGNIQQ
Subjt:  DTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQ

Query:  EGIQISFDGANGFVGFGPNIC
        EGIQ+SFDGANGFVGFGPN+C
Subjt:  EGIQISFDGANGFVGFGPNIC

AT3G61820.1 Eukaryotic aspartyl protease family protein (e value: 5.5e-53; % identity: 49.78)
Query:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSR-GTGST----GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRV-SVPEETFQLTEFG
        AAGLLGLG G +SF  Q   +  G FSYCLV R  +GS+     T+ FG  A+P  + +  L+ NP+  +FYY+ L GI VGG RV  V E  F+L   G
Subjt:  AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSR-GTGST----GTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRV-SVPEETFQLTEFG

Query:  TNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSI
          GV++D+GT+VTRL   AYVA RD+F    + L RAP  S+FDTC+DL+G  +V+VPTV F+F  G V +LPA N+LIPVN  G FC AFA +   LSI
Subjt:  TNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSI

Query:  IGNIQQEGIQISFDGANGFVGFGPNIC
        IGNIQQ+G ++++D     VGF    C
Subjt:  IGNIQQEGIQISFDGANGFVGFGPNIC


Sequences
CDS sequence
ATGTTCCTTAAACAACCCTCTTCTCTTCTCTCCTTTCTCCATTTTCTCTTCTTCTCTGCCGCCGCGGTCGCCGCCCGTCCATCTCCTCTGACTAAGTTCCAATACCTTAA
TGTTAAAGCTACCAAATTGGACTTTAATGACGGTCAGATTCTTCATACCCTTAATTTCTCCGACGGCCCCCGGCAAGTGTCCAGTCACAAATCCGACAATAATACATTTA
AACTCAACCTTGTCCACCGGGATAAACTGTCCCACGTCCATGGCCATCGTCATGGCTTCAATGAACGCATCAAAAGAGACGCCATCAGAGTCGCCACCCTCGTCCGCCGC
CTATCTCACGGCGGCGGCGCCGTGCTAGATGACAAATACAAGGAATGGAAGAGGGGAGTGGAGAATATTTTGTCCGGATCGGAGTTGGGAGCCCGCCGAGGAGTCACCGA
TGCTACCAACAATCCGACCCGGTTTTTGACCCCGCCGACTCCTCCTCCTTTGCCGGTGTCTCTTGTGGCTCCGCCGTCTGCGACCGCCTTGAGAACACCGGCTGTAACGC
TGGACGATGTCGATATGAGGTGTCGGGCCGCCGGGTTACTCGGACTCGGTGGAGGCTCAATGTCATTCATCGGCCAACTCGGTGGTCAAACCGGCGGCGCATTCAGCTAC
TGTTTAGTGAGCCGAGGCACTGGTTCGACCGGCACATTGGAATTCGGTCGTGGAGCATTGCCAGTTGGCGCCACGTGGATCTCCCTAATTCGAAACCCACGCGCCCCAAG
CTTCTACTATATCGGACTTGCTGGCATCGGTGTCGGCGGCGTTCGAGTTTCGGTACCGGAAGAAACTTTCCAACTCACCGAGTTCGGGACAAACGGCGTCGTAATGGACA
CTGGCACCGCCGTCACGCGTCTACCAATGACAGCTTACGTGGCGTTCCGTGATTCGTTCACAGCCCAAACTAGCAACCTCCCACGAGCGCCTGGTGTTTCGATCTTCGAC
ACGTGTTACGATCTCAACGGGTTCGAGTCCGTACGAGTGCCAACGGTGTCGTTTTACTTCTCTGATGGACCGGTTCTGACGTTGCCAGCGAAGAACTTTTTGATTCCGGT
GAACGGCGGCGGAACTTTTTGCCTGGCTTTTGCTCCGTCGCGGTCGGGACTTTCTATTATCGGAAACATCCAGCAGGAAGGAATTCAAATTTCATTCGATGGGGCTAATG
GGTTCGTGGGATTCGGCCCAAATATTTGCTAA
mRNA sequence
ATGTTCCTTAAACAACCCTCTTCTCTTCTCTCCTTTCTCCATTTTCTCTTCTTCTCTGCCGCCGCGGTCGCCGCCCGTCCATCTCCTCTGACTAAGTTCCAATACCTTAA
TGTTAAAGCTACCAAATTGGACTTTAATGACGGTCAGATTCTTCATACCCTTAATTTCTCCGACGGCCCCCGGCAAGTGTCCAGTCACAAATCCGACAATAATACATTTA
AACTCAACCTTGTCCACCGGGATAAACTGTCCCACGTCCATGGCCATCGTCATGGCTTCAATGAACGCATCAAAAGAGACGCCATCAGAGTCGCCACCCTCGTCCGCCGC
CTATCTCACGGCGGCGGCGCCGTGCTAGATGACAAATACAAGGAATGGAAGAGGGGAGTGGAGAATATTTTGTCCGGATCGGAGTTGGGAGCCCGCCGAGGAGTCACCGA
TGCTACCAACAATCCGACCCGGTTTTTGACCCCGCCGACTCCTCCTCCTTTGCCGGTGTCTCTTGTGGCTCCGCCGTCTGCGACCGCCTTGAGAACACCGGCTGTAACGC
TGGACGATGTCGATATGAGGTGTCGGGCCGCCGGGTTACTCGGACTCGGTGGAGGCTCAATGTCATTCATCGGCCAACTCGGTGGTCAAACCGGCGGCGCATTCAGCTAC
TGTTTAGTGAGCCGAGGCACTGGTTCGACCGGCACATTGGAATTCGGTCGTGGAGCATTGCCAGTTGGCGCCACGTGGATCTCCCTAATTCGAAACCCACGCGCCCCAAG
CTTCTACTATATCGGACTTGCTGGCATCGGTGTCGGCGGCGTTCGAGTTTCGGTACCGGAAGAAACTTTCCAACTCACCGAGTTCGGGACAAACGGCGTCGTAATGGACA
CTGGCACCGCCGTCACGCGTCTACCAATGACAGCTTACGTGGCGTTCCGTGATTCGTTCACAGCCCAAACTAGCAACCTCCCACGAGCGCCTGGTGTTTCGATCTTCGAC
ACGTGTTACGATCTCAACGGGTTCGAGTCCGTACGAGTGCCAACGGTGTCGTTTTACTTCTCTGATGGACCGGTTCTGACGTTGCCAGCGAAGAACTTTTTGATTCCGGT
GAACGGCGGCGGAACTTTTTGCCTGGCTTTTGCTCCGTCGCGGTCGGGACTTTCTATTATCGGAAACATCCAGCAGGAAGGAATTCAAATTTCATTCGATGGGGCTAATG
GGTTCGTGGGATTCGGCCCAAATATTTGCTAA
Protein sequence
MFLKQPSSLLSFLHFLFFSAAAVAARPSPLTKFQYLNVKATKLDFNDGQILHTLNFSDGPRQVSSHKSDNNTFKLNLVHRDKLSHVHGHRHGFNERIKRDAIRVATLVRR
LSHGGGAVLDDKYKEWKRGVENILSGSELGARRGVTDATNNPTRFLTPPTPPPLPVSLVAPPSATALRTPAVTLDDVDMRCRAAGLLGLGGGSMSFIGQLGGQTGGAFSY
CLVSRGTGSTGTLEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEFGTNGVVMDTGTAVTRLPMTAYVAFRDSFTAQTSNLPRAPGVSIFD
TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVNGGGTFCLAFAPSRSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC
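
The protein sequence corresponds to the CDS read codon by codon. As a small sketch of that relationship, the Python below translates nucleotides with the standard genetic code (NCBI translation table 1, which is assumed here to be the appropriate table for this nuclear gene). The function name `translate` is hypothetical; the two test strings are the first 18 and last 12 bases of the CDS shown above.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by first, second, then third base over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore an incomplete trailing codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


print(translate("ATGTTCCTTAAACAACCC"))  # start of the CDS
print(translate("AATATTTGCTAA"))        # end of the CDS, including the stop codon
```

The two calls give `MFLKQP` and `NIC*`, which agree with the first and last residues of the protein sequence listed above.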