; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10016512 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10016512
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionaspartic proteinase-like protein 1
Genome locationChr03:5562831..5566352
RNA-Seq ExpressionHG10016512
SyntenyHG10016512
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143563.2 aspartic proteinase-like protein 1 isoform X1 [Cucumis sativus]1.6e-29394.75Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL+LL+LM I VHQ VSITFTSRILHRFSEEMKALR SGSTNTSVR SWPEKGSMEYYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLWVPC+CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDYITENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIV+EFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGY+MVFDRDNLKLGWS ANCQDLSNEKKMPL PAKETPPNPLPA+EQQS  GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PSAA PCFIPS FYSIRLPHLLLL L LV SCV
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

XP_008440641.1 PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Cucumis melo]8.2e-29394.57Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL++L+LM I VHQ VSITFTSRILHRFSEEMKALRVS STNTSVR SWPEKGSMEYYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLW+PC+CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDYITENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGY+MVFDRDNLKLGWS ANCQDLSNEKKMPL PAKETPPNPLPANEQQS  GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PS-AAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PS AA PCFIPS FYSIRLP+LLLL L LV SCV
Subjt:  PS-AAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

XP_022950779.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita moschata]1.0e-28792.68Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLILL+LM I VHQ VSITFTSR+LHRFSE+MKALRVSGST T VRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPS SSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDY TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKE LV NSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        D KYE YIVGVEACCI NSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLN +S+VSFKGYPWKYCYKIS DAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCF+ILPADGDIGILGQNYMTGY+MVFDRDNLKLGWSRANCQDLSN+K+MP+APAKETPPNPLPANEQQS PGGHAVAPA+AGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PSAA PC +PSSFYSIRLPHL+LLVL LV +CV
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

XP_022978453.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita maxima]8.8e-28792.5Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLILL+LM I VHQ VSITFTSR+LHRFS++MKA RVSGST T VRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPS SSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDY TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKE LV NSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        D KYE YIVGVEACCI NSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLN +S+VSFKGYPWKYCYKIS DAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCF+ILPADGDIGILGQNYMTGY+MVFDRDNLKLGWSRANCQDLSN+K+MP+APAKETPPNPLPANEQQS PGGHAVAPA+AGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PSAA PC IPSSFYSIRLPHL+LLVL LV +CV
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

XP_038882807.1 aspartic proteinase-like protein 1 isoform X1 [Benincasa hispida]1.4e-29795.68Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSL+NLILL+LM I VHQ VSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLS GC NSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGD+GPASQQMTSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        DGKYETY+VGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVP+VTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFP+YGD+GLAGFCFAILPADGDIGILGQNYMTGY+MVFDRD+LKLGWSRANC DLSNEKKMPLAPAKETPPNPLPANEQQS PGGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PSAAVPCFI SSFYSIRLPHLLLLV YLV SCV
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

TrEMBL top hitse value%identityAlignment
A0A1S3B270 aspartic proteinase-like protein 1 isoform X14.0e-29394.57Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL++L+LM I VHQ VSITFTSRILHRFSEEMKALRVS STNTSVR SWPEKGSMEYYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLW+PC+CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDYITENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGY+MVFDRDNLKLGWS ANCQDLSNEKKMPL PAKETPPNPLPANEQQS  GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PS-AAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PS AA PCFIPS FYSIRLP+LLLL L LV SCV
Subjt:  PS-AAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

A0A5D3CLH5 Aspartic proteinase-like protein 1 isoform X14.0e-29394.57Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL++L+LM I VHQ VSITFTSRILHRFSEEMKALRVS STNTSVR SWPEKGSMEYYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLW+PC+CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDYITENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNT+S+VSFKGYPWKYCYKISADAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGY+MVFDRDNLKLGWS ANCQDLSNEKKMPL PAKETPPNPLPANEQQS  GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PS-AAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PS AA PCFIPS FYSIRLP+LLLL L LV SCV
Subjt:  PS-AAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

A0A6J1GFS3 aspartic proteinase-like protein 1 isoform X15.0e-28892.68Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLILL+LM I VHQ VSITFTSR+LHRFSE+MKALRVSGST T VRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPS SSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDY TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKE LV NSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        D KYE YIVGVEACCI NSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLN +S+VSFKGYPWKYCYKIS DAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCF+ILPADGDIGILGQNYMTGY+MVFDRDNLKLGWSRANCQDLSN+K+MP+APAKETPPNPLPANEQQS PGGHAVAPA+AGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PSAA PC +PSSFYSIRLPHL+LLVL LV +CV
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

A0A6J1HE55 aspartic proteinase-like protein 1 isoform X17.2e-28791.56Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL+L++LM I  HQ +SI FTSRILHRFSEEMKALRVS STNTSVR SWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHY WIDIGTPSVSFLVALDAGSDLLW+PCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDY+TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPV+LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKE LVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFV L
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        DGKYE YIVGVEACCI NSCL+QTSFKALIDSGTSFTYLPEE YEN+VMEFDKRLNT+S+V+FKGYPWKYCYKISADAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFA+LP DGDIGILGQNYMTGY+MVFDR+NLKL WSRANCQDLSNEKKMPLAP+KETPPNPLPANEQQS   GHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PSAA PCFIPS FY++RL HLLLLV YLV +CV
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

A0A6J1IU36 aspartic proteinase-like protein 1 isoform X14.2e-28792.5Show/hide
Query:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLILL+LM I VHQ VSITFTSR+LHRFS++MKA RVSGST T VRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFG
Subjt:  MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPS SSTSKHISCSHNLC+SGQSCQSPKQSCPYVIDY TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
        LLIQD+LHLSSGCENSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKE LV NSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL
Subjt:  LLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPL

Query:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP
        D KYE YIVGVEACCI NSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLN +S+VSFKGYPWKYCYKIS DAMPKVPSVTLLFP NNSFVVHDP
Subjt:  DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCF+ILPADGDIGILGQNYMTGY+MVFDRDNLKLGWSRANCQDLSN+K+MP+APAKETPPNPLPANEQQS PGGHAVAPA+AGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV
        PSAA PC IPSSFYSIRLPHL+LLVL LV +CV
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV

SwissProt top hitse value%identityAlignment
Q4V3D2 Aspartic proteinase 364.3e-2627.76Show/hide
Query:  LHYTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCE---SGQSCQSPKQSCPYVIDYITENTS
        L++T I +G+P   + V +D GSD+LWV C  C +C P+       L   L+ Y   +SSTSK++ C  + C      ++C   K+ C Y + Y  + ++
Subjt:  LHYTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCE---SGQSCQSPKQSCPYVIDYITENTS

Query:  SSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGY-LSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCF-NEDGSGRIFFGDEGPASQQMT
        S G  I+D + L     N     +   V+ GCG  QSG    +  A DG+ G G    S++S LA     +  FS C  N +G G    G+      + T
Subjt:  SSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGY-LSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCF-NEDGSGRIFFGDEGPASQQMT

Query:  SFVPLDGKYETYIVG--VEACCIE-NSCLKQTSFK--ALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFP
          VP    Y   + G  V+   I+    L  T+     +IDSGT+  YLP+  Y +++ +   +      +  + +    C+  +++     P V L F 
Subjt:  SFVPLDGKYETYIVG--VEACCIE-NSCLKQTSFK--ALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFP

Query:  QNNSFVV--HDPVFPIYGDQGLAGFCF-----AILPADG-DIGILGQNYMTGYQMVFDRDNLKLGWSRANC
         +    V  HD +F +  D     +CF      +   DG D+ +LG   ++   +V+D +N  +GW+  NC
Subjt:  QNNSFVV--HDPVFPIYGDQGLAGFCF-----AILPADG-DIGILGQNYMTGYQMVFDRDNLKLGWSRANC

Q8VYV9 Aspartyl protease family protein 11.9e-7436.87Show/hide
Query:  HRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFP-SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW
        HRFS+++  +              P + S +YY+ +   D   +  +L +  Q L   S+G++T+ + +  G+LHY  + +GTPS  F+VALD GSDL W
Subjt:  HRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFP-SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW

Query:  VPCDCIQCA-PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVIL
        +PCDC  C   L A   G    DLN Y P++SSTS  + C+  LC  G  C SP+  CPY I Y++  TSS+G+L++D+LHL S   + S+  I A V  
Subjt:  VPCDCIQCA-PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVIL

Query:  GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK
        GCG  Q+G +  G AP+GLFGLGL +ISV S LAKE +  NSFS+CF  DG+GRI FGD+G   Q+ T  + +   + TY + V    +  +      F 
Subjt:  GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK

Query:  ALIDSGTSFTYLPEEAYENIVMEF-----DKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPAD
        A+ DSGTSFTYL + AY  I   F     DKR  T+ S      P++YCY +S +    + P+V L     +S+ V+ P+  +   +    +C AI+  +
Subjt:  ALIDSGTSFTYLPEEAYENIVMEF-----DKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPAD

Query:  GDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQST--PGGHAVAPAVAGRAPSKPSAAVPCFIPSSFYSIRL
         DI I+GQN+MTGY++VFDR+ L LGW  ++C               ET    LP+N   S+  P   +  P        +P+ +      S+ YS+ +
Subjt:  GDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQST--PGGHAVAPAVAGRAPSKPSAAVPCFIPSSFYSIRL

Q9LEW3 Aspartyl protease AED11.5e-2328.04Show/hide
Query:  PSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSC
        P++   T+  GN     +   I IGTP     +  D GSDL W      QC P   S Y   +   N   PSSSST +++SCS  +CE  +SC +   +C
Subjt:  PSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSC

Query:  PYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLC---FNEDGSGRI
         Y I Y  + + + G L ++   L       +N  +   V  GCG + + G   GVA  GL GLG G++S+ +         N FS C   F  + +G +
Subjt:  PYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLC---FNEDGSGRI

Query:  FFGDEGPASQQMTSFVPLDGKYETYIVGVEACCI----ENSCLKQTSFK---ALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKIS
         FG  G    +   F P+      +  G++   I    +   +   SF    A+IDSGT FT LP + Y  +   F +++++  S S  G  +  CY  +
Subjt:  FFGDEGPASQQMTSFVPLDGKYETYIVGVEACCI----ENSCLKQTSFK---ALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKIS

Query:  ADAMPKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANC
           +  V   T+ F    S VV      I     ++  C A    D    I G    T   +V+D    ++G++   C
Subjt:  ADAMPKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANC

Q9LX20 Aspartic proteinase-like protein 13.2e-15955.11Show/hide
Query:  ILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTW
        +L  ++ +   + ++  F+SR++HRFS+E +A   + S++ S+    P K S+EYY+ L   DF+RQ+M LG++ Q L PSEGSKTI+ GNDFGWLHYTW
Subjt:  ILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTW

Query:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD
        IDIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YY SL  KDLNEY PSSSSTSK   CSH LC+S   C+SPK+ CPY ++Y++ NTSSSGLL++D
Subjt:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD

Query:  MLHLSSGCEN---SSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLD-
        +LHL+    N   + +  ++A V++GCG KQSG YL GVAPDGL GLG  EISV S L+K  L++NSFSLCF+E+ SGRI+FGD GP+ QQ T F+ LD 
Subjt:  MLHLSSGCEN---SSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLD-

Query:  GKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPV
         KY  YIVGVEACCI NSCLKQTSF   IDSG SFTYLPEE Y  + +E D+ +N +S  +F+G  W+YCY+ SA+  PKVP++ L F  NN+FV+H P+
Subjt:  GKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPV

Query:  FPIYGDQGLAGFCFAILPADGD-IGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        F     QGL  FC  I P+  + IG +GQNYM GY+MVFDR+N+KLGWS + CQ+   E     +P   + PNPLP +EQQS  GGHAV+PA+AG+ PSK
Subjt:  FPIYGDQGLAGFCFAILPADGD-IGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYL
          ++   +  SS   +RL + LLL+ +L
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYL

Q9S9K4 Aspartic proteinase 393.5e-2827.12Show/hide
Query:  STNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSAS
        S N   +A     G  +  +   S D +R    L S   +  P  G   +    D   L++T I +G+P   + V +D GSD+LW+ C  C +C   +  
Subjt:  STNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSAS

Query:  YYGSLDKDLNEYRPSSSSTSKHISCSHNLC---ESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLS
           +L+  L+ +  ++SSTSK + C  + C       SCQ P   C Y I Y  E+T S G  I+DML L     +     +   V+ GCG  QSG   +
Subjt:  YYGSLDKDLNEYRPSSSSTSKHISCSHNLC---ESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLS

Query:  G-VAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCF-NEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVE----ACCIENSCLKQTSFKALIDSG
        G  A DG+ G G    SVLS LA     +  FS C  N  G G    G       + T  VP    Y   ++G++    +  +  S ++      ++DSG
Subjt:  G-VAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCF-NEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVE----ACCIENSCLKQTSFKALIDSG

Query:  TSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVV--HDPVFPIYGDQGLAGFCFAILPAD--GDIGILGQ
        T+  Y P+  Y++++     R      +  + +    C+  S +     P V+  F  +    V  HD +F +  +    G+    L  D   ++ +LG 
Subjt:  TSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVV--HDPVFPIYGDQGLAGFCFAILPAD--GDIGILGQ

Query:  NYMTGYQMVFDRDNLKLGWSRANC
          ++   +V+D DN  +GW+  NC
Subjt:  NYMTGYQMVFDRDNLKLGWSRANC

Arabidopsis top hitse value%identityAlignment
AT2G17760.1 Eukaryotic aspartyl protease family protein1.3e-7536.87Show/hide
Query:  HRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFP-SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW
        HRFS+++  +              P + S +YY+ +   D   +  +L +  Q L   S+G++T+ + +  G+LHY  + +GTPS  F+VALD GSDL W
Subjt:  HRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFP-SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW

Query:  VPCDCIQCA-PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVIL
        +PCDC  C   L A   G    DLN Y P++SSTS  + C+  LC  G  C SP+  CPY I Y++  TSS+G+L++D+LHL S   + S+  I A V  
Subjt:  VPCDCIQCA-PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVIL

Query:  GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK
        GCG  Q+G +  G AP+GLFGLGL +ISV S LAKE +  NSFS+CF  DG+GRI FGD+G   Q+ T  + +   + TY + V    +  +      F 
Subjt:  GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK

Query:  ALIDSGTSFTYLPEEAYENIVMEF-----DKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPAD
        A+ DSGTSFTYL + AY  I   F     DKR  T+ S      P++YCY +S +    + P+V L     +S+ V+ P+  +   +    +C AI+  +
Subjt:  ALIDSGTSFTYLPEEAYENIVMEF-----DKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPAD

Query:  GDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQST--PGGHAVAPAVAGRAPSKPSAAVPCFIPSSFYSIRL
         DI I+GQN+MTGY++VFDR+ L LGW  ++C               ET    LP+N   S+  P   +  P        +P+ +      S+ YS+ +
Subjt:  GDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQST--PGGHAVAPAVAGRAPSKPSAAVPCFIPSSFYSIRL

AT3G51330.1 Eukaryotic aspartyl protease family protein6.4e-7035.54Show/hide
Query:  FTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQ---LLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL
        F+  + H FS+ +K        +  +    PEKGS+EY++ L   D   +   L S  +   + F   G++TI++ +  G+LHY  + +GTP+  FLVAL
Subjt:  FTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQ---LLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL

Query:  DAGSDLLWVPCDCIQCAPLSASYYG-SLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNC
        D GSDL W+PC+C           G S  + LN Y P++SSTS  I CS + C     C SP  SCPY I Y++++T ++G L +D+LHL +  E+    
Subjt:  DAGSDLLWVPCDCIQCAPLSASYYG-SLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNC

Query:  MIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNE--DGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIE
         ++A + LGCG  Q+G   S  A +GL GLGL + SV S LAK ++  NSFS+CF    D  GRI FGD+G   Q  T  +P +    TY V V    + 
Subjt:  MIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNE--DGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIE

Query:  NSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKV-PSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFA
           +      AL D+GTSFT+L E  Y  I   FD  +           P+++CY +S +    + P V + F   +   + +P+F ++ +   A +C  
Subjt:  NSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKV-PSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFA

Query:  ILPA-DGDIGILGQNYMTGYQMVFDRDNLKLGWSRANC-QDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAP
        IL + D  I I+GQN+M+GY++VFDR+ + LGW R++C +D S E   P  P  E P      +   STP    + P  A   P
Subjt:  ILPA-DGDIGILGQNYMTGYQMVFDRDNLKLGWSRANC-QDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAP

AT3G51350.1 Eukaryotic aspartyl protease family protein2.1e-6033.71Show/hide
Query:  PEKGSMEYYQELVSGD-FQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDC-IQCAPLSASYYGSLDKDL
        PE+GS+EY++ L   D   R +    +  +     +G          G L+Y  + +GTP  SFLVALD GSDL W+PC+C   C              L
Subjt:  PEKGSMEYYQELVSGD-FQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDC-IQCAPLSASYYGSLDKDL

Query:  NEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGL
        N Y P++S+TS  I CS   C   + C SP   CPY I Y + +T + G L+QD+LHL++  EN +   ++A V LGCG KQ+G +    + +G+ GLG+
Subjt:  NEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGL

Query:  GEISVLSSLAKEELVQNSFSLCFNE--DGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVM
           SV S LAK  +  NSFS+CF       GRI FGD G   Q+ T F+ +      Y V +    +    +    F A  D+G+SFT+L E AY  +  
Subjt:  GEISVLSSLAKEELVQNSFSLCFNE--DGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVM

Query:  EFDKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPADG-DIGILGQNYMTGYQMVFDRDNLKLG
         FD+ +           P+++CY +S +A   + P V + F   +  ++++P F     +G   +C  +L + G  I ++GQN++ GY++VFDR+ + LG
Subjt:  EFDKRLNTSSSVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPADG-DIGILGQNYMTGYQMVFDRDNLKLG

Query:  WSRANC-QDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAP
        W ++ C +D S E   P  P  E P   + A   +S P   +  P
Subjt:  WSRANC-QDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAP

AT4G35880.1 Eukaryotic aspartyl protease family protein4.9e-8639.34Show/hide
Query:  LLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLG-----SRFQLLFPSEGSKTIALGNDFGWL
        +LML+  G   G   TF   + HRFS+E+K      S +T   A +P KGS EY+  LV  D+  +  +L      S   L F S+G+ T  + +  G+L
Subjt:  LLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLG-----SRFQLLFPSEGSKTIALGNDFGWL

Query:  HYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLL
        HYT + +GTP + F+VALD GSDL WVPCDC +CAP   + Y S + +L+ Y P  S+T+K ++C+++LC     C     +CPY++ Y++  TS+SG+L
Subjt:  HYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLL

Query:  IQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDG
        ++D++HL++  +N     ++A V  GCG  QSG +L   AP+GLFGLG+ +ISV S LA+E LV +SFS+CF  DG GRI FGD+G + Q+ T F  L+ 
Subjt:  IQDMLHLSSGCENSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDG

Query:  KYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPK-VPSVTLLFPQNNSFVVHDPV
         +  Y + V    +  + L    F AL D+GTSFTYL +  Y  +   F  +            P++YCY +S DA    +PS++L    N+ F ++DP+
Subjt:  KYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPK-VPSVTLLFPQNNSFVVHDPV

Query:  FPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDL
          +   +G   +C AI+ +  ++ I+GQNYMTGY++VFDR+ L L W + +C D+
Subjt:  FPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFDRDNLKLGWSRANCQDL

AT5G10080.1 Eukaryotic aspartyl protease family protein2.3e-16055.11Show/hide
Query:  ILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTW
        +L  ++ +   + ++  F+SR++HRFS+E +A   + S++ S+    P K S+EYY+ L   DF+RQ+M LG++ Q L PSEGSKTI+ GNDFGWLHYTW
Subjt:  ILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTW

Query:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD
        IDIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YY SL  KDLNEY PSSSSTSK   CSH LC+S   C+SPK+ CPY ++Y++ NTSSSGLL++D
Subjt:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD

Query:  MLHLSSGCEN---SSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLD-
        +LHL+    N   + +  ++A V++GCG KQSG YL GVAPDGL GLG  EISV S L+K  L++NSFSLCF+E+ SGRI+FGD GP+ QQ T F+ LD 
Subjt:  MLHLSSGCEN---SSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLD-

Query:  GKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPV
         KY  YIVGVEACCI NSCLKQTSF   IDSG SFTYLPEE Y  + +E D+ +N +S  +F+G  W+YCY+ SA+  PKVP++ L F  NN+FV+H P+
Subjt:  GKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPV

Query:  FPIYGDQGLAGFCFAILPADGD-IGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK
        F     QGL  FC  I P+  + IG +GQNYM GY+MVFDR+N+KLGWS + CQ+   E     +P   + PNPLP +EQQS  GGHAV+PA+AG+ PSK
Subjt:  FPIYGDQGLAGFCFAILPADGD-IGILGQNYMTGYQMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSK

Query:  PSAAVPCFIPSSFYSIRLPHLLLLVLYL
          ++   +  SS   +RL + LLL+ +L
Subjt:  PSAAVPCFIPSSFYSIRLPHLLLLVLYL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCTTCGGAATCTGATTTTGTTGATGCTGATGGGGATTGGCGTTCACCAGGGAGTGTCGATTACGTTCACATCGAGGATACTTCACAGGTTCTCTGAGGAAATGAA
AGCGCTTAGGGTTTCAGGGAGTACGAATACGAGTGTTCGAGCATCATGGCCTGAGAAGGGGAGCATGGAGTATTATCAGGAGCTTGTGAGTGGTGACTTCCAGAGGCAGA
AGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGTAAAACCATTGCGCTGGGAAATGACTTTGGCTGGTTGCATTACACCTGGATCGATATCGGG
ACACCGAGTGTTTCATTTCTGGTTGCATTGGATGCTGGAAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCT
GGATAAAGATCTCAATGAATATCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCAGCCATAATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGT
CATGTCCTTATGTTATTGACTACATTACTGAAAATACTTCAAGTTCAGGACTACTAATTCAGGATATGTTGCATCTTTCATCCGGTTGTGAGAATTCATCTAATTGTATG
ATTCAGGCTCCGGTCATTTTAGGGTGTGGTATGAAGCAAAGTGGTGGTTATCTAAGTGGAGTCGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCT
TAGTTCCCTTGCGAAAGAAGAATTGGTGCAGAACTCTTTCTCGCTGTGTTTTAATGAGGATGGATCTGGCAGAATTTTTTTTGGGGACGAGGGACCAGCAAGTCAACAAA
TGACTTCATTTGTGCCGTTAGATGGGAAATATGAAACCTACATTGTCGGGGTGGAAGCATGTTGTATTGAGAATTCGTGCCTCAAGCAGACAAGTTTTAAAGCATTGATA
GATAGTGGAACGTCATTTACGTATCTTCCAGAGGAAGCATATGAAAATATTGTGATGGAGTTTGATAAAAGGTTAAACACTTCAAGCTCCGTCTCCTTTAAAGGATATCC
GTGGAAGTATTGCTATAAGATCAGTGCAGACGCAATGCCAAAGGTTCCATCTGTGACATTGTTGTTCCCACAAAACAATAGCTTTGTGGTTCATGATCCCGTGTTCCCTA
TCTATGGCGATCAGGGTTTAGCTGGATTTTGTTTTGCTATACTACCTGCTGATGGAGATATCGGAATACTGGGACAAAATTACATGACTGGATACCAGATGGTATTCGAT
AGGGATAATTTGAAGTTGGGTTGGTCACGTGCAAATTGTCAAGATCTCAGTAACGAAAAGAAAATGCCTCTTGCTCCTGCAAAAGAGACGCCGCCAAACCCATTACCAGC
CAATGAGCAGCAGAGCACTCCAGGGGGGCACGCGGTGGCTCCTGCCGTAGCTGGAAGGGCCCCCTCTAAACCATCAGCTGCTGTCCCTTGCTTCATCCCATCAAGCTTTT
ATTCGATCAGATTGCCGCACCTGCTTCTTCTGGTACTCTACCTTGTTTGTTCTTGTGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGCTTCGGAATCTGATTTTGTTGATGCTGATGGGGATTGGCGTTCACCAGGGAGTGTCGATTACGTTCACATCGAGGATACTTCACAGGTTCTCTGAGGAAATGAA
AGCGCTTAGGGTTTCAGGGAGTACGAATACGAGTGTTCGAGCATCATGGCCTGAGAAGGGGAGCATGGAGTATTATCAGGAGCTTGTGAGTGGTGACTTCCAGAGGCAGA
AGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGTAAAACCATTGCGCTGGGAAATGACTTTGGCTGGTTGCATTACACCTGGATCGATATCGGG
ACACCGAGTGTTTCATTTCTGGTTGCATTGGATGCTGGAAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCT
GGATAAAGATCTCAATGAATATCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCAGCCATAATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGT
CATGTCCTTATGTTATTGACTACATTACTGAAAATACTTCAAGTTCAGGACTACTAATTCAGGATATGTTGCATCTTTCATCCGGTTGTGAGAATTCATCTAATTGTATG
ATTCAGGCTCCGGTCATTTTAGGGTGTGGTATGAAGCAAAGTGGTGGTTATCTAAGTGGAGTCGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCT
TAGTTCCCTTGCGAAAGAAGAATTGGTGCAGAACTCTTTCTCGCTGTGTTTTAATGAGGATGGATCTGGCAGAATTTTTTTTGGGGACGAGGGACCAGCAAGTCAACAAA
TGACTTCATTTGTGCCGTTAGATGGGAAATATGAAACCTACATTGTCGGGGTGGAAGCATGTTGTATTGAGAATTCGTGCCTCAAGCAGACAAGTTTTAAAGCATTGATA
GATAGTGGAACGTCATTTACGTATCTTCCAGAGGAAGCATATGAAAATATTGTGATGGAGTTTGATAAAAGGTTAAACACTTCAAGCTCCGTCTCCTTTAAAGGATATCC
GTGGAAGTATTGCTATAAGATCAGTGCAGACGCAATGCCAAAGGTTCCATCTGTGACATTGTTGTTCCCACAAAACAATAGCTTTGTGGTTCATGATCCCGTGTTCCCTA
TCTATGGCGATCAGGGTTTAGCTGGATTTTGTTTTGCTATACTACCTGCTGATGGAGATATCGGAATACTGGGACAAAATTACATGACTGGATACCAGATGGTATTCGAT
AGGGATAATTTGAAGTTGGGTTGGTCACGTGCAAATTGTCAAGATCTCAGTAACGAAAAGAAAATGCCTCTTGCTCCTGCAAAAGAGACGCCGCCAAACCCATTACCAGC
CAATGAGCAGCAGAGCACTCCAGGGGGGCACGCGGTGGCTCCTGCCGTAGCTGGAAGGGCCCCCTCTAAACCATCAGCTGCTGTCCCTTGCTTCATCCCATCAAGCTTTT
ATTCGATCAGATTGCCGCACCTGCTTCTTCTGGTACTCTACCTTGTTTGTTCTTGTGTGTGA
Protein sequenceShow/hide protein sequence
MSLRNLILLMLMGIGVHQGVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIG
TPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDMLHLSSGCENSSNCM
IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALI
DSGTSFTYLPEEAYENIVMEFDKRLNTSSSVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYQMVFD
RDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSTPGGHAVAPAVAGRAPSKPSAAVPCFIPSSFYSIRLPHLLLLVLYLVCSCV