; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009702 (gene) of Snake gourd v1 genome

Gene IDTan0009702
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionaspartic proteinase-like protein 1
Genome locationLG03:74333360..74337635
RNA-Seq ExpressionTan0009702
SyntenyTan0009702
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594861.1 Aspartic proteinase-like protein 1, partial [Cucurbita argyrosperma subsp. sororia]2.0e-28693.51Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL+L+L+MMIS HQA+SI FTSRILHRFSEEMKALRVS+STN SV+VSWPEKGSMEYYQ+L+SGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHY WIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISC HNLCESGQSCQSPKQSCPYVIDY+TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPV+LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYEAYIVGVEACCIGNSCL++TSFKALIDSGTSFTYLPEEVYEN+VMEFDKRLNTTSTV+FKGYPWKYCYKISADAMPKVPSV LLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILP DGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAP+KETPPNPLPANEQQSV GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLL
        PSAA PCF+PS FY++RL HLLLL
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLL

XP_022963037.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita moschata]3.2e-28992.87Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL+L+L+MMIS HQA+SI FTSRILHRFSEEMKALRVS+STN SV+VSWPEKGSMEYYQ+L+SGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHY WIDIGTPSVSFLVALDAGSDLLW+PCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISC HNLCESGQSCQSPKQSCPYVIDY+TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPV+LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYEAYIVGVEACCIGNSCL++TSFKALIDSGTSFTYLPEEVYEN+VMEFDKRLNTTSTV+FKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFA+LP DGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAP+KETPPNPLPANEQQSV  GHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV
        PSAA PCF+PS FY++RL HLLLLV YLV TCV
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV

XP_023003199.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita maxima]1.6e-28893.05Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL+L+L+MMIS HQA+SI FTSRILHRFSEEMKALRVS+STN SV+VSWPEKGSMEYYQ+L+SGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHY WIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKH+SC HNLCESGQSCQSPKQSCPYVIDY+TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPV+LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYEAYIVGVEACCIGNSCL++TSFKALIDSGTSFTYLPEEVYEN+VMEFDKRLNTTSTV+FKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILP DGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAP+KETPPN LPANEQQSV GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTC
        PSAA PCF+PS FY++RL HLLLLVLYL  TC
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTC

XP_023517669.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-28893.06Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL+L+L+MMIS HQA+SI FTSRILHRFSEEMKALRVS+STN SV+VSWPEKGSMEYYQ+L+SGDFQRQKMKLGS+FQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHY WIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISC HNLC+SGQSCQSPKQSCPYVIDY+TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPV+LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYEAYIVGVEACCIGNSCL++TSFKALIDSGTSFTYLPEEVYEN+VMEFDKRLNTTSTV+FKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILP DGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAP+KETPPNPLPANEQQSV GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV
        PSAA P F+PS FY++RL HLLLLVLYLV TCV
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV

XP_038882807.1 aspartic proteinase-like protein 1 isoform X1 [Benincasa hispida]6.1e-28893.06Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSL+NLILLL+M+I+VHQAVSITFTSRILHRFSEEMKALRVS STN SV+ SWPEKGSMEYYQ+L+SGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISC HNLCESGQSCQSPKQSCPYVIDYITENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLS GCGNSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKE LVQNSFSLCFNEDGSGRIFFGD+G ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYE Y+VGVEACCI NSCLK+TSFKALIDSGTSFTYLPEE YENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVP+VTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFP+YGD+GLAGFCFAILPADGDIGILGQNYMTGYRMVFDR++LKL WSRANC DLSNEKKMPLAPAKETPPNPLPANEQQS PGGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV
        PSAA PCF+ SSFYSIRLPHLLLLV YLV +CV
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV

TrEMBL top hitse value%identityAlignment
A0A1S3B270 aspartic proteinase-like protein 1 isoform X18.9e-28592.51Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL++LL+M+I VHQAVSITFTSRILHRFSEEMKALRVS STN SV+VSWPEKGSMEYYQ+L+SGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLW+PC+CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISC HNLC+SGQSCQSPKQSCPYVIDYITENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKE LVQNSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYE YIVGVEACCI NSCLK+TSFKALIDSGTSFTYLPEE YENIVMEFDKRLNTTS VSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDR+NLKL WS ANCQDLSNEKKMPL PAKETPPNPLPANEQQS  GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAA-PCFMPSSFYSIRLPHLLLLVLYLVFTCV
        PSAAA PCF+PS FYSIRLP+LLLL L LV +CV
Subjt:  PSAAA-PCFMPSSFYSIRLPHLLLLVLYLVFTCV

A0A5D3CLH5 Aspartic proteinase-like protein 1 isoform X18.9e-28592.51Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL++LL+M+I VHQAVSITFTSRILHRFSEEMKALRVS STN SV+VSWPEKGSMEYYQ+L+SGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLW+PC+CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISC HNLC+SGQSCQSPKQSCPYVIDYITENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKE LVQNSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYE YIVGVEACCI NSCLK+TSFKALIDSGTSFTYLPEE YENIVMEFDKRLNTTS VSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDR+NLKL WS ANCQDLSNEKKMPL PAKETPPNPLPANEQQS  GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAA-PCFMPSSFYSIRLPHLLLLVLYLVFTCV
        PSAAA PCF+PS FYSIRLP+LLLL L LV +CV
Subjt:  PSAAA-PCFMPSSFYSIRLPHLLLLVLYLVFTCV

A0A6J1GFS3 aspartic proteinase-like protein 1 isoform X11.2e-28492.12Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNLILLL+M+I+VHQAVSITFTSR+LHRFSE+MKALRVS ST   V+ SWPEKGSMEYYQ+L+SGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPS SSTSKHISC HNLC+SGQSCQSPKQSCPYVIDY TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLV NSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        D KYEAYIVGVEACCIGNSCLK+TSFKALIDSGTSFTYLPEE YENIVMEFDKRLN TSTVSFKGYPWKYCYKIS DAMPKVPSVTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCF+ILPADGDIGILGQNYMTGYRMVFDR+NLKL WSRANCQDLSN+K+MP+APAKETPPNPLPANEQQS PGGHAVAPA+AGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV
        PSAAAPC MPSSFYSIRLPHL+LLVL LV TCV
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV

A0A6J1HE55 aspartic proteinase-like protein 1 isoform X11.6e-28992.87Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL+L+L+MMIS HQA+SI FTSRILHRFSEEMKALRVS+STN SV+VSWPEKGSMEYYQ+L+SGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHY WIDIGTPSVSFLVALDAGSDLLW+PCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISC HNLCESGQSCQSPKQSCPYVIDY+TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPV+LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYEAYIVGVEACCIGNSCL++TSFKALIDSGTSFTYLPEEVYEN+VMEFDKRLNTTSTV+FKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFA+LP DGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAP+KETPPNPLPANEQQSV  GHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV
        PSAA PCF+PS FY++RL HLLLLV YLV TCV
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV

A0A6J1KSL9 aspartic proteinase-like protein 1 isoform X17.7e-28993.05Show/hide
Query:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
        MSLRNL+L+L+MMIS HQA+SI FTSRILHRFSEEMKALRVS+STN SV+VSWPEKGSMEYYQ+L+SGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG
Subjt:  MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFG

Query:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        WLHY WIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKH+SC HNLCESGQSCQSPKQSCPYVIDY+TENTSSSG
Subjt:  WLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL
        LLIQDVLHLSSGC NSSNC IQAPV+LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEG ASQQ+TSFV L
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSL

Query:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
        DGKYEAYIVGVEACCIGNSCL++TSFKALIDSGTSFTYLPEEVYEN+VMEFDKRLNTTSTV+FKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP
Subjt:  DGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDP

Query:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        VFPIYGDQGLAGFCFAILP DGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAP+KETPPN LPANEQQSV GGHAVAPAVAGRAPSK
Subjt:  VFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTC
        PSAA PCF+PS FY++RL HLLLLVLYL  TC
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTC

SwissProt top hitse value%identityAlignment
Q4V3D2 Aspartic proteinase 363.0e-2727.88Show/hide
Query:  LHYTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCE---SGQSCQSPKQSCPYVIDYITENTS
        L++T I +G+P   + V +D GSD+LWV C  C +C P+       L   L+ Y   +SSTSK++ C  + C      ++C   K+ C Y + Y  + ++
Subjt:  LHYTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCE---SGQSCQSPKQSCPYVIDYITENTS

Query:  SSGLLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGY-LSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCF-NEDGSGRIFFGDEGSASQQVT
        S G  I+D + L    GN     +   V+ GCG  QSG    +  A DG+ G G    S++S LA  G  +  FS C  N +G G    G+  S   + T
Subjt:  SSGLLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGY-LSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCF-NEDGSGRIFFGDEGSASQQVT

Query:  SFVSLDGKYEAYIVGVEACCIGNSCLKKTSFKA-------LIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLL
          V     Y   + G++    G+      S  +       +IDSGT+  YLP+ +Y +++ +   +      +  + +    C+  +++     P V L 
Subjt:  SFVSLDGKYEAYIVGVEACCIGNSCLKKTSFKA-------LIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLL

Query:  F--PLNNSFVVHDPVFPIYGDQGLAGFCF-----AILPADG-DIGILGQNYMTGYRMVFDRENLKLSWSRANC
        F   L  S   HD +F +  D     +CF      +   DG D+ +LG   ++   +V+D EN  + W+  NC
Subjt:  F--PLNNSFVVHDPVFPIYGDQGLAGFCF-----AILPADG-DIGILGQNYMTGYRMVFDRENLKLSWSRANC

Q8VYV9 Aspartyl protease family protein 12.9e-7537.83Show/hide
Query:  HRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFP-SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW
        HRFS+++  +              P + S +YY+ +   D   +  +L +  Q L   S+G++T+ + +  G+LHY  + +GTPS  F+VALD GSDL W
Subjt:  HRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFP-SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW

Query:  VPCDCIQCA-PLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCKIQAPVIL
        +PCDC  C   L A   G    DLN Y P++SSTS  + C   LC  G  C SP+  CPY I Y++  TSS+G+L++DVLHL S   + S+  I A V  
Subjt:  VPCDCIQCA-PLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCKIQAPVIL

Query:  GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKTSFK
        GCG  Q+G +  G AP+GLFGLGL +ISV S LAKEG+  NSFS+CF  DG+GRI FGD+GS  Q+ T  +++   +  Y + V    +G +      F 
Subjt:  GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKTSFK

Query:  ALIDSGTSFTYLPEEVYENIVMEF-----DKRLNTTSTVSFKGYPWKYCYKISADAMP-KVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPAD
        A+ DSGTSFTYL +  Y  I   F     DKR  TT +      P++YCY +S +    + P+V L     +S+ V+ P+  +   +    +C AI+  +
Subjt:  ALIDSGTSFTYLPEEVYENIVMEF-----DKRLNTTSTVSFKGYPWKYCYKISADAMP-KVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPAD

Query:  GDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSKPSAAAPCFMPSSFYSIRL
         DI I+GQN+MTGYR+VFDRE L L W  ++C               ET    LP+N   S       A +    A + PS        S+ YS+ +
Subjt:  GDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSKPSAAAPCFMPSSFYSIRL

Q9LEW3 Aspartyl protease AED15.8e-2328.65Show/hide
Query:  PSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSC
        P++   T+  GN     +   I IGTP     +  D GSDL W      QC P   S Y   +   N   PSSSST +++SC   +CE  +SC +   +C
Subjt:  PSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSC

Query:  PYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLC---FNEDGSGRI
         Y I Y  + + + G L ++   L       +N  +   V  GCG + + G   GVA  GL GLG G++S+ +         N FS C   F  + +G +
Subjt:  PYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLC---FNEDGSGRI

Query:  FFGDEG-SASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKT--SFK---ALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISA
         FG  G S S + T   S    +  Y + +    +G+  L  T  SF    A+IDSGT FT LP +VY  +   F +++++  + S  G  +  CY  + 
Subjt:  FFGDEG-SASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKT--SFK---ALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISA

Query:  DAMPKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANC
          +  V   T+ F    S VV      I     ++  C A    D    I G    T   +V+D    ++ ++   C
Subjt:  DAMPKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANC

Q9LX20 Aspartic proteinase-like protein 18.5e-16055.3Show/hide
Query:  ILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTW
        +L  ++ ++  + ++  F+SR++HRFS+E +A   + S++ S+    P K S+EYY+ L   DF+RQ+M LG++ Q L PSEGSKTI+ GNDFGWLHYTW
Subjt:  ILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTW

Query:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD
        IDIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YY SL  KDLNEY PSSSSTSK   C H LC+S   C+SPK+ CPY ++Y++ NTSSSGLL++D
Subjt:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD

Query:  VLHLSSGCGN---SSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLD-
        +LHL+    N   + +  ++A V++GCG KQSG YL GVAPDGL GLG  EISV S L+K GL++NSFSLCF+E+ SGRI+FGD G + QQ T F+ LD 
Subjt:  VLHLSSGCGN---SSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLD-

Query:  GKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDPV
         KY  YIVGVEACCIGNSCLK+TSF   IDSG SFTYLPEE+Y  + +E D+ +N TS  +F+G  W+YCY+ SA+  PKVP++ L F  NN+FV+H P+
Subjt:  GKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDPV

Query:  FPIYGDQGLAGFCFAILPADGD-IGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        F     QGL  FC  I P+  + IG +GQNYM GYRMVFDREN+KL WS + CQ+   E     +P   + PNPLP +EQQS  GGHAV+PA+AG+ PSK
Subjt:  FPIYGDQGLAGFCFAILPADGD-IGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYL
          +++  +  SS   +RL + LLL+ +L
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYL

Q9S9K4 Aspartic proteinase 397.3e-2625.65Show/hide
Query:  RNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLH
        R L +++ + + V +  S  F  +  H+F+ + K                    ++E+++   S D +R    L S   +  P  G   +    D   L+
Subjt:  RNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLH

Query:  YTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLC---ESGQSCQSPKQSCPYVIDYITENTSSS
        +T I +G+P   + V +D GSD+LW+ C  C +C   +     +L+  L+ +  ++SSTSK + C  + C       SCQ P   C Y I Y  E+T S 
Subjt:  YTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLC---ESGQSCQSPKQSCPYVIDYITENTSSS

Query:  GLLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSG-VAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCF-NEDGSGRIFFGDEGSASQQVTSF
        G  I+D+L L    G+     +   V+ GCG  QSG   +G  A DG+ G G    SVLS LA  G  +  FS C  N  G G    G   S   + T  
Subjt:  GLLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSG-VAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCF-NEDGSGRIFFGDEGSASQQVTSF

Query:  VSLDGKYEAYIVGVE----ACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNN
        V     Y   ++G++    +  +  S ++      ++DSGT+  Y P+ +Y++++     R      +  + +    C+  S +     P V+  F  + 
Subjt:  VSLDGKYEAYIVGVE----ACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNN

Query:  SFVV--HDPVFPIYGDQGLAGFCFAILPAD--GDIGILGQNYMTGYRMVFDRENLKLSWSRANC
           V  HD +F +  +    G+    L  D   ++ +LG   ++   +V+D +N  + W+  NC
Subjt:  SFVV--HDPVFPIYGDQGLAGFCFAILPAD--GDIGILGQNYMTGYRMVFDRENLKLSWSRANC

Arabidopsis top hitse value%identityAlignment
AT2G17760.1 Eukaryotic aspartyl protease family protein2.1e-7637.83Show/hide
Query:  HRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFP-SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW
        HRFS+++  +              P + S +YY+ +   D   +  +L +  Q L   S+G++T+ + +  G+LHY  + +GTPS  F+VALD GSDL W
Subjt:  HRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFP-SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW

Query:  VPCDCIQCA-PLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCKIQAPVIL
        +PCDC  C   L A   G    DLN Y P++SSTS  + C   LC  G  C SP+  CPY I Y++  TSS+G+L++DVLHL S   + S+  I A V  
Subjt:  VPCDCIQCA-PLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCKIQAPVIL

Query:  GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKTSFK
        GCG  Q+G +  G AP+GLFGLGL +ISV S LAKEG+  NSFS+CF  DG+GRI FGD+GS  Q+ T  +++   +  Y + V    +G +      F 
Subjt:  GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKTSFK

Query:  ALIDSGTSFTYLPEEVYENIVMEF-----DKRLNTTSTVSFKGYPWKYCYKISADAMP-KVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPAD
        A+ DSGTSFTYL +  Y  I   F     DKR  TT +      P++YCY +S +    + P+V L     +S+ V+ P+  +   +    +C AI+  +
Subjt:  ALIDSGTSFTYLPEEVYENIVMEF-----DKRLNTTSTVSFKGYPWKYCYKISADAMP-KVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPAD

Query:  GDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSKPSAAAPCFMPSSFYSIRL
         DI I+GQN+MTGYR+VFDRE L L W  ++C               ET    LP+N   S       A +    A + PS        S+ YS+ +
Subjt:  GDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSKPSAAAPCFMPSSFYSIRL

AT3G51330.1 Eukaryotic aspartyl protease family protein4.3e-6636.01Show/hide
Query:  LILLLIMMISVHQA-VSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQ---LLFPSEGSKTIALGNDFGW
        L+ LL++   + +   S  F+  + H FS+     RV +S      V  PEKGS+EY++ L   D   +   L S  +   + F   G++TI++ +  G+
Subjt:  LILLLIMMISVHQA-VSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQ---LLFPSEGSKTIALGNDFGW

Query:  LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYG-SLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG
        LHY  + +GTP+  FLVALD GSDL W+PC+C           G S  + LN Y P++SSTS  I C  + C     C SP  SCPY I Y++++T ++G
Subjt:  LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYG-SLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSG

Query:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNE--DGSGRIFFGDEGSASQQVTSFV
         L +DVLHL +   +     ++A + LGCG  Q+G   S  A +GL GLGL + SV S LAK  +  NSFS+CF    D  GRI FGD+G   Q  T  +
Subjt:  LLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNE--DGSGRIFFGDEGSASQQVTSFV

Query:  SLDGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKV-PSVTLLFPLNNSFVV
          +     Y V V    +G   +      AL D+GTSFT+L E  Y  I   FD  +           P+++CY +S +    + P V + F   +   +
Subjt:  SLDGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKV-PSVTLLFPLNNSFVV

Query:  HDPVFPIYGDQGLAGFCFAILPA-DGDIGILGQNYMTGYRMVFDRENLKLSWSRANC-QDLSNEKKMPLAPAKETP----PNPLPA
         +P+F ++ +   A +C  IL + D  I I+GQN+M+GYR+VFDRE + L W R++C +D S E   P  P  E P      PLP+
Subjt:  HDPVFPIYGDQGLAGFCFAILPA-DGDIGILGQNYMTGYRMVFDRENLKLSWSRANC-QDLSNEKKMPLAPAKETP----PNPLPA

AT3G51350.1 Eukaryotic aspartyl protease family protein6.0e-6033.93Show/hide
Query:  PEKGSMEYYQQLISGD-FQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDC-IQCAPLSASYYGSLDKDL
        PE+GS+EY++ L   D   R +    +  +     +G          G L+Y  + +GTP  SFLVALD GSDL W+PC+C   C              L
Subjt:  PEKGSMEYYQQLISGD-FQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDC-IQCAPLSASYYGSLDKDL

Query:  NEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGL
        N Y P++S+TS  I C    C   + C SP   CPY I Y + +T + G L+QDVLHL++   N +   ++A V LGCG KQ+G +    + +G+ GLG+
Subjt:  NEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGL

Query:  GEISVLSSLAKEGLVQNSFSLCFNE--DGSGRIFFGDEGSASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVM
           SV S LAK  +  NSFS+CF       GRI FGD G   Q+ T F+S+     AY V +    +    +    F A  D+G+SFT+L E  Y  +  
Subjt:  GEISVLSSLAKEGLVQNSFSLCFNE--DGSGRIFFGDEGSASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVM

Query:  EFDKRLNTTSTVSFKGYPWKYCYKISADAMP-KVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADG-DIGILGQNYMTGYRMVFDRENLKLS
         FD+ +           P+++CY +S +A   + P V + F   +  ++++P F     +G   +C  +L + G  I ++GQN++ GYR+VFDRE + L 
Subjt:  EFDKRLNTTSTVSFKGYPWKYCYKISADAMP-KVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADG-DIGILGQNYMTGYRMVFDRENLKLS

Query:  WSRANC-QDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAP
        W ++ C +D S E   P  P  E P   + A   +S+P   +  P
Subjt:  WSRANC-QDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAP

AT4G35880.1 Eukaryotic aspartyl protease family protein5.8e-8740.13Show/hide
Query:  ILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLG-----SRFQLLFPSEGSKTIALGNDFGW
        ++ ++M++S        FT  + HRFS+E+K  + S ST +  +  +P KGS EY+  L+  D+  +  +L      S   L F S+G+ T  + +  G+
Subjt:  ILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLG-----SRFQLLFPSEGSKTIALGNDFGW

Query:  LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGL
        LHYT + +GTP + F+VALD GSDL WVPCDC +CAP   + Y S + +L+ Y P  S+T+K ++C ++LC     C     +CPY++ Y++  TS+SG+
Subjt:  LHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGL

Query:  LIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLD
        L++DV+HL++   N    +++A V  GCG  QSG +L   AP+GLFGLG+ +ISV S LA+EGLV +SFS+CF  DG GRI FGD+GS+ Q+ T F +L+
Subjt:  LIQDVLHLSSGCGNSSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLD

Query:  GKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEF-----DKRLNTTSTVSFKGYPWKYCYKISADAMPK-VPSVTLLFPLNNSF
          +  Y + V    +G + L    F AL D+GTSFTYL + +Y  +   F     DKR +  S +     P++YCY +S DA    +PS++L    N+ F
Subjt:  GKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEF-----DKRLNTTSTVSFKGYPWKYCYKISADAMPK-VPSVTLLFPLNNSF

Query:  VVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDL
         ++DP+  +   +G   +C AI+ +  ++ I+GQNYMTGYR+VFDRE L L+W + +C D+
Subjt:  VVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRENLKLSWSRANCQDL

AT5G10080.1 Eukaryotic aspartyl protease family protein6.1e-16155.3Show/hide
Query:  ILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTW
        +L  ++ ++  + ++  F+SR++HRFS+E +A   + S++ S+    P K S+EYY+ L   DF+RQ+M LG++ Q L PSEGSKTI+ GNDFGWLHYTW
Subjt:  ILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTW

Query:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD
        IDIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YY SL  KDLNEY PSSSSTSK   C H LC+S   C+SPK+ CPY ++Y++ NTSSSGLL++D
Subjt:  IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSL-DKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD

Query:  VLHLSSGCGN---SSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLD-
        +LHL+    N   + +  ++A V++GCG KQSG YL GVAPDGL GLG  EISV S L+K GL++NSFSLCF+E+ SGRI+FGD G + QQ T F+ LD 
Subjt:  VLHLSSGCGN---SSNCKIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLD-

Query:  GKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDPV
         KY  YIVGVEACCIGNSCLK+TSF   IDSG SFTYLPEE+Y  + +E D+ +N TS  +F+G  W+YCY+ SA+  PKVP++ L F  NN+FV+H P+
Subjt:  GKYEAYIVGVEACCIGNSCLKKTSFKALIDSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDPV

Query:  FPIYGDQGLAGFCFAILPADGD-IGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK
        F     QGL  FC  I P+  + IG +GQNYM GYRMVFDREN+KL WS + CQ+   E     +P   + PNPLP +EQQS  GGHAV+PA+AG+ PSK
Subjt:  FPIYGDQGLAGFCFAILPADGD-IGILGQNYMTGYRMVFDRENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSK

Query:  PSAAAPCFMPSSFYSIRLPHLLLLVLYL
          +++  +  SS   +RL + LLL+ +L
Subjt:  PSAAAPCFMPSSFYSIRLPHLLLLVLYL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCTTCGGAATCTGATTTTGTTGCTGATAATGATGATTTCCGTTCACCAGGCGGTGTCGATTACGTTCACATCGAGGATACTTCACCGGTTCTCTGAGGAGATGAA
GGCGCTTAGGGTTTCAAAGAGTACGAATAAGAGTGTACAAGTCTCGTGGCCTGAGAAGGGGAGCATGGAGTATTATCAGCAGCTTATTAGTGGTGACTTCCAGAGGCAGA
AGATGAAGCTTGGCTCTCGGTTTCAGTTACTTTTCCCGTCTGAAGGCAGCAAAACCATTGCGCTGGGGAATGACTTTGGCTGGTTGCATTACACCTGGATCGATATCGGG
ACACCAAGTGTTTCATTTCTGGTTGCGTTGGATGCTGGGAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCT
GGATAAAGATCTCAATGAATACCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCGGCCATAATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGT
CATGTCCTTATGTTATTGACTACATAACTGAAAATACTTCAAGCTCAGGATTACTAATTCAGGATGTATTGCATCTTTCATCCGGTTGTGGGAATTCATCTAACTGTAAA
ATTCAGGCTCCGGTCATTTTAGGATGTGGTATGAAACAAAGTGGCGGTTATCTAAGTGGAGTTGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCT
TAGTTCCCTTGCTAAAGAAGGATTGGTGCAGAACTCTTTCTCGCTGTGTTTTAATGAGGATGGATCTGGCAGAATCTTTTTCGGGGATGAGGGATCAGCAAGTCAACAAG
TGACTTCATTTGTGTCGTTAGACGGGAAATATGAAGCTTACATCGTCGGGGTGGAAGCATGTTGTATTGGGAATTCATGCCTCAAGAAGACAAGTTTTAAAGCATTGATT
GATAGTGGAACCTCGTTTACGTACCTCCCAGAGGAAGTATATGAAAATATTGTGATGGAGTTCGATAAAAGGTTAAACACTACTAGCACCGTCTCCTTTAAAGGTTATCC
TTGGAAGTATTGCTATAAGATCAGTGCAGACGCAATGCCAAAGGTTCCATCTGTGACGTTGTTGTTCCCACTAAACAATAGCTTTGTGGTTCATGATCCTGTTTTTCCTA
TCTATGGCGATCAGGGGTTAGCTGGATTTTGTTTTGCTATACTACCTGCTGATGGAGATATCGGAATTCTGGGACAAAATTACATGACTGGATACCGGATGGTATTTGAC
AGGGAAAACTTGAAGTTGAGTTGGTCACGTGCAAATTGTCAAGATCTCAGTAACGAAAAGAAAATGCCACTTGCACCTGCAAAAGAAACGCCGCCAAACCCATTACCAGC
CAATGAGCAGCAGAGCGTTCCAGGGGGGCATGCTGTGGCTCCTGCCGTAGCTGGGAGGGCCCCTTCTAAACCATCAGCTGCCGCCCCTTGCTTCATGCCATCCAGCTTTT
ATTCGATCAGATTGCCGCACCTGCTTCTTCTGGTACTCTACCTTGTTTTTACTTGCGTGTGA
mRNA sequenceShow/hide mRNA sequence
ACTTCTTCTTTACATTTCACTTCACTTCTTCTTTCCAACTTCCGGCTGAACACGTCGGCCTTTTTCTGTGATCTCTCTTCCATAATTTTAATTTCGTTTCGTAATCTCGA
AATCTTTCTGGTTCTGTTTCTCATTTTGGTCCCCGCCGGAAATTGAGTCTCCGAATTATTGGCAGTTTCTTCTGAAGTTTCAGATCTCGTCTTCGAGCTACTTCGTTCGT
CCCTTTGAGGACTGACTTGCAATGTCGCTTCGGAATCTGATTTTGTTGCTGATAATGATGATTTCCGTTCACCAGGCGGTGTCGATTACGTTCACATCGAGGATACTTCA
CCGGTTCTCTGAGGAGATGAAGGCGCTTAGGGTTTCAAAGAGTACGAATAAGAGTGTACAAGTCTCGTGGCCTGAGAAGGGGAGCATGGAGTATTATCAGCAGCTTATTA
GTGGTGACTTCCAGAGGCAGAAGATGAAGCTTGGCTCTCGGTTTCAGTTACTTTTCCCGTCTGAAGGCAGCAAAACCATTGCGCTGGGGAATGACTTTGGCTGGTTGCAT
TACACCTGGATCGATATCGGGACACCAAGTGTTTCATTTCTGGTTGCGTTGGATGCTGGGAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTC
TGCAAGTTACTATGGCAGTCTGGATAAAGATCTCAATGAATACCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCGGCCATAATTTGTGCGAGTCAGGCCAAA
GCTGCCAAAGTCCGAAGCAGTCATGTCCTTATGTTATTGACTACATAACTGAAAATACTTCAAGCTCAGGATTACTAATTCAGGATGTATTGCATCTTTCATCCGGTTGT
GGGAATTCATCTAACTGTAAAATTCAGGCTCCGGTCATTTTAGGATGTGGTATGAAACAAAGTGGCGGTTATCTAAGTGGAGTTGCTCCAGATGGTCTTTTTGGATTGGG
GCTAGGAGAAATTTCTGTTCTTAGTTCCCTTGCTAAAGAAGGATTGGTGCAGAACTCTTTCTCGCTGTGTTTTAATGAGGATGGATCTGGCAGAATCTTTTTCGGGGATG
AGGGATCAGCAAGTCAACAAGTGACTTCATTTGTGTCGTTAGACGGGAAATATGAAGCTTACATCGTCGGGGTGGAAGCATGTTGTATTGGGAATTCATGCCTCAAGAAG
ACAAGTTTTAAAGCATTGATTGATAGTGGAACCTCGTTTACGTACCTCCCAGAGGAAGTATATGAAAATATTGTGATGGAGTTCGATAAAAGGTTAAACACTACTAGCAC
CGTCTCCTTTAAAGGTTATCCTTGGAAGTATTGCTATAAGATCAGTGCAGACGCAATGCCAAAGGTTCCATCTGTGACGTTGTTGTTCCCACTAAACAATAGCTTTGTGG
TTCATGATCCTGTTTTTCCTATCTATGGCGATCAGGGGTTAGCTGGATTTTGTTTTGCTATACTACCTGCTGATGGAGATATCGGAATTCTGGGACAAAATTACATGACT
GGATACCGGATGGTATTTGACAGGGAAAACTTGAAGTTGAGTTGGTCACGTGCAAATTGTCAAGATCTCAGTAACGAAAAGAAAATGCCACTTGCACCTGCAAAAGAAAC
GCCGCCAAACCCATTACCAGCCAATGAGCAGCAGAGCGTTCCAGGGGGGCATGCTGTGGCTCCTGCCGTAGCTGGGAGGGCCCCTTCTAAACCATCAGCTGCCGCCCCTT
GCTTCATGCCATCCAGCTTTTATTCGATCAGATTGCCGCACCTGCTTCTTCTGGTACTCTACCTTGTTTTTACTTGCGTGTGATGTAGATTCTGAGCTTATCTAAGTTTT
TACATGTAAATGAAGGTCGTTCCTTTACATCTTCCCCCTAGCAAGGGCATGTAGATCTTTTGTGTCCTTGCAAGTTGCATCAATGGCATTTTTCTTTTTCTTCTCTCTTT
TTTTTTTGGGCATTATACAGGCCATCCAGTGGGGGTTGATGATAGTAGAGTAGTTGCATTTTTGGTTATGTAGCTATACATTCTTCTATCTAATATGTTTTTCTTTTCTA
ATATATATTATCGTCAGTTTTACTGCTTTGTTATCTTTATTATTTATTTGTCCAAACTAAAATTGACTGTAGTCATGAGATTCTCAAGATCAAATCCCATATTTTCCCAT
CA
Protein sequenceShow/hide protein sequence
MSLRNLILLLIMMISVHQAVSITFTSRILHRFSEEMKALRVSKSTNKSVQVSWPEKGSMEYYQQLISGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIG
TPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCGHNLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCGNSSNCK
IQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEGLVQNSFSLCFNEDGSGRIFFGDEGSASQQVTSFVSLDGKYEAYIVGVEACCIGNSCLKKTSFKALI
DSGTSFTYLPEEVYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFD
RENLKLSWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSVPGGHAVAPAVAGRAPSKPSAAAPCFMPSSFYSIRLPHLLLLVLYLVFTCV