CuGenDBv2

Lag0040637 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040637
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Eukaryotic aspartyl protease family protein
Genome location: chr13:6784700..6787289
RNA-Seq Expression: Lag0040637
Synteny: Lag0040637
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
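
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the Python snippet below parses it and reports the span length. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the page itself does not state.

```python
import re


def parse_location(loc: str) -> dict:
    """Split a 'chrom:start..end' string into its components.

    Assumes 1-based, inclusive coordinates (not confirmed by the source page).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    # Under the inclusive assumption, both endpoints count toward the length.
    return {"chrom": chrom, "start": start, "end": end, "length": end - start + 1}


if __name__ == "__main__":
    print(parse_location("chr13:6784700..6787289"))
```

Under the inclusive assumption the gene span is 2590 bp. With half-open coordinates it would be one base shorter.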


Homology
GenBank top hits: e value, %identity, alignment
KAG6573756.1 Protein ASPARTIC PROTEASE IN GUARD CELL 2, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.6e-230; %identity: 84.65)
Query:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        ML KL  S LLFF + + F S  R S  +FQYL++K++KFD NDSQILHTL+F +GH+L TGRKSNHTKFKL LVHRD+L H+HG+H GFDERIKRD KR
Subjt:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS
        VATLVRRLSR    +S VQD               MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC QCYQQSDPVF+PA+S S+TGVSCGS
Subjt:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS

Query:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG
        AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG+VTVRD+AIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRG GSTG
Subjt:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT
        TLEFGRGAMPVGATWISLVRNP APSFYYIGLAGVGVGG+KVPIPEETFQL+E+G NGVVMDTGTAVTRLPTAAY AFRDSFTAQT NLPRAPGVSIFDT
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT

Query:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        CY+LNGFESVRVPTVSFYFSDGPVLTLPA NFLIPVD AGTFCFAFAPSPSGLSIIGNIQQ GIQISFDGANGFVGFGPN+C
Subjt:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

KAG7012830.1 Protein ASPARTIC PROTEASE IN GUARD CELL 2, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.8e-230; %identity: 84.85)
Query:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        ML KL  S LLFF + + F S  R SP +FQYL+VK++K D NDSQILHTL+F +GH+L TGRKSNHTKFKL LVHRD+L H+HG+H GFDERIKRD KR
Subjt:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS
        VATLVRRLSR    +S VQD               MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC QCYQQSDPVF+PA+S S+TGVSCGS
Subjt:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS

Query:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG
        AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG+VTVRD+AIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRG GSTG
Subjt:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT
        TLEFGRGAMPVGATWISLVRNP APSFYYIGLAGVGVGG+KVPIPEETFQL+E+G NGVVMDTGTAVTRLPTAAY AFRDSFTAQT NLPRAPGVSIFDT
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT

Query:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        CY+LNGFESVRVPTVSFYFSDGPVLTLPA NFLIPVD AGTFCFAFAPSPSGLSIIGNIQQ GIQISFDGANGFVGFGPN+C
Subjt:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

XP_022945579.1 protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita moschata] (e value: 2.8e-228; %identity: 84.02)
Query:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        ML KL  S LLFF + + F S  R SP +FQYL+VK++KFD NDSQILH L+F +GH+L TGRKSNHTKFKL LVHRD+L H+HG+H GFDERIKRD KR
Subjt:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS
        VATLVRRLSR    +S VQD               MEEG+GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC QCYQQSDPVF+PA+S S+TGVSCGS
Subjt:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS

Query:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG
        AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG+V + D+AIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRG GSTG
Subjt:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT
        TLEFGRGAMPVGATWISLVRNP APSFYYIGLAGVGVGG+KVPIPEETFQL+E+G NGVVMDTGTAVTRLPTAAY AFRDSFTAQT NLPRAPGVSIFDT
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT

Query:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        CY+LNGFESVRVPTVSFYFSDGPVLTLPA NFLIPVD AGTFCFAFAPSPSGLSIIGNIQQ GIQISFDGANGFVGFGPN+C
Subjt:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

XP_023542067.1 protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita pepo subsp. pepo] (e value: 5.6e-229; %identity: 84.23)
Query:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        ML KL  S LLFF + + F S  R SP +FQYL+VK++KFD +DSQIL TL+F +GH+L TGRKSNHTKFKL LVHRD++ H+HG+H  FDERIKRD KR
Subjt:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS
        VATLVRRLSR    +S+VQD               MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC QCYQQSDPVF+PA+S S+TGVSCGS
Subjt:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS

Query:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG
        AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRD+AIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRG GSTG
Subjt:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT
        TL+FGRGAMPVGATWISLVRNP APSFYYIGLAGVGVGG+KVPIPEETFQL+E+G NGVVMDTGTAVTRLPTAAY AFRDSFTAQT NLPRAPGVS+FDT
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT

Query:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        CY+LNGFESVRVPTVSFYFSDGPVLTLPA NFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQ GIQISFDGANGFVGFGPN+C
Subjt:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

XP_038893071.1 protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Benincasa hispida] (e value: 1.5e-229; %identity: 83.92)
Query:  MLFKLPSFLLFFLLFLFFS-SADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        M  K PS LLFFL FLFFS +A R SPTKFQYL+VKATK DFND QILHTL+F    +  +G KS++  FKLNL+HRDKLSHVHG  HGF+ERIKRDA R
Subjt:  MLFKLPSFLLFFLLFLFFS-SADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  VATLVRRLSRAASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVC
        VATLVRRLS    AVQD               MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC+PCS+CYQQSDPVFDPADS+SF GVSC S VC
Subjt:  VATLVRRLSRAASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVC

Query:  DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLE
        DRLEN GC+AGRCRYEVSYGDGSYTKGTLALETLT+G+V +RDVAIGCGHTNQGMFIGAAGLLGLGGGSMSF+GQLGGQTGGAFSYCLVSRGTGSTGTLE
Subjt:  DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLE

Query:  FGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTCYD
        FGRGA+PVGATWISL+RNPRAPSFYYIGLAG+GVGGV+V +PEETFQL+E+G NGVVMDTGTAVTRLPTAAY A RDSFTAQT NLPRAPGVSIFDTCYD
Subjt:  FGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTCYD

Query:  LNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        LNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDG GTFC AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
Subjt:  LNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

TrEMBL top hits: e value, %identity, alignment
A0A0A0KPR4 Peptidase A1 domain-containing protein (e value: 1.0e-220; %identity: 81.12)
Query:  MLFK-LPSFLLFFLLFLF--FSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDA
        M FK L S LLF L+ +    ++A   S TKFQYL+VKATK DFND QILH L+F DGH+  +G KS++  FKLNL+HRDKLSHVHG   GF++R+KRDA
Subjt:  MLFK-LPSFLLFFLLFLF--FSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDA

Query:  KRVATLVRRLSRAA-SAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS
         RVATLVRRLS  A +AV+D               ME GSGEYFVRIGVGSPPR+QYMVIDSGSDIVWVQC+PCS+CYQQSDPVFDPADS+SF GVSCGS
Subjt:  KRVATLVRRLSRAA-SAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS

Query:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG
         VCDRLEN GC+AGRCRYEVSYGDGSYTKGTLALETLT+GQV +RDVAIGCGHTNQGMFIGAAGLLGLGGGSMSF+GQLGGQTGGAFSYCLVSRGTGSTG
Subjt:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT
         LEFGRGA+PVGATWISL+RNPRAPSFYYIGLAG+GVGGV+V +PEETFQL+E+G NGVVMDTGTAVTR PTAAY AFRDSFTAQT NLPRAPGVSIFDT
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT

Query:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        CYDLNGFESVRVPTVSFYFSDGPVLTLPA+NFLIPVDG GTFC AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN+C
Subjt:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

A0A1S3BEG7 protein ASPARTIC PROTEASE IN GUARD CELL 2-like (e value: 6.2e-226; %identity: 82.95)
Query:  MLFK-LPSFLLFFL-LFLFFSSADRP-SPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDA
        M FK L S LLFFL + +  ++A RP SPTKFQYL+VKATK DFND QILHTL+F D H+  +G KS++  FKLNL+HRDKLSHVHG   GF++R+KRDA
Subjt:  MLFK-LPSFLLFFL-LFLFFSSADRP-SPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDA

Query:  KRVATLVRRLSRAASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSA
         RVATLVRRLS  A+AV+D               MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC+PCS+CYQQSDPVFDPADS+SF GVSCGS 
Subjt:  KRVATLVRRLSRAASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSA

Query:  VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGT
        VCDRLEN GC+AGRCRYEVSYGDGSYTKGTLALETLT+GQV +RDVAIGCGHTNQGMFIGAAGLLGLGGGSMSF+GQLGGQTGGAFSYCLVSRGTGSTGT
Subjt:  VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGT

Query:  LEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTC
        LEFGRGA+PVGATWISL+RNPRAPSFYYIGLAG+GVGGV+V IPEETFQL+EFG NGVVMDTGTAVTRLPT+AY A RDSFTAQT NLPRAPGVSIFDTC
Subjt:  LEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTC

Query:  YDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        YDLNGFESVRVPTVSFYFSDGP LTLPAKNFLIPVDG GTFC AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN+C
Subjt:  YDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

A0A5A7SVY2 Protein ASPARTIC PROTEASE IN GUARD CELL 2-like (e value: 9.6e-227; %identity: 84.8)
Query:  MLFK-LPSFLLFFL-LFLFFSSADRP-SPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDA
        M FK L S LLFFL + +  ++A RP SPTKFQYL+VKATK DFND QILHTL+F D H+  +G KS++  FKLNL+HRDKLSHVHG   GF++R+KRDA
Subjt:  MLFK-LPSFLLFFL-LFLFFSSADRP-SPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDA

Query:  KRVATLVRRLSRAASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVCDRLENAGCHAGR
         RVATLVRRLS  A+AV+D   +GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC+PCS+CYQQSDPVFDPADS+SF GVSCGS VCDRLEN GC+AGR
Subjt:  KRVATLVRRLSRAASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVCDRLENAGCHAGR

Query:  CRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGAMPVGATW
        CRYEVSYGDGSYTKGTLALETLT+GQV +RDVAIGCGHTNQGMFIGAAGLLGLGGGSMSF+GQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGA+PVGATW
Subjt:  CRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGAMPVGATW

Query:  ISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTCYDLNGFESVRVPTV
        ISL+RNPRAPSFYYIGLAG+GVGGV+V IPEETFQL+EFG NGVVMDTGTAVTRLPT+AY A RDSFTAQT NLPRAPGVSIFDTCYDLNGFESVRVPTV
Subjt:  ISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTCYDLNGFESVRVPTV

Query:  SFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        SFYFSDGP LTLPAKNFLIPVDG GTFC AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN+C
Subjt:  SFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

A0A6J1G1D0 protein ASPARTIC PROTEASE IN GUARD CELL 2-like (e value: 1.3e-228; %identity: 84.02)
Query:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        ML KL  S LLFF + + F S  R SP +FQYL+VK++KFD NDSQILH L+F +GH+L TGRKSNHTKFKL LVHRD+L H+HG+H GFDERIKRD KR
Subjt:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS
        VATLVRRLSR    +S VQD               MEEG+GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC QCYQQSDPVF+PA+S S+TGVSCGS
Subjt:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS

Query:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG
        AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG+V + D+AIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRG GSTG
Subjt:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT
        TLEFGRGAMPVGATWISLVRNP APSFYYIGLAGVGVGG+KVPIPEETFQL+E+G NGVVMDTGTAVTRLPTAAY AFRDSFTAQT NLPRAPGVSIFDT
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT

Query:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        CY+LNGFESVRVPTVSFYFSDGPVLTLPA NFLIPVD AGTFCFAFAPSPSGLSIIGNIQQ GIQISFDGANGFVGFGPN+C
Subjt:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

A0A6J1HQG0 protein ASPARTIC PROTEASE IN GUARD CELL 2-like (e value: 8.7e-228; %identity: 83.61)
Query:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        ML KL  S LLFF + +   S  R S  +FQYL+VK+ KFD NDSQ+LHTL+F +GH+L TGRKSNHTKFKL LVHRDKL H+HG+H GFDERIKRD KR
Subjt:  MLFKLP-SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS
        VATLVRRLSR     S V+D               MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC QCYQQSDPVF+PA+S S+TGVSCGS
Subjt:  VATLVRRLSR---AASAVQDR--------------MEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGS

Query:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG
        A+CDRLENAGCH+GRCRYEVSYGDGSYTKGTLALETLTIG+VTVRD+AIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRG GSTG
Subjt:  AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT
        TL+FGRGAMPVGATWISLVRNP APSFYYIGLAGVGVGG+KVPIPEETFQL+E+G NGVVMDTGTAVTRLPTAAY AFRDSFTA T NLPRAPGVSIFDT
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDT

Query:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        CY+LNGFESVRVPTVSFYFSDGPVLTLPA NFLIPVD AGTFCFAFAPSPSGLSIIGNIQQ GIQISFDGANGFVGFGPN+C
Subjt:  CYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

SwissProt top hits: e value, %identity, alignment
Q766C2 Aspartic proteinase nepenthesin-2 (e value: 3.0e-68; %identity: 39.95)
Query:  ERIKRDAKRVATLVRRLS---RAASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVCDR
        E IKR  KR    +R ++   +++S ++  +  G GEY + + +G+P  S   ++D+GSD++W QC+PC+QC+ Q  P+F+P DS+SF+ + C S  C  
Subjt:  ERIKRDAKRVATLVRRLS---RAASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVCDR

Query:  LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIG-AAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLEF
        L +  C+   C+Y   YGDGS T+G +A ET T    +V ++A GCG  NQG   G  AGL+G+G G +S   QLG    G FSYC+ S G+ S  TL  
Subjt:  LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIG-AAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLEF

Query:  GRGA--MPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRA-PGVSIFDTC
        G  A  +P G+   +L+ +   P++YYI L G+ VGG  + IP  TFQL + G  G+++D+GT +T LP  AY A   +FT Q  NLP      S   TC
Subjt:  GRGA--MPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRA-PGVSIFDTC

Query:  YDL-NGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        +   +   +V+VP +S  F DG VL L  +N LI     G  C A   S   G+SI GNIQQ+  Q+ +D  N  V F P  C
Subjt:  YDL-NGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC

Q8S9J6 Aspartyl protease family protein At5g10770 (e value: 1.1e-70; %identity: 37.7)
Query:  KSNHTKFKLNLVHR----DKLSHVHGRHHGFDERIKRDAKRVATLVRRLSRAASA------------VQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGS
        +++ TK  L++ HR     +L++         E ++ D  RV ++  +LS+  +              +D    GSG Y V +G+G+P     ++ D+GS
Subjt:  KSNHTKFKLNLVHR----DKLSHVHGRHHGFDERIKRDAKRVATLVRRLSRAASA------------VQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGS

Query:  DIVWVQCQPCSQ-CYQQSDPVFDPADSASFTGVSCGSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRD-VAIGCGHTNQG
        D+ W QCQPC + CY Q +P+F+P+ S S+  VSC SA C  L +A      C A  C Y + YGD S++ G LA E  T+    V D V  GCG  NQG
Subjt:  DIVWVQCQPCSQ-CYQQSDPVFDPADSASFTGVSCGSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRD-VAIGCGHTNQG

Query:  MFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGN
        +F G AGLLGLG   +SF  Q        FSYCL S     TG L FG   +     +  +       SFY + +  + VGG K+PIP   F        
Subjt:  MFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGN

Query:  GVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFA--PSPSGLSI
        G ++D+GT +TRLP  AY A R SF A+    P   GVSI DTC+DL+GF++V +P V+F FS G V+ L +K  +  V      C AFA     S  +I
Subjt:  GVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFA--PSPSGLSI

Query:  IGNIQQEGIQISFDGANGFVGFGPNVC
         GN+QQ+ +++ +DGA G VGF PN C
Subjt:  IGNIQQEGIQISFDGANGFVGFGPNVC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2 (e value: 8.8e-177; %identity: 65.63)
Query:  LPSFLLFFLLFLFFSSADRPSPTKFQYLD-------VKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHV--HGRHHGFDERIKR
        LP F  F  L L  SS+   S   FQ +D       V AT  DFN++       F D          + +K+ L L+HRD+   V     HH    R++R
Subjt:  LPSFLLFFLLFLFFSSADRPSPTKFQYLD-------VKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHV--HGRHHGFDERIKR

Query:  DAKRVATLVRRLSRA--------------ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCG
        D  RV+ ++RR+S                 S +   M++GSGEYFVRIGVGSPPR QYMVIDSGSD+VWVQCQPC  CY+QSDPVFDPA S S+TGVSCG
Subjt:  DAKRVATLVRRLSRA--------------ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCG

Query:  SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGST
        S+VCDR+EN+GCH+G CRYEV YGDGSYTKGTLALETLT  +  VR+VA+GCGH N+GMFIGAAGLLG+GGGSMSFVGQL GQTGGAF YCLVSRGT ST
Subjt:  SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGST

Query:  GTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFD
        G+L FGR A+PVGA+W+ LVRNPRAPSFYY+GL G+GVGGV++P+P+  F L+E G  GVVMDTGTAVTRLPTAAY AFRD F +QT NLPRA GVSIFD
Subjt:  GTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFD

Query:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        TCYDL+GF SVRVPTVSFYF++GPVLTLPA+NFL+PVD +GT+CFAFA SP+GLSIIGNIQQEGIQ+SFDGANGFVGFGPNVC
Subjt:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

Q9LNJ3 Aspartyl protease family protein 2 (e value: 1.1e-102; %identity: 45.34)
Query:  LLFFLLFLFFSSADRPSPTKFQYLDVK------ATKFDF---NDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        LLF L F F S     S   FQ L         A+   F   +DS+ L    F  G         + +   LNL H D LS        F  R++RD++R
Subjt:  LLFFLLFLFFSSADRPSPTKFQYLDVK------ATKFDF---NDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  V---ATLV-----RRLSRA------ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVC
        V   ATL      R ++ A      +S+V   + +GSGEYF R+GVG+P R  YMV+D+GSDIVW+QC PC +CY QSDP+FDP  S ++  + C S  C
Subjt:  V---ATLV-----RRLSRA------ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVC

Query:  DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGS-TG
         RL++AGC+  R  C Y+VSYGDGS+T G  + ETLT  +  V+ VA+GCGH N+G+F+GAAGLLGLG G +SF GQ G +    FSYCLV R   S   
Subjt:  DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGS-TG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVP-IPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFD
        ++ FG  A+   A +  L+ NP+  +FYY+GL G+ VGG +VP +    F+L + G  GV++D+GT+VTRL   AY A RD+F      L RAP  S+FD
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVP-IPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFD

Query:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        TC+DL+    V+VPTV  +F  G  ++LPA N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++ +D A+  VGF P  C
Subjt:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1 (e value: 1.1e-102; %identity: 46.25)
Query:  RIKRDAKRVATLVRRLSRAASAVQDRME-------------------------EGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPV
        R++RD+ RVA +V ++  A   V DR +                         +GSGEYF RIGVG+P +  Y+V+D+GSD+ W+QC+PC+ CYQQSDPV
Subjt:  RIKRDAKRVATLVRRLSRAASAVQDRME-------------------------EGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPV

Query:  FDPADSASFTGVSCGSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQV-TVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQT
        F+P  S+++  ++C +  C  LE + C + +C Y+VSYGDGS+T G LA +T+T G    + +VA+GCGH N+G+F GAAGLLGLGGG +S   Q+    
Subjt:  FDPADSASFTGVSCGSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQV-TVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQT

Query:  GGAFSYCLVSRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFT
          +FSYCLV R +G + +L+F    +  G     L+RN +  +FYY+GL+G  VGG KV +P+  F +   G  GV++D GTAVTRL T AY + RD+F 
Subjt:  GGAFSYCLVSRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFT

Query:  AQTGNLPR-APGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
          T NL + +  +S+FDTCYD +   +V+VPTV+F+F+ G  L LPAKN+LIPVD +GTFCFAFAP+ S LSIIGN+QQ+G +I++D +   +G   N C
Subjt:  AQTGNLPR-APGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

Arabidopsis top hits: e value, %identity, alignment
AT1G01300.1 Eukaryotic aspartyl protease family protein (e value: 7.8e-104; %identity: 45.34)
Query:  LLFFLLFLFFSSADRPSPTKFQYLDVK------ATKFDF---NDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR
        LLF L F F S     S   FQ L         A+   F   +DS+ L    F  G         + +   LNL H D LS        F  R++RD++R
Subjt:  LLFFLLFLFFSSADRPSPTKFQYLDVK------ATKFDF---NDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKR

Query:  V---ATLV-----RRLSRA------ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVC
        V   ATL      R ++ A      +S+V   + +GSGEYF R+GVG+P R  YMV+D+GSDIVW+QC PC +CY QSDP+FDP  S ++  + C S  C
Subjt:  V---ATLV-----RRLSRA------ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVC

Query:  DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGS-TG
         RL++AGC+  R  C Y+VSYGDGS+T G  + ETLT  +  V+ VA+GCGH N+G+F+GAAGLLGLG G +SF GQ G +    FSYCLV R   S   
Subjt:  DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGS-TG

Query:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVP-IPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFD
        ++ FG  A+   A +  L+ NP+  +FYY+GL G+ VGG +VP +    F+L + G  GV++D+GT+VTRL   AY A RD+F      L RAP  S+FD
Subjt:  TLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVP-IPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFD

Query:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        TC+DL+    V+VPTV  +F  G  ++LPA N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++ +D A+  VGF P  C
Subjt:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

AT1G25510.1 Eukaryotic aspartyl protease family protein (e value: 2.6e-107; %identity: 42.04)
Query:  PSFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNH---TKFKLNLVHRDKLSHVHGRHHGFDE-----RIKRDA
        P++  FF  F+FF ++     ++        T    N +  +H   +    +L    +  H   + F L L  R     V G  H   +     R+ RD 
Subjt:  PSFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNH---TKFKLNLVHRDKLSHVHGRHHGFDE-----RIKRDA

Query:  KRVATLVRRLSRAASAVQ-----------------------DRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSAS
         RV +L+ RL  A + +                            +GSGEYF R+G+G P R  YMV+D+GSD+ W+QC PC+ CY Q++P+F+P+ S+S
Subjt:  KRVATLVRRLSRAASAVQ-----------------------DRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSAS

Query:  FTGVSCGSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLV
        +  +SC +  C+ LE + C    C YEVSYGDGSYT G  A ETLTIG   V++VA+GCGH+N+G+F+GAAGLLGLGGG ++   QL      +FSYCLV
Subjt:  FTGVSCGSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLV

Query:  SRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRA
         R + S  T++FG    P  A    L+RN +  +FYY+GL G+ VGG  + IP+ +F++ E G  G+++D+GTAVTRL T  Y + RDSF   T +L +A
Subjt:  SRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRA

Query:  PGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
         GV++FDTCY+L+   +V VPTV+F+F  G +L LPAKN++IPVD  GTFC AFAP+ S L+IIGN+QQ+G +++FD AN  +GF  N C
Subjt:  PGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

AT3G18490.1 Eukaryotic aspartyl protease family protein (e value: 7.8e-104; %identity: 46.25)
Query:  RIKRDAKRVATLVRRLSRAASAVQDRME-------------------------EGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPV
        R++RD+ RVA +V ++  A   V DR +                         +GSGEYF RIGVG+P +  Y+V+D+GSD+ W+QC+PC+ CYQQSDPV
Subjt:  RIKRDAKRVATLVRRLSRAASAVQDRME-------------------------EGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPV

Query:  FDPADSASFTGVSCGSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQV-TVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQT
        F+P  S+++  ++C +  C  LE + C + +C Y+VSYGDGS+T G LA +T+T G    + +VA+GCGH N+G+F GAAGLLGLGGG +S   Q+    
Subjt:  FDPADSASFTGVSCGSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQV-TVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQT

Query:  GGAFSYCLVSRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFT
          +FSYCLV R +G + +L+F    +  G     L+RN +  +FYY+GL+G  VGG KV +P+  F +   G  GV++D GTAVTRL T AY + RD+F 
Subjt:  GGAFSYCLVSRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFT

Query:  AQTGNLPR-APGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
          T NL + +  +S+FDTCYD +   +V+VPTV+F+F+ G  L LPAKN+LIPVD +GTFCFAFAP+ S LSIIGN+QQ+G +I++D +   +G   N C
Subjt:  AQTGNLPR-APGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

AT3G20015.1 Eukaryotic aspartyl protease family protein (e value: 6.2e-178; %identity: 65.63)
Query:  LPSFLLFFLLFLFFSSADRPSPTKFQYLD-------VKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHV--HGRHHGFDERIKR
        LP F  F  L L  SS+   S   FQ +D       V AT  DFN++       F D          + +K+ L L+HRD+   V     HH    R++R
Subjt:  LPSFLLFFLLFLFFSSADRPSPTKFQYLD-------VKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHV--HGRHHGFDERIKR

Query:  DAKRVATLVRRLSRA--------------ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCG
        D  RV+ ++RR+S                 S +   M++GSGEYFVRIGVGSPPR QYMVIDSGSD+VWVQCQPC  CY+QSDPVFDPA S S+TGVSCG
Subjt:  DAKRVATLVRRLSRA--------------ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCG

Query:  SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGST
        S+VCDR+EN+GCH+G CRYEV YGDGSYTKGTLALETLT  +  VR+VA+GCGH N+GMFIGAAGLLG+GGGSMSFVGQL GQTGGAF YCLVSRGT ST
Subjt:  SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGST

Query:  GTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFD
        G+L FGR A+PVGA+W+ LVRNPRAPSFYY+GL G+GVGGV++P+P+  F L+E G  GVVMDTGTAVTRLPTAAY AFRD F +QT NLPRA GVSIFD
Subjt:  GTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFD

Query:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        TCYDL+GF SVRVPTVSFYF++GPVLTLPA+NFL+PVD +GT+CFAFA SP+GLSIIGNIQQEGIQ+SFDGANGFVGFGPNVC
Subjt:  TCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC

AT3G61820.1 Eukaryotic aspartyl protease family protein (e value: 1.6e-96; %identity: 43.33)
Query:  SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKS-NHTKFKLNLVHRDKLSHVHGRHHG--FDERIKRDAKRVATL
        +F +F +LF F SSA     +++Q L V       N      TLS+ +   L     S + T   ++L H D LS          F+ R++RD+ RV ++
Subjt:  SFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKS-NHTKFKLNLVHRDKLSHVHGRHHG--FDERIKRDAKRVATL

Query:  ------------VRRLSRAA----SAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVCDR
                     +R  R A     AV   + +GSGEYF+R+GVG+P  + YMV+D+GSD+VW+QC PC  CY Q+D +FDP  S +F  V CGS +C R
Subjt:  ------------VRRLSRAA----SAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVCDR

Query:  LENAG-CHAGR---CRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSR-GTGST-
        L+++  C   R   C Y+VSYGDGS+T+G  + ETLT     V  V +GCGH N+G+F+GAAGLLGLG G +SF  Q   +  G FSYCLV R  +GS+ 
Subjt:  LENAG-CHAGR---CRYEVSYGDGSYTKGTLALETLTIGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSR-GTGST-

Query:  ---GTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVP-IPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGV
            T+ FG  A+P  + +  L+ NP+  +FYY+ L G+ VGG +VP + E  F+L   G  GV++D+GT+VTRL   AY A RD+F      L RAP  
Subjt:  ---GTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVP-IPEETFQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGV

Query:  SIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
        S+FDTC+DL+G  +V+VPTV F+F  G V +LPA N+LIPV+  G FCFAFA +   LSIIGNIQQ+G ++++D     VGF    C
Subjt:  SIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC


Sequences
CDS sequence
ATGCTCTTTAAACTTCCCTCTTTTCTTCTCTTCTTTCTCCTTTTCCTCTTCTTCTCCTCCGCCGACCGACCTTCTCCGACCAAGTTTCAATACCTCGATGTCAAAGCAAC
CAAATTCGACTTTAACGATAGTCAGATTCTTCATACCCTTAGTTTCTTCGACGGTCACCAGTTAGCGACCGGTCGAAAATCCAACCATACTAAATTTAAGCTCAACCTTG
TCCATCGGGATAAGCTATCCCACGTCCACGGCCGCCACCATGGCTTCGACGAGCGTATCAAAAGAGACGCCAAACGAGTCGCCACCCTCGTTCGCCGCCTATCGCGCGCT
GCCTCCGCCGTACAAGATAGAATGGAAGAGGGGAGTGGAGAGTATTTTGTTCGGATCGGAGTTGGGAGCCCGCCGAGGAGTCAGTATATGGTGATTGATTCCGGCAGTGA
CATTGTGTGGGTTCAATGCCAACCTTGTAGCCAATGCTACCAACAGTCCGATCCCGTTTTCGACCCGGCCGACTCCGCCTCGTTCACCGGTGTCTCCTGTGGCTCCGCCG
TTTGTGACCGCCTTGAGAATGCCGGTTGTCACGCTGGACGGTGTCGGTATGAGGTGTCCTATGGGGATGGGTCTTACACTAAGGGCACTCTCGCCCTCGAAACTCTCACC
ATCGGTCAAGTCACGGTTCGTGACGTGGCAATCGGCTGCGGCCATACGAACCAAGGCATGTTCATCGGAGCCGCCGGGCTACTCGGTCTCGGTGGTGGCTCAATGTCATT
CGTCGGCCAGCTTGGCGGTCAGACCGGCGGCGCATTCAGCTACTGTTTGGTGAGCCGAGGAACCGGCTCGACCGGAACATTAGAGTTCGGCCGCGGAGCAATGCCGGTTG
GCGCCACGTGGATCTCCCTGGTCCGAAACCCACGCGCCCCAAGCTTCTACTACATCGGACTCGCCGGCGTCGGCGTCGGCGGCGTCAAAGTCCCGATACCGGAGGAAACT
TTCCAGCTCTCCGAGTTCGGTGGCAACGGTGTGGTAATGGACACCGGCACCGCCGTGACGCGGCTGCCGACGGCGGCCTACGAGGCATTCCGCGATTCTTTCACGGCCCA
AACCGGCAACCTCCCACGAGCGCCGGGAGTTTCGATCTTCGACACGTGTTACGATCTCAACGGGTTCGAGTCCGTACGGGTGCCAACGGTGTCGTTTTACTTCTCCGACG
GGCCGGTGCTGACGCTGCCGGCGAAAAATTTTCTGATTCCGGTCGACGGCGCCGGAACTTTTTGCTTCGCTTTTGCGCCGTCGCCGTCGGGACTTTCCATAATCGGAAAC
ATCCAGCAGGAAGGGATTCAGATTTCATTCGACGGGGCTAATGGGTTTGTGGGATTTGGCCCAAATGTTTGCTAA
mRNA sequence
ATGCTCTTTAAACTTCCCTCTTTTCTTCTCTTCTTTCTCCTTTTCCTCTTCTTCTCCTCCGCCGACCGACCTTCTCCGACCAAGTTTCAATACCTCGATGTCAAAGCAAC
CAAATTCGACTTTAACGATAGTCAGATTCTTCATACCCTTAGTTTCTTCGACGGTCACCAGTTAGCGACCGGTCGAAAATCCAACCATACTAAATTTAAGCTCAACCTTG
TCCATCGGGATAAGCTATCCCACGTCCACGGCCGCCACCATGGCTTCGACGAGCGTATCAAAAGAGACGCCAAACGAGTCGCCACCCTCGTTCGCCGCCTATCGCGCGCT
GCCTCCGCCGTACAAGATAGAATGGAAGAGGGGAGTGGAGAGTATTTTGTTCGGATCGGAGTTGGGAGCCCGCCGAGGAGTCAGTATATGGTGATTGATTCCGGCAGTGA
CATTGTGTGGGTTCAATGCCAACCTTGTAGCCAATGCTACCAACAGTCCGATCCCGTTTTCGACCCGGCCGACTCCGCCTCGTTCACCGGTGTCTCCTGTGGCTCCGCCG
TTTGTGACCGCCTTGAGAATGCCGGTTGTCACGCTGGACGGTGTCGGTATGAGGTGTCCTATGGGGATGGGTCTTACACTAAGGGCACTCTCGCCCTCGAAACTCTCACC
ATCGGTCAAGTCACGGTTCGTGACGTGGCAATCGGCTGCGGCCATACGAACCAAGGCATGTTCATCGGAGCCGCCGGGCTACTCGGTCTCGGTGGTGGCTCAATGTCATT
CGTCGGCCAGCTTGGCGGTCAGACCGGCGGCGCATTCAGCTACTGTTTGGTGAGCCGAGGAACCGGCTCGACCGGAACATTAGAGTTCGGCCGCGGAGCAATGCCGGTTG
GCGCCACGTGGATCTCCCTGGTCCGAAACCCACGCGCCCCAAGCTTCTACTACATCGGACTCGCCGGCGTCGGCGTCGGCGGCGTCAAAGTCCCGATACCGGAGGAAACT
TTCCAGCTCTCCGAGTTCGGTGGCAACGGTGTGGTAATGGACACCGGCACCGCCGTGACGCGGCTGCCGACGGCGGCCTACGAGGCATTCCGCGATTCTTTCACGGCCCA
AACCGGCAACCTCCCACGAGCGCCGGGAGTTTCGATCTTCGACACGTGTTACGATCTCAACGGGTTCGAGTCCGTACGGGTGCCAACGGTGTCGTTTTACTTCTCCGACG
GGCCGGTGCTGACGCTGCCGGCGAAAAATTTTCTGATTCCGGTCGACGGCGCCGGAACTTTTTGCTTCGCTTTTGCGCCGTCGCCGTCGGGACTTTCCATAATCGGAAAC
ATCCAGCAGGAAGGGATTCAGATTTCATTCGACGGGGCTAATGGGTTTGTGGGATTTGGCCCAAATGTTTGCTAA
Protein sequence
MLFKLPSFLLFFLLFLFFSSADRPSPTKFQYLDVKATKFDFNDSQILHTLSFFDGHQLATGRKSNHTKFKLNLVHRDKLSHVHGRHHGFDERIKRDAKRVATLVRRLSRA
ASAVQDRMEEGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYQQSDPVFDPADSASFTGVSCGSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT
IGQVTVRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTGSTGTLEFGRGAMPVGATWISLVRNPRAPSFYYIGLAGVGVGGVKVPIPEET
FQLSEFGGNGVVMDTGTAVTRLPTAAYEAFRDSFTAQTGNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPAKNFLIPVDGAGTFCFAFAPSPSGLSIIGN
IQQEGIQISFDGANGFVGFGPNVC