; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020716 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020716
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationchr7:1563330..1567671
RNA-Seq ExpressionLag0020716
SyntenyLag0020716
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]3.3e-26789.8Show/hide
Query:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEE
        +K LL H FP  + SLIPL  LLSA HCRSSVRS AVMA TNP NGASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVSVGKEE
Subjt:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEE

Query:  LPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQS
         PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRV+LMDQRGTGLSTPL+PSSMSQFQ+AEDL NYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQS
Subjt:  LPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG
        YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGGV LP GGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWM
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQR+++E+E++FD  +AVKEGCPVYFTGEMIFPWM
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWM

Query:  LDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
         DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  LDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus]5.5e-27088.52Show/hide
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI
        MFAARTAAP     PLLLHF   PC  L LIPL N LSAAHCR SVR SA MAG   P  ASPP HV+GTWYSVPELRLRDH+FSVPLNYSL+  +  +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI

Query:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR
        SVFAREVVSVGKE+ PMPYLL+LQGGPGFEC RPTEASGWIQKACEEFRV+LMDQRGTGLSTPLTPSSMSQFQS++DL NYLKHFRADNIVNDAEFIR R
Subjt:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR

Query:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL
        LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAENGGGVLLPSGGIL
Subjt:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL

Query:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG
        TPKGLQTLGLSALG+STGFER+HYLFERVWDPI+V G+PKRIS+FFLNAID+WLSLDSNPLYVLLHE+IYCQGASSRWSAQR+K+E+E++FDANKAVKEG
Subjt:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG

Query:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL
        C VYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVL
Subjt:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo]6.9e-27389.88Show/hide
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI
        MFA RTAAP     PLLLHF   P   L LIPLPN LSAAHCR SVR SA MAG   P   SPP HVAGTWYSVPELRLRDH+FSVPLNYSLD  +S +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI

Query:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR
        SVFAREVVSVGKE+ PMPYLLYLQGGPGFEC RP+EASGWIQKACEEFRV+LMDQRGTGLSTPLTPSSMSQF+SAEDL NYLKHFRADNIVNDAEFIR R
Subjt:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR

Query:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL
        LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPSGGIL
Subjt:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL

Query:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG
        TPKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHESIYCQGASSRWSAQR+K+E+E++FDANKAVKEG
Subjt:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG

Query:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL
        CPVYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL
Subjt:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]5.1e-26890.2Show/hide
Query:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEE
        +K LL H FP  + SLIPL  LLSA HCRSSVRS AVMA TNP NGASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVSVGKEE
Subjt:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEE

Query:  LPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQS
         PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRV+LMDQRGTGLSTPL+PSSMSQFQSAEDL +YLKHFRADNIVNDAEFIR RLVPDAAPWTILGQS
Subjt:  LPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG
        YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWM
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQR+ +E+E++FDA KAVKEGCPVYFTGEMIFPWM
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWM

Query:  LDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
         DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  LDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo]1.5e-26790Show/hide
Query:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEE
        +K LL H FP  + SLIPL  LLSA HCRSSVRS  VMA TNP NGASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVSVGKE+
Subjt:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEE

Query:  LPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQS
         PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRV+LMDQRGTGLSTPL+PSSMSQFQSAEDL NYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQS
Subjt:  LPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG
        YGGFCAVTYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDVEIV EVVKYL ENGGGV LP GGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWM
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQR+ +E+E++FDA KAVKEGCPVYFTGEMIFPWM
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWM

Query:  LDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
         DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  LDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

TrEMBL top hitse value%identityAlignment
A0A0A0L423 AB hydrolase-1 domain-containing protein2.6e-27088.52Show/hide
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI
        MFAARTAAP     PLLLHF   PC  L LIPL N LSAAHCR SVR SA MAG   P  ASPP HV+GTWYSVPELRLRDH+FSVPLNYSL+  +  +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI

Query:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR
        SVFAREVVSVGKE+ PMPYLL+LQGGPGFEC RPTEASGWIQKACEEFRV+LMDQRGTGLSTPLTPSSMSQFQS++DL NYLKHFRADNIVNDAEFIR R
Subjt:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR

Query:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL
        LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAENGGGVLLPSGGIL
Subjt:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL

Query:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG
        TPKGLQTLGLSALG+STGFER+HYLFERVWDPI+V G+PKRIS+FFLNAID+WLSLDSNPLYVLLHE+IYCQGASSRWSAQR+K+E+E++FDANKAVKEG
Subjt:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG

Query:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL
        C VYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVL
Subjt:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase3.3e-27389.88Show/hide
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI
        MFA RTAAP     PLLLHF   P   L LIPLPN LSAAHCR SVR SA MAG   P   SPP HVAGTWYSVPELRLRDH+FSVPLNYSLD  +S +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI

Query:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR
        SVFAREVVSVGKE+ PMPYLLYLQGGPGFEC RP+EASGWIQKACEEFRV+LMDQRGTGLSTPLTPSSMSQF+SAEDL NYLKHFRADNIVNDAEFIR R
Subjt:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR

Query:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL
        LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPSGGIL
Subjt:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL

Query:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG
        TPKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHESIYCQGASSRWSAQR+K+E+E++FDANKAVKEG
Subjt:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG

Query:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL
        CPVYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL
Subjt:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase3.3e-27389.88Show/hide
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI
        MFA RTAAP     PLLLHF   P   L LIPLPN LSAAHCR SVR SA MAG   P   SPP HVAGTWYSVPELRLRDH+FSVPLNYSLD  +S +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKI

Query:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR
        SVFAREVVSVGKE+ PMPYLLYLQGGPGFEC RP+EASGWIQKACEEFRV+LMDQRGTGLSTPLTPSSMSQF+SAEDL NYLKHFRADNIVNDAEFIR R
Subjt:  SVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKR

Query:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL
        LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPSGGIL
Subjt:  LVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGIL

Query:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG
        TPKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHESIYCQGASSRWSAQR+K+E+E++FDANKAVKEG
Subjt:  TPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEG

Query:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL
        CPVYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL
Subjt:  CPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC1114406902.5e-26890.2Show/hide
Query:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEE
        +K LL H FP  + SLIPL  LLSA HCRSSVRS AVMA TNP NGASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVSVGKEE
Subjt:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEE

Query:  LPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQS
         PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRV+LMDQRGTGLSTPL+PSSMSQFQSAEDL +YLKHFRADNIVNDAEFIR RLVPDAAPWTILGQS
Subjt:  LPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG
        YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWM
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQR+ +E+E++FDA KAVKEGCPVYFTGEMIFPWM
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWM

Query:  LDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
         DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  LDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A6J1II94 uncharacterized protein LOC111473341 isoform X13.4e-26589.42Show/hide
Query:  IKPLLLH-FPCL-SLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKE
        +K LL H FP   + SLIPL  LLSA HCRSSVRS AVMAGT P N ASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVSVGKE
Subjt:  IKPLLLH-FPCL-SLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKE

Query:  ELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQ
        E PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRV+LMDQRGTGLSTPL+PSSMSQFQSAEDL NYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQ
Subjt:  ELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQ

Query:  SYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSAL
        SYGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGGV LP GGILTPKGLQTLGLSAL
Subjt:  SYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSAL

Query:  GSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPW
        GSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQR+++E+E++FDA KAVKEGCPVYFTGEMIFPW
Subjt:  GSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPW

Query:  MLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPL
        M DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMG LNGKKPL
Subjt:  MLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPL

Query:  F
        F
Subjt:  F

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH3.7e-6735.17Show/hide
Query:  RLRDHYFSVPLNYSLDMLASPKISVFAREVVSV-GKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEE-FRVVLMDQRGTGLSTPLTPSSMSQFQSA
        R  +  F VPLN+S        + +FAR +  V G ++  +P++LYLQGGPG  C  P E + W+    E+ +RV+ +D+RGTG S+P+T  +++Q    
Subjt:  RLRDHYFSVPLNYSLDMLASPKISVFAREVVSV-GKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEE-FRVVLMDQRGTGLSTPLTPSSMSQFQSA

Query:  EDLVNYLKHFRADNIVNDAEFIRKRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRY
        +   + LK FRADNIV D E +RK L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE YYK+Y
Subjt:  EDLVNYLKHFRADNIVNDAEFIRKRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRY

Query:  PQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYC
        P+DV  V+ ++KYL EN   +   S G LTP+  Q LG+  LG   G + +H + +R  + +      K ++   L+ I++   +  N +Y LL E +YC
Subjt:  PQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYC

Query:  QGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAME
        QG +  W A + + + + RF  N   +    ++FTGE IF  M +    L+  K  A +LA   DW  LY+ A L  N+VPV  A   EDM+V++ L   
Subjt:  QGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAME

Query:  TASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        TAS++  ++  + N + H  +     +V+  L  L
Subjt:  TASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase3.3e-10043.45Show/hide
Query:  YSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQ
        Y +  +    H+F+VPL++         I++F R +    + +  +P+LLYLQGGPGF  PRP+   GWI++A +EFRV+L+DQRGTG STP+    ++ 
Subjt:  YSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQ

Query:  FQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRY
            +   +YL HFRAD+IV DAE IR++L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA +++V  +N  ++ R+
Subjt:  FQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRY

Query:  PQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYC
        P    I   +  +L  +   V LP+G  LT + LQ  GL  LG+S  FE ++YL E  +         ++++  FL  + +    ++NP++ +LHE IYC
Subjt:  PQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYC

Query:  QGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAME
        +GA+S W+A+R++ E  +      A  +G    FTGEMIFPWM ++   L P K+AAH+LAEK DW PLYD   L  NKVPVA AVY EDM+V F  + E
Subjt:  QGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAME

Query:  TASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        T   ++  R WITNE+ H+GLR  G Q+LD L+ L
Subjt:  TASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase9.1e-0529.17Show/hide
Query:  LLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRL-VPDAAPWTILGQSYGGF
        +++L GGPG      T  S       E +R+VL DQRG G STP                  L+     ++VND E +R+ L +P+   W + G S+G  
Subjt:  LLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT2G14260.2 proline iminopeptidase9.1e-0529.17Show/hide
Query:  LLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRL-VPDAAPWTILGQSYGGF
        +++L GGPG      T  S       E +R+VL DQRG G STP                  L+     ++VND E +R+ L +P+   W + G S+G  
Subjt:  LLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT3G61540.1 alpha/beta-Hydrolases superfamily protein2.3e-21377.19Show/hide
Query:  GASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTG
        G S  EHV G W+SVPELRLRDH F VPL+YS    +SPKI+VFARE+V+VGKEE  MPYLLYLQGGPGFE PRP+EASGWIQ+ACEEFRVVL+DQRGTG
Subjt:  GASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKEELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTG

Query:  LSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK
        LSTPLT SSM QF+SA++L +YL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD VY A FE+
Subjt:  LSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK

Query:  VIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDS
        V  QNEKYYKR+PQD+EIVRE+V YLAE+ GGGV LPSGGILTPKGLQTLGLS LGSSTGFER+HY+ ERVWDPI+V GAPK IS FFLNA +SW S D+
Subjt:  VIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDS

Query:  NPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVY
        NPLY LLHE+IYC+GASS WSA R++ + E +FDA KAVKE  PV FTGEMIFPWM DEIHAL+PFK AA +LA+KEDWPPLYD+  L+NNKVPVAAAVY
Subjt:  NPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVY

Query:  YEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        YEDM+VNFKL  ETAS I+GIRLW+TNEFMHSGLRDAG Q++DHL+G++NGKKPLF
Subjt:  YEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCAGCTCGCACAGCAGCGCCGCCATTTTCGATAAAGCCACTCCTTCTTCACTTCCCCTGCCTCTCCCTTTCATTAATTCCACTCCCAAACCTTCTCTCCGCCGC
CCATTGCCGGAGCTCGGTTCGATCATCGGCAGTCATGGCCGGAACCAATCCCCCCAATGGAGCATCGCCGCCGGAGCACGTTGCTGGCACGTGGTACTCCGTGCCGGAGC
TCCGTCTACGGGATCATTATTTCTCTGTGCCTCTCAATTACTCTCTCGATATGCTTGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGTGGGGAAAGAA
GAACTACCAATGCCATACCTTTTATATTTACAAGGTGGACCCGGATTTGAGTGTCCCCGACCAACTGAAGCGAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTGT
TGTATTGATGGATCAGCGAGGAACAGGATTATCGACTCCTTTGACTCCATCGTCCATGTCACAATTTCAAAGTGCAGAGGACTTGGTCAACTACTTGAAACATTTTCGAG
CTGATAACATAGTTAATGATGCTGAATTTATTAGGAAGCGTCTTGTTCCTGATGCAGCACCTTGGACCATATTGGGTCAGAGCTATGGTGGTTTTTGTGCAGTTACGTAT
TTGAGTTTTGCACCACAAGGACTGAAACAAGTCCTCATAACTGGGGGAATCCCTCCAATTGGGAATGGATGCACTGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGT
TATCATTCAAAATGAGAAGTACTACAAGAGGTATCCTCAGGATGTTGAAATCGTCCGTGAAGTCGTGAAGTACTTGGCAGAGAATGGAGGCGGGGTTCTTCTTCCCTCTG
GTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGTCTTTCTGCTTTAGGATCTAGTACAGGTTTTGAGCGCATGCACTATCTGTTTGAAAGAGTATGGGATCCTATA
ATAGTTCCTGGAGCGCCGAAACGAATCAGTTATTTCTTCCTCAATGCTATTGATAGCTGGCTCTCACTTGATTCAAATCCTCTCTATGTTCTCTTGCACGAATCGATATA
TTGCCAGGGTGCCTCGTCTCGTTGGTCTGCTCAAAGAATGAAGCATGAAATGGAAAGCAGATTCGATGCAAATAAAGCTGTCAAAGAAGGTTGTCCTGTGTATTTCACTG
GAGAGATGATTTTCCCGTGGATGTTAGACGAGATTCATGCCTTGAGACCATTCAAGGACGCCGCACATATATTGGCCGAGAAGGAGGATTGGCCTCCTCTATATGACATT
GCTGCTCTTAAAAATAACAAGGTTCCGGTCGCAGCTGCGGTTTATTACGAAGATATGTTCGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATTGCTGGAATAAG
GCTGTGGATTACTAATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGCAGCTCGCACAGCAGCGCCGCCATTTTCGATAAAGCCACTCCTTCTTCACTTCCCCTGCCTCTCCCTTTCATTAATTCCACTCCCAAACCTTCTCTCCGCCGC
CCATTGCCGGAGCTCGGTTCGATCATCGGCAGTCATGGCCGGAACCAATCCCCCCAATGGAGCATCGCCGCCGGAGCACGTTGCTGGCACGTGGTACTCCGTGCCGGAGC
TCCGTCTACGGGATCATTATTTCTCTGTGCCTCTCAATTACTCTCTCGATATGCTTGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGTGGGGAAAGAA
GAACTACCAATGCCATACCTTTTATATTTACAAGGTGGACCCGGATTTGAGTGTCCCCGACCAACTGAAGCGAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTGT
TGTATTGATGGATCAGCGAGGAACAGGATTATCGACTCCTTTGACTCCATCGTCCATGTCACAATTTCAAAGTGCAGAGGACTTGGTCAACTACTTGAAACATTTTCGAG
CTGATAACATAGTTAATGATGCTGAATTTATTAGGAAGCGTCTTGTTCCTGATGCAGCACCTTGGACCATATTGGGTCAGAGCTATGGTGGTTTTTGTGCAGTTACGTAT
TTGAGTTTTGCACCACAAGGACTGAAACAAGTCCTCATAACTGGGGGAATCCCTCCAATTGGGAATGGATGCACTGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGT
TATCATTCAAAATGAGAAGTACTACAAGAGGTATCCTCAGGATGTTGAAATCGTCCGTGAAGTCGTGAAGTACTTGGCAGAGAATGGAGGCGGGGTTCTTCTTCCCTCTG
GTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGTCTTTCTGCTTTAGGATCTAGTACAGGTTTTGAGCGCATGCACTATCTGTTTGAAAGAGTATGGGATCCTATA
ATAGTTCCTGGAGCGCCGAAACGAATCAGTTATTTCTTCCTCAATGCTATTGATAGCTGGCTCTCACTTGATTCAAATCCTCTCTATGTTCTCTTGCACGAATCGATATA
TTGCCAGGGTGCCTCGTCTCGTTGGTCTGCTCAAAGAATGAAGCATGAAATGGAAAGCAGATTCGATGCAAATAAAGCTGTCAAAGAAGGTTGTCCTGTGTATTTCACTG
GAGAGATGATTTTCCCGTGGATGTTAGACGAGATTCATGCCTTGAGACCATTCAAGGACGCCGCACATATATTGGCCGAGAAGGAGGATTGGCCTCCTCTATATGACATT
GCTGCTCTTAAAAATAACAAGGTTCCGGTCGCAGCTGCGGTTTATTACGAAGATATGTTCGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATTGCTGGAATAAG
GCTGTGGATTACTAATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
Protein sequenceShow/hide protein sequence
MFAARTAAPPFSIKPLLLHFPCLSLSLIPLPNLLSAAHCRSSVRSSAVMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDMLASPKISVFAREVVSVGKE
ELPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVVLMDQRGTGLSTPLTPSSMSQFQSAEDLVNYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTY
LSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPI
IVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDI
AALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF