CuGenDBv2

Spg031541 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg031541
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: AB hydrolase-1 domain-containing protein
Genome location: scaffold11:43658554..43663137
RNA-Seq Expression: Spg031541
Synteny: Spg031541
Gene Ontology terms: GO:0006508 - proteolysis (biological process); GO:0008233 - peptidase activity (molecular function)
InterPro domains: IPR000073 - Alpha/beta hydrolase fold-1; IPR029058 - Alpha/Beta hydrolase fold
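The genome location in this record is written as `scaffold:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be parsed and the gene span computed as below. `Location` and `parse_location` are hypothetical names used for illustration only; they are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'scaffold:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end = match.groups()
    return Location(scaffold, int(start), int(end))


loc = parse_location("scaffold11:43658554..43663137")
print(loc.scaffold, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span covers 4584 bases on scaffold11.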


Homology
GenBank top hits | e value | % identity | Alignment
KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia] | 3.7e-232 | 77.67
Query:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSN
        +K LL H FP  + SLIPL  LLSA HCRSSVRS A+MA TNP NGASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVS GK  
Subjt:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSN

Query:  LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANY
                 +P+         FE            W +K   C  F +  +D                      RGTGLSTPL+PSSMSQFQ+AEDLANY
Subjt:  LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANY

Query:  LKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREV
        LKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EV
Subjt:  LKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREV

Query:  VKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQ
        VKYL ENGGGV LP GGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQ
Subjt:  VKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQ

Query:  RMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRL
        R+++E+E++FD  +AVKEGCPVYFTGEMIFPWM DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNN+VPVAAAVYYEDM+VNFKLAMETAS IAGIRL
Subjt:  RMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRL

Query:  WITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        W+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  WITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus] | 6.5e-237 | 77.15
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI
        MFAARTAAP     PLLLHF   PC  L LIPL N LSAAHCR SVR SA MAG   P  ASPP HV+GTWYSVPELRLRDH+FSVPLNYSL++ +  +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI

Query:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS
        SVFAREVVS GK +  +  +L                          LQ G   ++C         P  +     K  + ++  + + RGTGLSTPLTPS
Subjt:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS

Query:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
        SMSQFQS++DLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
Subjt:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY

Query:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE
        YKRYPQD+EIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+V G+PKRIS+FFLNAID+WLSLDSNPLYVLLHE
Subjt:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE

Query:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK
        +IYCQGASSRWSAQR+K+E+E++FDANKAVKEGC VYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNN+VPVAAAVYYEDMFVNFK
Subjt:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK

Query:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        LAM+TAS IAGIRLW+TNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo] | 8.2e-240 | 78.43
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI
        MFA RTAAP     PLLLHF   P   L LIPLPN LSAAHCR SVR SA MAG   P   SPP HVAGTWYSVPELRLRDH+FSVPLNYSLD+ +S +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI

Query:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS
        SVFAREVVS GK +  +  +L                          LQ G   ++C         P  +     K  + ++  + + RGTGLSTPLTPS
Subjt:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS

Query:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
        SMSQF+SAEDLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
Subjt:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY

Query:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE
        YKRYPQD+EIVREVVKYLA+NGGGVLLPSGGILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHE
Subjt:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE

Query:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK
        SIYCQGASSRWSAQR+K+E+E++FDANKAVKEGCPVYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNN+VPVAAAVYYEDMFVNFK
Subjt:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK

Query:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        LAMETAS IAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata] | 9.7e-233 | 77.67
Query:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSN
        +K LL H FP  + SLIPL  LLSA HCRSSVRS A+MA TNP NGASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVS GK  
Subjt:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSN

Query:  LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANY
          +  +L                          LQ G   ++C         P  +     K  + ++  + + RGTGLSTPL+PSSMSQFQSAEDLA+Y
Subjt:  LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANY

Query:  LKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREV
        LKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EV
Subjt:  LKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREV

Query:  VKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQ
        VKYL ENGGG+ LP GGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQ
Subjt:  VKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQ

Query:  RMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRL
        R+ +E+E++FDA KAVKEGCPVYFTGEMIFPWM DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNN+VPVAAAVYYEDM+VNFKLAMETAS IAGIRL
Subjt:  RMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRL

Query:  WITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        W+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  WITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo] | 2.6e-233 | 77.14
Query:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSN
        +K LL H FP  + SLIPL  LLSA HCRSSVRS  +MA TNP NGASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVS GK  
Subjt:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSN

Query:  LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQW-----GKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAE
                                      DSP+ +     G   ++C         P  +     K  + ++  + + RGTGLSTPL+PSSMSQFQSAE
Subjt:  LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQW-----GKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAE

Query:  DLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVE
        DLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDVE
Subjt:  DLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVE

Query:  IVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASS
        IV EVVKYL ENGGGV LP GGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASS
Subjt:  IVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASS

Query:  RWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHI
        RWSAQR+ +E+E++FDA KAVKEGCPVYFTGEMIFPWM DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNN+VPVAAAVYYEDM+VNFKLAMETAS I
Subjt:  RWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHI

Query:  AGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        AGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  AGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L423 AB hydrolase-1 domain-containing protein | 3.1e-237 | 77.15
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI
        MFAARTAAP     PLLLHF   PC  L LIPL N LSAAHCR SVR SA MAG   P  ASPP HV+GTWYSVPELRLRDH+FSVPLNYSL++ +  +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI

Query:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS
        SVFAREVVS GK +  +  +L                          LQ G   ++C         P  +     K  + ++  + + RGTGLSTPLTPS
Subjt:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS

Query:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
        SMSQFQS++DLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
Subjt:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY

Query:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE
        YKRYPQD+EIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+V G+PKRIS+FFLNAID+WLSLDSNPLYVLLHE
Subjt:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE

Query:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK
        +IYCQGASSRWSAQR+K+E+E++FDANKAVKEGC VYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNN+VPVAAAVYYEDMFVNFK
Subjt:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK

Query:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        LAM+TAS IAGIRLW+TNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase | 4.0e-240 | 78.43
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI
        MFA RTAAP     PLLLHF   P   L LIPLPN LSAAHCR SVR SA MAG   P   SPP HVAGTWYSVPELRLRDH+FSVPLNYSLD+ +S +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI

Query:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS
        SVFAREVVS GK +  +  +L                          LQ G   ++C         P  +     K  + ++  + + RGTGLSTPLTPS
Subjt:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS

Query:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
        SMSQF+SAEDLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
Subjt:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY

Query:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE
        YKRYPQD+EIVREVVKYLA+NGGGVLLPSGGILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHE
Subjt:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE

Query:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK
        SIYCQGASSRWSAQR+K+E+E++FDANKAVKEGCPVYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNN+VPVAAAVYYEDMFVNFK
Subjt:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK

Query:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        LAMETAS IAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase | 4.0e-240 | 78.43
Query:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI
        MFA RTAAP     PLLLHF   P   L LIPLPN LSAAHCR SVR SA MAG   P   SPP HVAGTWYSVPELRLRDH+FSVPLNYSLD+ +S +I
Subjt:  MFAARTAAPPFSIKPLLLHF---PCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKI

Query:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS
        SVFAREVVS GK +  +  +L                          LQ G   ++C         P  +     K  + ++  + + RGTGLSTPLTPS
Subjt:  SVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPS

Query:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
        SMSQF+SAEDLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY
Subjt:  SMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKY

Query:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE
        YKRYPQD+EIVREVVKYLA+NGGGVLLPSGGILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHE
Subjt:  YKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHE

Query:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK
        SIYCQGASSRWSAQR+K+E+E++FDANKAVKEGCPVYFTGEMIFPWM DEIHALRPFKDAAHILA+KEDWPPLYDIAALKNN+VPVAAAVYYEDMFVNFK
Subjt:  SIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFK

Query:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        LAMETAS IAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  LAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC111440690 | 4.7e-233 | 77.67
Query:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSN
        +K LL H FP  + SLIPL  LLSA HCRSSVRS A+MA TNP NGASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVS GK  
Subjt:  IKPLLLH-FPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSN

Query:  LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANY
          +  +L                          LQ G   ++C         P  +     K  + ++  + + RGTGLSTPL+PSSMSQFQSAEDLA+Y
Subjt:  LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANY

Query:  LKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREV
        LKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EV
Subjt:  LKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREV

Query:  VKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQ
        VKYL ENGGG+ LP GGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQ
Subjt:  VKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQ

Query:  RMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRL
        R+ +E+E++FDA KAVKEGCPVYFTGEMIFPWM DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNN+VPVAAAVYYEDM+VNFKLAMETAS IAGIRL
Subjt:  RMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRL

Query:  WITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        W+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  WITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A6J1II94 uncharacterized protein LOC111473341 isoform X1 | 3.7e-230 | 77.34
Query:  IKPLLLH-FPCL-SLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKS
        +K LL H FP   + SLIPL  LLSA HCRSSVRS A+MAGT P N ASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  +SPKISV+AREVVS GK 
Subjt:  IKPLLLH-FPCL-SLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKS

Query:  NLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLAN
                  +P+         FE            W +K   C  F +  +D                      RGTGLSTPL+PSSMSQFQSAEDLAN
Subjt:  NLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLAN

Query:  YLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVRE
        YLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV E
Subjt:  YLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVRE

Query:  VVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSA
        VVKYL ENGGGV LP GGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSA
Subjt:  VVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSA

Query:  QRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIR
        QR+++E+E++FDA KAVKEGCPVYFTGEMIFPWM DEIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNN+VPVAAAVYYEDM+VNFKLAMETAS IAGIR
Subjt:  QRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIR

Query:  LWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        LW+TNEFMHSGLRD GPQVLDHLMG LNGKKPLF
Subjt:  LWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

SwissProt top hits | e value | % identity | Alignment
A0A1L9WUM2 Proline iminopeptidase aneH | 6.4e-54 | 34.89
Query:  GYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPP
        GY+    + RGTG S+P+T  +++Q    +  A+ LK FRADNIV D E +RK L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P
Subjt:  GYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPP

Query:  IGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRI
        + N      V    F     +NE YYK+YP+DV  V+ ++KYL EN   +   S G LTP+  Q LG+  LG   G + +H + +R  + +      K +
Subjt:  IGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRI

Query:  SYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYD
        +   L+ I++   +  N +Y LL E +YCQG +  W A + + + + RF  N   +    ++FTGE IF  M +    L+  K  A +LA   DW  LY+
Subjt:  SYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYD

Query:  IAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
         A L  N+VPV  A   EDM+V++ L   TAS +  ++  + N + H  +     +V+  L  L
Subjt:  IAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase | 3.4e-79 | 44.16
Query:  RGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRA
        RGTG STP+    ++     +  A+YL HFRAD+IV DAE IR++L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA
Subjt:  RGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRA

Query:  CFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLS
         +++V  +N  ++ R+P    I   +  +L  +   V LP+G  LT + LQ  GL  LG+S  FE ++YL E  +         ++++  FL  + +   
Subjt:  CFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLS

Query:  LDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAA
         ++NP++ +LHE IYC+GA+S W+A+R++ E  +      A  +G    FTGEMIFPWM ++   L P K+AAH+LAEK DW PLYD   L  N+VPVA 
Subjt:  LDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAA

Query:  AVYYEDMFVNFKLAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        AVY EDM+V F  + ET   ++  R WITNE+ H+GLR  G Q+LD L+ L
Subjt:  AVYYEDMFVNFKLAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

Arabidopsis top hits | e value | % identity | Alignment
AT3G61540.1 alpha/beta-Hydrolases superfamily protein | 3.7e-182 | 65.85
Query:  GASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHT
        G S  EHV G W+SVPELRLRDH F VPL+YS    +SPKI+VFARE+V+ GK    +  +L      G         F     +++    G     C  
Subjt:  GASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVS-GKSNLNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHT

Query:  FYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSF
        F +  +D                      RGTGLSTPLT SSM QF+SA++LA+YL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSF
Subjt:  FYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSF

Query:  APQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYL
        AP+GLKQVLITGGIPPIG  CTAD VY A FE+V  QNEKYYKR+PQD+EIVRE+V YLAE+ GGGV LPSGGILTPKGLQTLGLS LGSSTGFER+HY+
Subjt:  APQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYL

Query:  FERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFK
         ERVWDPI+V GAPK IS FFLNA +SW S D+NPLY LLHE+IYC+GASS WSA R++ + E +FDA KAVKE  PV FTGEMIFPWM DEIHAL+PFK
Subjt:  FERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIFPWMLDEIHALRPFK

Query:  DAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
         AA +LA+KEDWPPLYD+  L+NN+VPVAAAVYYEDM+VNFKL  ETASHI+GIRLW+TNEFMHSGLRDAG Q++DHL+G++NGKKPLF
Subjt:  DAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF


Sequences
CDS sequence
ATGTTCGCAGCTCGCACAGCAGCGCCGCCATTTTCGATAAAGCCACTCCTTCTTCACTTCCCCTGCCTCTCCCTTTCATTAATTCCACTCCCAAACCTTCTCTCCGCCGC
CCATTGCCGGAGCTCGGTTCGATCATCGGCAATCATGGCCGGAACCAATCCCCCCAATGGAGCATCGCCGCCGGAGCACGTTGCTGGCACGTGGTACTCCGTGCCGGAGC
TCCGTCTACGGGATCATTACTTCTCTGTGCCTCTCAATTACTCTCTAGATAAGCTTGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGGTAAATCGAAT
CTTAATTTGACTTTGATTTTACTTTCGATTCCTTTTTATGGAGCATCATCAGTGGTTGAAAATTTTGAATTTCGCAGTCATGTGGTGACTGATTCTCCATTGCAGTGGGG
AAAGAAGAACTACCAATGCCATACCTTTTATATTTACAAGGTGGACCCGGATTTGAGTGTCCCCGACCAACTGAAGCGAGTGGATGGATACAAAAAGCATGTGAAGAATT
TCCGAGGAACAGGATTATCGACTCCTTTGACTCCATCGTCCATGTCGCAATTTCAAAGTGCAGAGGACTTGGCCAACTACTTGAAACATTTTCGAGCTGATAACATAGTT
AATGATGCTGAATTTATTAGGAAGCGTCTTGTTCCTGATGCAGCACCTTGGACCATATTGGGTCAGAGCTATGGTGGTTTTTGTGCAGTTACGTATTTGAGTTTTGCACC
ACAAGGACTGAAACAAGTCCTCATAACTGGGGGAATCCCTCCAATTGGGAATGGATGCACTGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATCATTCAAAATG
AGAAGTACTACAAGAGGTATCCTCAGGATGTTGAAATCGTCCGTGAAGTCGTGAAGTACTTGGCAGAGAATGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTATCTTGACA
CCTAAAGGGCTGCAAACTCTTGGTCTTTCTGCTTTAGGATCTAGTACAGGTTTTGAGCGCATGCACTATCTGTTTGAAAGAGTATGGGATCCTATAATAGTTCCTGGAGC
GCCGAAACGAATCAGTTATTTCTTCCTCAATGCTATTGATAGCTGGCTCTCACTTGATTCAAATCCTCTTTATGTTCTCTTGCACGAATCAATATATTGCCAGGGTGCCT
CGTCTCGTTGGTCTGCTCAAAGAATGAAGCATGAAATGGAAAGCAGATTCGATGCAAATAAAGCTGTCAAAGAAGGTTGTCCTGTGTATTTCACTGGAGAGATGATTTTC
CCGTGGATGTTAGACGAGATTCATGCCTTGAGACCATTCAAGGACGCCGCTCATATATTGGCCGAGAAGGAGGATTGGCCTCCTCTATATGACATTGCTGCTCTTAAAAA
TAACCAGGTTCCGGTCGCAGCTGCGGTTTATTACGAAGATATGTTCGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCATATTGCAGGAATAAGGCTGTGGATTACTA
ATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
mRNA sequence
ATGTTCGCAGCTCGCACAGCAGCGCCGCCATTTTCGATAAAGCCACTCCTTCTTCACTTCCCCTGCCTCTCCCTTTCATTAATTCCACTCCCAAACCTTCTCTCCGCCGC
CCATTGCCGGAGCTCGGTTCGATCATCGGCAATCATGGCCGGAACCAATCCCCCCAATGGAGCATCGCCGCCGGAGCACGTTGCTGGCACGTGGTACTCCGTGCCGGAGC
TCCGTCTACGGGATCATTACTTCTCTGTGCCTCTCAATTACTCTCTAGATAAGCTTGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGGTAAATCGAAT
CTTAATTTGACTTTGATTTTACTTTCGATTCCTTTTTATGGAGCATCATCAGTGGTTGAAAATTTTGAATTTCGCAGTCATGTGGTGACTGATTCTCCATTGCAGTGGGG
AAAGAAGAACTACCAATGCCATACCTTTTATATTTACAAGGTGGACCCGGATTTGAGTGTCCCCGACCAACTGAAGCGAGTGGATGGATACAAAAAGCATGTGAAGAATT
TCCGAGGAACAGGATTATCGACTCCTTTGACTCCATCGTCCATGTCGCAATTTCAAAGTGCAGAGGACTTGGCCAACTACTTGAAACATTTTCGAGCTGATAACATAGTT
AATGATGCTGAATTTATTAGGAAGCGTCTTGTTCCTGATGCAGCACCTTGGACCATATTGGGTCAGAGCTATGGTGGTTTTTGTGCAGTTACGTATTTGAGTTTTGCACC
ACAAGGACTGAAACAAGTCCTCATAACTGGGGGAATCCCTCCAATTGGGAATGGATGCACTGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATCATTCAAAATG
AGAAGTACTACAAGAGGTATCCTCAGGATGTTGAAATCGTCCGTGAAGTCGTGAAGTACTTGGCAGAGAATGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTATCTTGACA
CCTAAAGGGCTGCAAACTCTTGGTCTTTCTGCTTTAGGATCTAGTACAGGTTTTGAGCGCATGCACTATCTGTTTGAAAGAGTATGGGATCCTATAATAGTTCCTGGAGC
GCCGAAACGAATCAGTTATTTCTTCCTCAATGCTATTGATAGCTGGCTCTCACTTGATTCAAATCCTCTTTATGTTCTCTTGCACGAATCAATATATTGCCAGGGTGCCT
CGTCTCGTTGGTCTGCTCAAAGAATGAAGCATGAAATGGAAAGCAGATTCGATGCAAATAAAGCTGTCAAAGAAGGTTGTCCTGTGTATTTCACTGGAGAGATGATTTTC
CCGTGGATGTTAGACGAGATTCATGCCTTGAGACCATTCAAGGACGCCGCTCATATATTGGCCGAGAAGGAGGATTGGCCTCCTCTATATGACATTGCTGCTCTTAAAAA
TAACCAGGTTCCGGTCGCAGCTGCGGTTTATTACGAAGATATGTTCGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCATATTGCAGGAATAAGGCTGTGGATTACTA
ATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
Protein sequence
MFAARTAAPPFSIKPLLLHFPCLSLSLIPLPNLLSAAHCRSSVRSSAIMAGTNPPNGASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDKLASPKISVFAREVVSGKSN
LNLTLILLSIPFYGASSVVENFEFRSHVVTDSPLQWGKKNYQCHTFYIYKVDPDLSVPDQLKRVDGYKKHVKNFRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIV
NDAEFIRKRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT
PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQRMKHEMESRFDANKAVKEGCPVYFTGEMIF
PWMLDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNQVPVAAAVYYEDMFVNFKLAMETASHIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
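The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. `translate` is a hypothetical helper, and the two fragments are the first 30 and last 15 nucleotides copied from the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon (hypothetical helper)."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS in this record.
cds_start = "ATGTTCGCAGCTCGCACAGCAGCGCCGCCA"
cds_end = "AAGCCTTTATTCTGA"

print(translate(cds_start))  # first ten residues
print(translate(cds_end))    # last residues before the stop codon
```

The two fragments translate to `MFAARTAAPP` and `KPLF`, matching the start and end of the protein sequence listed here. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole sequence.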