; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C006077 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C006077
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationchr06:891880..895774
RNA-Seq ExpressionMELO3C006077
SyntenyMELO3C006077
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048938.1 proline iminopeptidase [Cucumis melo var. makuwa]4.6e-27799.78Show/hide
Query:  MAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVI
        MAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKAC+EFRVI
Subjt:  MAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVI

Query:  LMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
        LMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
Subjt:  LMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS

Query:  VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID
        VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID
Subjt:  VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID

Query:  NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
        NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
Subjt:  NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV

Query:  PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus]3.9e-29295.87Show/hide
Query:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR
        MFA RTAAPPLLLHFHSLP R+LPLIPL NFLSAAHCRRSVRLSAAMAGILSPRA SPPVHV+GTWYSVPELRLRDHHFSVPLNYSL+Q S TRISVFAR
Subjt:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR

Query:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLL+LQGGPGFECARP+EASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQF+S++DLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLA+NGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo]1.3e-303100Show/hide
Query:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR
        MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR
Subjt:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR

Query:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]8.7e-26087.32Show/hide
Query:  LHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM
        L FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPM
Subjt:  LHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM

Query:  PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYLLYLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQF+SAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTST
        FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGG+ LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTST

Query:  GFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE
        GFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo]1.5e-25987.32Show/hide
Query:  LHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM
        L FHS P     LIPL   LSA HCR SVR    MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKED PM
Subjt:  LHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM

Query:  PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYL+YLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQF+SAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTST
        FCAVTYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD+EIV EVVKYL +NGGGV LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTST

Query:  GFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE
        GFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

TrEMBL top hitse value%identityAlignment
A0A0A0L423 AB hydrolase-1 domain-containing protein1.9e-29295.87Show/hide
Query:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR
        MFA RTAAPPLLLHFHSLP R+LPLIPL NFLSAAHCRRSVRLSAAMAGILSPRA SPPVHV+GTWYSVPELRLRDHHFSVPLNYSL+Q S TRISVFAR
Subjt:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR

Query:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLL+LQGGPGFECARP+EASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQF+S++DLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLA+NGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

A0A1S3AUX5 proline iminopeptidase6.2e-304100Show/hide
Query:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR
        MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR
Subjt:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR

Query:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

A0A5A7U143 Proline iminopeptidase2.2e-27799.78Show/hide
Query:  MAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVI
        MAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKAC+EFRVI
Subjt:  MAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVI

Query:  LMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
        LMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
Subjt:  LMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS

Query:  VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID
        VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID
Subjt:  VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID

Query:  NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
        NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
Subjt:  NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV

Query:  PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase6.2e-304100Show/hide
Query:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR
        MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR
Subjt:  MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAR

Query:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC1114406904.2e-26087.32Show/hide
Query:  LHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM
        L FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPM
Subjt:  LHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM

Query:  PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYLLYLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQF+SAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTST
        FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGG+ LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTST

Query:  GFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE
        GFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH1.0e-6935.55Show/hide
Query:  RLRDHHFSVPLNYSL-DQGSSTRISVFAREVVSV-GKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEE-FRVILMDQRGTGLSTPLTPSSMSQFRS
        R  +  F VPLN+S  D+G+   + +FAR +  V G +D  +P++LYLQGGPG  C  P E + W+    E+ +RV+ +D+RGTG S+P+T  +++Q   
Subjt:  RLRDHHFSVPLNYSL-DQGSSTRISVFAREVVSV-GKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEE-FRVILMDQRGTGLSTPLTPSSMSQFRS

Query:  AEDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKR
         +  A+ LK FRADNIV D E +R  L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE YYK+
Subjt:  AEDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKR

Query:  YPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIY
        YP+D+  V+ ++KYL +N   +   S G LTP+  Q LG+  LG   G + +H + +R  + +      K ++   L+ I+N   +  N +Y LL E +Y
Subjt:  YPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIY

Query:  CQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM
        CQG +  W A + + + + +F  N   +    ++FTGE IF  MF+    L+  K  A +LA   DW  LY+ A L  N+VPV  A   EDM+V++ L  
Subjt:  CQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM

Query:  ETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
         TAS++  ++  + N + H  +     +V+  L  L
Subjt:  ETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase2.0e-10244.49Show/hide
Query:  SPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLS
        S P+H     Y +  +    H F+VPL++       T I++F R +    + D  +P+LLYLQGGPGF   RPS   GWI++A +EFRV+L+DQRGTG S
Subjt:  SPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLS

Query:  TPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVI
        TP+    ++     +  A+YL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA +++V 
Subjt:  TPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVI

Query:  IQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPL
         +N  ++ R+P    I   +  +L  +   V LP+G  LT + LQ  GL  LG S  FE L+YL E  +         ++++  FL  +      ++NP+
Subjt:  IQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPL

Query:  YVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYED
        + +LHE IYC+GA+S W+A+R++ E    F A  A  +G    FTGEMIFPWMF++   L P K+AAH+LA+K DW PLYD   L  NKVPVA AVY ED
Subjt:  YVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYED

Query:  MFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        M+V F  + ET   ++  R WITNE+ H+GLR  G Q+LD L+ L
Subjt:  MFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase9.1e-0529.17Show/hide
Query:  LLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +++L GGPG   A PS    +     E +R++L DQRG G STP                  L+     ++VND E +R  L +P+   W + G S+G  
Subjt:  LLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT2G14260.2 proline iminopeptidase9.1e-0529.17Show/hide
Query:  LLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +++L GGPG   A PS    +     E +R++L DQRG G STP                  L+     ++VND E +R  L +P+   W + G S+G  
Subjt:  LLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT3G61540.1 alpha/beta-Hydrolases superfamily protein8.5e-21370.76Show/hide
Query:  AVRTAAPPL---LLHFHSLPFRLLPLIP--LPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISV
        A+ T  P L   +L   + P RL   +P  +  +     CR    ++ A +  +     S   HV G W+SVPELRLRDH F VPL+YS    SS +I+V
Subjt:  AVRTAAPPL---LLHFHSLPFRLLPLIP--LPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISV

Query:  FAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLV
        FARE+V+VGKE+Q MPYLLYLQGGPGFE  RPSEASGWIQ+ACEEFRV+L+DQRGTGLSTPLT SSM QF+SA++LA+YL HFRADNIV DAEFIR RLV
Subjt:  FAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLV

Query:  PDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADN-GGGVLLPSGGILT
        P A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD VY A FE+V  QNEKYYKR+PQDIEIVRE+V YLA++ GGGV LPSGGILT
Subjt:  PDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADN-GGGVLLPSGGILT

Query:  PKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
        PKGLQTLGLS LG+STGFERLHY+ ERVWDPILV GAPK IS FFLNA ++W S D+NPLY LLHE+IYC+GASS WSA R++++ E KFDA KAVKE  
Subjt:  PKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
        PV FTGEMIFPWMFDEIHAL+PFK AA +LA KEDWPPLYD+  L+NNKVPVAAAVYYEDM+VNFKL  ETAS I+GIRLW+TNEFMHSGLRDAG Q++D
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HL+G++NGKKPLF
Subjt:  HLMGLLNGKKPLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCAGTTCGCACGGCAGCGCCGCCACTTCTTCTTCACTTCCACTCTTTACCCTTCCGCCTCCTTCCATTAATTCCACTCCCCAATTTTCTCTCAGCCGCTCATTG
CCGAAGGTCGGTCCGTTTATCGGCAGCTATGGCCGGAATCCTTTCCCCTCGTGCACCATCGCCGCCAGTGCACGTGGCTGGCACGTGGTACTCCGTTCCGGAGCTCCGTC
TCCGGGACCATCACTTCTCTGTGCCTCTCAATTACTCTCTAGATCAGGGTTCTTCTACTAGGATCTCTGTTTTTGCGCGGGAAGTTGTTTCAGTGGGAAAAGAGGATCAA
CCAATGCCGTACCTTTTGTACTTACAAGGTGGACCTGGATTTGAGTGTGCCCGACCATCGGAAGCGAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTGTTATATT
GATGGATCAGCGAGGAACAGGATTATCGACTCCTTTGACTCCATCATCCATGTCACAATTTCGAAGTGCAGAGGATTTAGCCAACTACTTGAAACATTTTCGAGCTGATA
ATATAGTTAATGATGCTGAATTTATTAGGACTCGTCTTGTGCCTGATGCTGCACCTTGGACTATTTTGGGTCAGAGCTACGGCGGTTTTTGTGCAGTTACGTATTTGAGT
TTTGCACCACAAGGATTGAAGCAAGTTCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAAT
TCAAAATGAAAAGTACTACAAGAGGTATCCACAGGATATTGAAATTGTCCGTGAAGTTGTGAAATACTTGGCCGATAACGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTA
TTTTGACACCCAAAGGACTGCAAACTCTTGGTCTCTCTGCTCTGGGAACTAGTACAGGTTTTGAGCGCTTGCATTATCTGTTTGAGAGAGTTTGGGATCCTATACTAGTT
CCTGGAGCACCGAAACGGATTAGTTTTTTCTTCCTCAATGCAATTGATAACTGGCTCTCACTTGATTCAAATCCTCTATACGTTCTCTTACACGAATCAATATATTGCCA
GGGAGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAGTTTGATGCAAATAAAGCTGTAAAAGAAGGATGTCCCGTGTATTTCACAGGAGAGA
TGATCTTCCCATGGATGTTTGACGAGATACATGCCTTGAGACCATTCAAAGATGCAGCGCATATATTAGCCGATAAGGAGGATTGGCCTCCTCTATATGACATCGCTGCT
CTTAAAAATAACAAGGTTCCGGTGGCAGCTGCGGTGTATTACGAAGATATGTTTGTGAACTTTAAACTGGCCATGGAGACAGCTTCCCAAATAGCAGGAATAAGGTTATG
GATTACTAATGAATTTATGCATTCTGGTCTACGTGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTATTAAATGGAAAAAAGCCTTTATTTTGA
mRNA sequenceShow/hide mRNA sequence
ACCGTCCAAAAATGTTCGCAGTTCGCACGGCAGCGCCGCCACTTCTTCTTCACTTCCACTCTTTACCCTTCCGCCTCCTTCCATTAATTCCACTCCCCAATTTTCTCTCA
GCCGCTCATTGCCGAAGGTCGGTCCGTTTATCGGCAGCTATGGCCGGAATCCTTTCCCCTCGTGCACCATCGCCGCCAGTGCACGTGGCTGGCACGTGGTACTCCGTTCC
GGAGCTCCGTCTCCGGGACCATCACTTCTCTGTGCCTCTCAATTACTCTCTAGATCAGGGTTCTTCTACTAGGATCTCTGTTTTTGCGCGGGAAGTTGTTTCAGTGGGAA
AAGAGGATCAACCAATGCCGTACCTTTTGTACTTACAAGGTGGACCTGGATTTGAGTGTGCCCGACCATCGGAAGCGAGTGGATGGATACAAAAAGCATGTGAAGAATTT
CGTGTTATATTGATGGATCAGCGAGGAACAGGATTATCGACTCCTTTGACTCCATCATCCATGTCACAATTTCGAAGTGCAGAGGATTTAGCCAACTACTTGAAACATTT
TCGAGCTGATAATATAGTTAATGATGCTGAATTTATTAGGACTCGTCTTGTGCCTGATGCTGCACCTTGGACTATTTTGGGTCAGAGCTACGGCGGTTTTTGTGCAGTTA
CGTATTTGAGTTTTGCACCACAAGGATTGAAGCAAGTTCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAA
AAGGTTATAATTCAAAATGAAAAGTACTACAAGAGGTATCCACAGGATATTGAAATTGTCCGTGAAGTTGTGAAATACTTGGCCGATAACGGAGGCGGGGTTCTTCTTCC
CTCTGGTGGTATTTTGACACCCAAAGGACTGCAAACTCTTGGTCTCTCTGCTCTGGGAACTAGTACAGGTTTTGAGCGCTTGCATTATCTGTTTGAGAGAGTTTGGGATC
CTATACTAGTTCCTGGAGCACCGAAACGGATTAGTTTTTTCTTCCTCAATGCAATTGATAACTGGCTCTCACTTGATTCAAATCCTCTATACGTTCTCTTACACGAATCA
ATATATTGCCAGGGAGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAGTTTGATGCAAATAAAGCTGTAAAAGAAGGATGTCCCGTGTATTT
CACAGGAGAGATGATCTTCCCATGGATGTTTGACGAGATACATGCCTTGAGACCATTCAAAGATGCAGCGCATATATTAGCCGATAAGGAGGATTGGCCTCCTCTATATG
ACATCGCTGCTCTTAAAAATAACAAGGTTCCGGTGGCAGCTGCGGTGTATTACGAAGATATGTTTGTGAACTTTAAACTGGCCATGGAGACAGCTTCCCAAATAGCAGGA
ATAAGGTTATGGATTACTAATGAATTTATGCATTCTGGTCTACGTGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTATTAAATGGAAAAAAGCCTTTATTTTG
AGGTTTTTCTCATCTTTGTTATTTTTCTTCATTATTATTGGATTGAGTTTATGTTATTTTCTCCTTAATAAGCTTGTTATTTGTTAGTGGATGAGATTAGGAGATTGTAA
ACTGGTTAATCCTTTCTCCTAAATAAGTTTTTTCCCCATTAATTATAAGTAAATGAATAAATAAAATATAAAACAAAATTTAT
Protein sequenceShow/hide protein sequence
MFAVRTAAPPLLLHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQ
PMPYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLS
FAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILV
PGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAA
LKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF