; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G010800 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G010800
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationCmo_Chr06:8344640..8348886
RNA-Seq ExpressionCmoCh06G010800
SyntenyCmoCh06G010800
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]2.2e-29598.4Show/hide
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFD  +AVKEGCPVYFTGEMIFPWMF
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF

Query:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
Subjt:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

KAG7028582.1 Proline iminopeptidase [Cucurbita argyrosperma subsp. argyrosperma]4.4e-28894.75Show/hide
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ---------------GASSRWSAQRIMNELENKFDATKAVKEG
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRI NELENKFD TKAVKEG
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ---------------GASSRWSAQRIMNELENKFDATKAVKEG

Query:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
        CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
Subjt:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]5.5e-299100Show/hide
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF

Query:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
Subjt:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

XP_022974658.1 uncharacterized protein LOC111473341 isoform X1 [Cucurbita maxima]5.0e-29297.6Show/hide
Query:  MKALLFHSFP-SPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFP-SPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALG
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWM
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFDATKAVKEGCPVYFTGEMIFPWM
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWM

Query:  FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMG LNGKKPLF
Subjt:  FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo]1.3e-29598.4Show/hide
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSL VMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKE+ 
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDV+IVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF

Query:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
Subjt:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

TrEMBL top hitse value%identityAlignment
A0A0A0L423 AB hydrolase-1 domain-containing protein1.6e-25686.12Show/hide
Query:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPM
        L FHS P     LIPL   LSA HCR SVR  A MA       ASPP H +GTWYSVPELRLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVGKE+QPM
Subjt:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPM

Query:  PYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYLL+LQGGPGFEC RPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQS++DLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDE
        GFER+HYLFERVWDPI+V G+PKRIS+FFLNAI  WLSLDSNPLY L+HE+IYCQGASSRWSAQRI NE+ENKFDA KAVKEGC VYFTGEMIFPWMFDE
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+TASQIAGIRLWVTNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase9.2e-26087.32Show/hide
Query:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPM
        L FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPM
Subjt:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPM

Query:  PYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYLLYLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQF+SAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGG+ LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDE
        GFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase9.2e-26087.32Show/hide
Query:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPM
        L FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPM
Subjt:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPM

Query:  PYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYLLYLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQF+SAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGG+ LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDE
        GFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC1114406902.7e-299100Show/hide
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMF

Query:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
Subjt:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

A0A6J1II94 uncharacterized protein LOC111473341 isoform X12.4e-29297.6Show/hide
Query:  MKALLFHSFP-SPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFP-SPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALG
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWM
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFDATKAVKEGCPVYFTGEMIFPWM
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWM

Query:  FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMG LNGKKPLF
Subjt:  FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH6.1e-6735.54Show/hide
Query:  RLRDHYFSVPLNYSLDHSSPKISVYAREVVSV-GKEEQPMPYLLYLQGGPGFECPRPTEASGWIQKACEE-FRVILMDQRGTGLSTPLSPSSMSQFQSAE
        R  +  F VPLN+S       + ++AR +  V G ++  +P++LYLQGGPG  C  P E + W+    E+ +RV+ +D+RGTG S+P++  +++Q    +
Subjt:  RLRDHYFSVPLNYSLDHSSPKISVYAREVVSV-GKEEQPMPYLLYLQGGPGFECPRPTEASGWIQKACEE-FRVILMDQRGTGLSTPLSPSSMSQFQSAE

Query:  DLADYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP
          AD LK FRADNIV D E +R  L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE YYK+YP
Subjt:  DLADYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP

Query:  QDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDS----NPLYGLMHES
        +DV  V  ++KYL+EN   +     G LTP+  Q LG+  LG   G + +H + +R  + + +        + FL A +  L  +S    N +Y L+ E 
Subjt:  QDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDS----NPLYGLMHES

Query:  IYCQGASSRWSAQRIMNELENKFDATKAVKE-GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFK
        +YCQG +  W A +       K D   ++ E    ++FTGE IF  MF+    LK  K  A +LA   DW  LY+ A L  N+VPV  A   EDMYV++ 
Subjt:  IYCQGASSRWSAQRIMNELENKFDATKAVKE-GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFK

Query:  LAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL
        L   TAS++  ++  V N + H  +     +V+  L  L
Subjt:  LAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL

P46547 Proline iminopeptidase2.4e-10343.09Show/hide
Query:  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQF
        Y +  +    H+F+VPL++        I+++ R +    + +  +P+LLYLQGGPGF  PRP+   GWI++A +EFRV+L+DQRGTG STP+    ++  
Subjt:  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQF

Query:  QSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP
           +  ADYL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA ++++  +N  ++ R+P
Subjt:  QSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP

Query:  QDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ
            I + +  +L+ +   + LP G  LT + LQ  GL  LG+S  FE ++YL E  +         ++++  FL  +      ++NP++ ++HE IYC+
Subjt:  QDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ

Query:  GASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMET
        GA+S W+A+R+  E         A  +G    FTGEMIFPWMF++   L P K+AA++LAEK DW PLYD   L  NKVPVA AVY EDMYV F  + ET
Subjt:  GASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMET

Query:  ASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL
           ++  R W+TNE+ H+GLR  G Q+LD L+ L
Subjt:  ASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase1.5e-0428.33Show/hide
Query:  LLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +++L GGPG      T  S       E +R++L DQRG G STP +                L+     ++VND E +R  L +P+   W + G S+G  
Subjt:  LLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT2G14260.2 proline iminopeptidase1.5e-0428.33Show/hide
Query:  LLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +++L GGPG      T  S       E +R++L DQRG G STP +                L+     ++VND E +R  L +P+   W + G S+G  
Subjt:  LLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT3G61540.1 alpha/beta-Hydrolases superfamily protein1.5e-21476.7Show/hide
Query:  GASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGL
        G S  EH  G W+SVPELRLRDH F VPL+YS   SSPKI+V+ARE+V+VGKEEQ MPYLLYLQGGPGFE PRP+EASGWIQ+ACEEFRV+L+DQRGTGL
Subjt:  GASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGL

Query:  STPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKI
        STPL+ SSM QF+SA++LADYL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD VY A FE++
Subjt:  STPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKI

Query:  IIQNEKYYKRYPQDVKIVHEVVKYL-EENGGGIPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSN
          QNEKYYKR+PQD++IV E+V YL E  GGG+PLP GGILTPKGLQTLGLS LGSSTGFER+HY+ ERVWDPI+V GAPK IS FFLNA   W S D+N
Subjt:  IIQNEKYYKRYPQDVKIVHEVVKYL-EENGGGIPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSN

Query:  PLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYY
        PLY L+HE+IYC+GASS WSA R+ ++ E KFDA KAVKE  PV FTGEMIFPWMFDEIHALKPFK AA++LA+KEDWPPLYD+  L+NNKVPVAAAVYY
Subjt:  PLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYY

Query:  EDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        EDMYVNFKL  ETAS I+GIRLWVTNEFMHSGLRD G Q++DHL+G++NGKKPLF
Subjt:  EDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTCCGTTCATTGGCAGT
CATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTTTCTGTGCCTC
TCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTCTATACTTACAAGGT
GGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGCGAAGAATTTCGTGTTATATTGATGGACCAGCGGGGAACAGGATTATCGAC
TCCTTTGTCTCCATCATCCATGTCCCAATTCCAAAGTGCAGAGGACTTGGCCGACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGCTGAATTCATTAGGA
CTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTC
ATAACTGGAGGAATCCCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATACTACAAGAGGTATCC
TCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGATTCCTCTTCCCTGTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTG
GGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAGCGCATGCACTATTTGTTTGAGAGAGTATGGGATCCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTC
TTCCTTAATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTCGATATATTGCCAGGGCGCCTCGTCTCGCTGGTCTGCTCAAAG
AATAATGAATGAACTGGAGAACAAATTCGATGCAACAAAGGCTGTAAAAGAAGGATGTCCTGTGTATTTCACTGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTC
ATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGACATTGCTGCTCTTAAAAATAACAAGGTCCCGGTCGCAGCA
GCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATAGCAGGAATCAGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCT
GCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
mRNA sequenceShow/hide mRNA sequence
TTCCGCCATTTTTGATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTC
CGTTCATTGGCAGTCATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTA
CTTTTCTGTGCCTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTC
TATACTTACAAGGTGGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGCGAAGAATTTCGTGTTATATTGATGGACCAGCGGGGA
ACAGGATTATCGACTCCTTTGTCTCCATCATCCATGTCCCAATTCCAAAGTGCAGAGGACTTGGCCGACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGC
TGAATTCATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGAT
TGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATAC
TACAAGAGGTATCCTCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGATTCCTCTTCCCTGTGGTGGTATCTTGACACCTAAAGG
GCTGCAAACTCTTGGGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAGCGCATGCACTATTTGTTTGAGAGAGTATGGGATCCTATAATAGTTCCTGGAGCACCAAAAC
GAATCAGTTATTTCTTCCTTAATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTCGATATATTGCCAGGGCGCCTCGTCTCGC
TGGTCTGCTCAAAGAATAATGAATGAACTGGAGAACAAATTCGATGCAACAAAGGCTGTAAAAGAAGGATGTCCTGTGTATTTCACTGGAGAGATGATCTTCCCGTGGAT
GTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGACATTGCTGCTCTTAAAAATAACAAGG
TCCCGGTCGCAGCAGCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATAGCAGGAATCAGGCTGTGGGTTACTAATGAATTT
ATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGAGGTTTTTTTCTCTAAACTTTGTTGCT
TTTCCTCATGATTATTGGATGCAGTCTTTGCCATGACCTTTTCATTTCCTCCTCAATAAGCTTTATCTCCTCCTCAATAATGTGTGTG
Protein sequenceShow/hide protein sequence
MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLLYLQG
GPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVL
ITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYF
FLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAA
AVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF