CuGenDBv2

Csor.00g273600 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene ID: Csor.00g273600
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: AB hydrolase-1 domain-containing protein
Genome location: Csor_Chr06:8046848..8051003
RNA-Seq Expression: Csor.00g273600
Synteny: Csor.00g273600
Gene Ontology terms: GO:0006508 - proteolysis (biological process); GO:0008233 - peptidase activity (molecular function)
InterPro domains: IPR000073 - Alpha/beta hydrolase fold-1; IPR002410 - Peptidase S33; IPR029058 - Alpha/Beta hydrolase fold
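The genome location field can be split into a chromosome name and a coordinate pair. The sketch below is a minimal illustration, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Csor_Chr06:8046848..8051003")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 4156 bp on Csor_Chr06.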


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 0.0 | % identity: 100
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF

Query:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
Subjt:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

KAG7028582.1 Proline iminopeptidase [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 0.0 | % identity: 95.53
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ---------------GASSRWSAQRIRNELENKFDVIRAVKEG
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRIRNELENKFDV +AVKEG
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ---------------GASSRWSAQRIRNELENKFDVIRAVKEG

Query:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
        CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
Subjt:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata] | e-value: 0.0 | % identity: 98.4
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFD  +AVKEGCPVYFTGEMIFPWMF
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF

Query:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
Subjt:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

XP_022974658.1 uncharacterized protein LOC111473341 isoform X1 [Cucurbita maxima] | e-value: 0.0 | % identity: 97.6
Query:  MKALLFHSFPS-PARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFPS PARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFPS-PARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWM
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFD  +AVKEGCPVYFTGEMIFPWM
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWM

Query:  FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMG LNGKKPLF
Subjt:  FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo] | e-value: 0.0 | % identity: 98
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSL VMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKE+ 
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDV+IVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFD  +AVKEGCPVYFTGEMIFPWMF
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF

Query:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
Subjt:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0L423 AB hydrolase-1 domain-containing protein | e-value: 0.0 | % identity: 85.86
Query:  FHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPMPY
        FHS P     LIPL   LSA HCR SVR  A MA       ASPP H +GTWYSVPELRLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVGKE+QPMPY
Subjt:  FHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPMPY

Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
        L++LQGGPGFEC RPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQ+++DLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC

Query:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGF
        AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL ENGGGV LP GGILTPKGLQTLGLSALG+STGF
Subjt:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGF

Query:  ERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIH
        ER+HYLFERVWDPI+V G+PKRIS+FFLNAI  WLSLDSNPLY L+HE+IYCQGASSRWSAQRI+NE+ENKFD  +AVKEGC VYFTGEMIFPWMFDEIH
Subjt:  ERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIH

Query:  ALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        AL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+TASQIAGIRLWVTNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  ALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase | e-value: 0.0 | % identity: 87.07
Query:  FHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPMPY
        FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPMPY
Subjt:  FHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPMPY

Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
        L+YLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQF++AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC

Query:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGF
        AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGGV LP GGILTPKGLQTLGLSALG+STGF
Subjt:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGF

Query:  ERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIH
        ER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI+NE+ENKFD  +AVKEGCPVYFTGEMIFPWMFDEIH
Subjt:  ERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIH

Query:  ALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        AL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  ALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase | e-value: 0.0 | % identity: 87.07
Query:  FHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPMPY
        FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPMPY
Subjt:  FHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPMPY

Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
        L+YLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQF++AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC

Query:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGF
        AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGGV LP GGILTPKGLQTLGLSALG+STGF
Subjt:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGF

Query:  ERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIH
        ER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI+NE+ENKFD  +AVKEGCPVYFTGEMIFPWMFDEIH
Subjt:  ERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIH

Query:  ALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        AL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  ALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC111440690 | e-value: 0.0 | % identity: 98.4
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFD  +AVKEGCPVYFTGEMIFPWMF
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMF

Query:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
Subjt:  DEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

A0A6J1II94 uncharacterized protein LOC111473341 isoform X1 | e-value: 0.0 | % identity: 97.6
Query:  MKALLFHSFPS-PARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFPS PARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFPS-PARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWM
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFD  +AVKEGCPVYFTGEMIFPWM
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWM

Query:  FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMG LNGKKPLF
Subjt:  FDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF

SwissProt top hits (columns: e-value, % identity, alignment)
A0A1L9WUM2 Proline iminopeptidase aneH | e-value: 3.9e-66 | % identity: 34.93
Query:  RLRDHYFSVPLNYSLDHSSPKISVYAREVVSV-GKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEE-FRVILMDQRGTGLSTPLSPSSMSQFQTAE
        R  +  F VPLN+S       + ++AR +  V G ++  +P+++YLQGGPG  C  P E + W+    E+ +RV+ +D+RGTG S+P++  +++Q    +
Subjt:  RLRDHYFSVPLNYSLDHSSPKISVYAREVVSV-GKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEE-FRVILMDQRGTGLSTPLSPSSMSQFQTAE

Query:  DLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP
          A+ LK FRADNIV D E +R  L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE YYK+YP
Subjt:  DLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP

Query:  QDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDS----NPLYGLMHES
        +DV  V  ++KYL+EN   +     G LTP+  Q LG+  LG   G + +H + +R  + + +        + FL A +  L  +S    N +Y L+ E 
Subjt:  QDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDS----NPLYGLMHES

Query:  IYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKL
        +YCQG +  W A + R + + +F +    +    ++FTGE IF  MF+    LK  K  A +LA   DW  LY+ A L  N+VPV  A   EDMYV++ L
Subjt:  IYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKL

Query:  AMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL
           TAS++  ++  V N + H  +     +V+  L  L
Subjt:  AMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL

P46547 Proline iminopeptidase | e-value: 4.0e-103 | % identity: 43.09
Query:  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQF
        Y +  +    H+F+VPL++        I+++ R +    + +  +P+L+YLQGGPGF  PRP+   GWI++A +EFRV+L+DQRGTG STP+    ++  
Subjt:  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQF

Query:  QTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP
           +  A+YL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA ++++  +N  ++ R+P
Subjt:  QTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP

Query:  QDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ
            I + +  +L+ +   V LP G  LT + LQ  GL  LG+S  FE ++YL E  +         ++++  FL  +      ++NP++ ++HE IYC+
Subjt:  QDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ

Query:  GASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMET
        GA+S W+A+R+R E         A  +G    FTGEMIFPWMF++   L P K+AA++LAEK DW PLYD   L  NKVPVA AVY EDMYV F  + ET
Subjt:  GASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMET

Query:  ASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL
           ++  R W+TNE+ H+GLR  G Q+LD L+ L
Subjt:  ASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL

P93732 Proline iminopeptidase | e-value: 9.6e-04 | % identity: 30.83
Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +V+L GGPG      T  S       E +R++L DQRG G ST   P +  +  T  DL            VND E +R  L +P+   W + G S+G  
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT2G14260.1 proline iminopeptidase | e-value: 6.8e-05 | % identity: 30.83
Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +V+L GGPG      T  S       E +R++L DQRG G ST   P +  +  T  DL            VND E +R  L +P+   W + G S+G  
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT2G14260.2 proline iminopeptidase | e-value: 6.8e-05 | % identity: 30.83
Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +V+L GGPG      T  S       E +R++L DQRG G ST   P +  +  T  DL            VND E +R  L +P+   W + G S+G  
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT3G61540.1 alpha/beta-Hydrolases superfamily protein | e-value: 1.7e-213 | % identity: 76.04
Query:  GASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGL
        G S  EH  G W+SVPELRLRDH F VPL+YS   SSPKI+V+ARE+V+VGKEEQ MPYL+YLQGGPGFE PRP+EASGWIQ+ACEEFRV+L+DQRGTGL
Subjt:  GASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGL

Query:  STPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKI
        STPL+ SSM QF++A++LA+YL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD VY A FE++
Subjt:  STPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKI

Query:  IIQNEKYYKRYPQDVKIVHEVVKYL-EENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSN
          QNEKYYKR+PQD++IV E+V YL E  GGGVPLP GGILTPKGLQTLGLS LGSSTGFER+HY+ ERVWDPI+V GAPK IS FFLNA   W S D+N
Subjt:  IIQNEKYYKRYPQDVKIVHEVVKYL-EENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSN

Query:  PLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYY
        PLY L+HE+IYC+GASS WSA R+R++ E KFD ++AVKE  PV FTGEMIFPWMFDEIHALKPFK AA++LA+KEDWPPLYD+  L+NNKVPVAAAVYY
Subjt:  PLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYY

Query:  EDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
        EDMYVNFKL  ETAS I+GIRLWVTNEFMHSGLRD G Q++DHL+G++NGKKPLF
Subjt:  EDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF


Sequences
CDS sequence
ATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTCCGTTCATTG
GCAGTCATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTC
TCTGTGCCTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTT
GTATACTTACAAGGTGGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTGTTATATTGATGGACCAG
CGGGGAACAGGATTATCGACTCCTTTGTCTCCATCGTCCATGTCACAATTCCAAACTGCAGAGGACTTGGCCAACTACTTGAAACATTTTCGAGCTGATAACATA
GTGAATGATGCTGAATTCATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGT
TTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATT
ATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGGTTCCTCTTCCC
TGTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAGCGCATGCACTATTTGTTTGAGAGAGTATGG
GATCCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTCAATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATG
CACGAGTCGATATATTGCCAGGGCGCCTCGTCTCGTTGGTCTGCTCAAAGAATAAGGAATGAACTGGAGAACAAATTCGATGTAATTAGGGCTGTAAAAGAAGGA
TGTCCTGTGTATTTCACTGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAG
GATTGGCCTCCCCTATATGACATTGCTGCTCTTAAAAATAACAAGGTCCCGGTCGCAGCAGCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATG
GAGACAGCTTCCCAAATAGCAGGAATCAGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGA
TTGTTAAATGGAAAGAAGCCTTTATTCTGA
mRNA sequence
ATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTCCGTTCATTG
GCAGTCATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTC
TCTGTGCCTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTT
GTATACTTACAAGGTGGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTGTTATATTGATGGACCAG
CGGGGAACAGGATTATCGACTCCTTTGTCTCCATCGTCCATGTCACAATTCCAAACTGCAGAGGACTTGGCCAACTACTTGAAACATTTTCGAGCTGATAACATA
GTGAATGATGCTGAATTCATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGT
TTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATT
ATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGGTTCCTCTTCCC
TGTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAGCGCATGCACTATTTGTTTGAGAGAGTATGG
GATCCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTCAATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATG
CACGAGTCGATATATTGCCAGGGCGCCTCGTCTCGTTGGTCTGCTCAAAGAATAAGGAATGAACTGGAGAACAAATTCGATGTAATTAGGGCTGTAAAAGAAGGA
TGTCCTGTGTATTTCACTGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAG
GATTGGCCTCCCCTATATGACATTGCTGCTCTTAAAAATAACAAGGTCCCGGTCGCAGCAGCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATG
GAGACAGCTTCCCAAATAGCAGGAATCAGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGA
TTGTTAAATGGAAAGAAGCCTTTATTCTGA
Protein sequence
MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYL
VYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLS
FAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVW
DPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKE
DWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
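
The CDS and protein sequences above can be cross-checked by translating codons and comparing the result with the listed protein. The sketch below is a minimal illustration under stated assumptions: `translate` and `PARTIAL_CODON_TABLE` are hypothetical names, and the table covers only the codons of the final CDS line (`TTGTTAAATGG...TTCTGA`), taken from the standard genetic code. Checking the whole record would need a full codon table.

```python
# Partial codon table (standard genetic code), limited to the codons that
# occur in the final line of the CDS listed in this record. "*" marks a stop.
PARTIAL_CODON_TABLE = {
    "TTG": "L", "TTA": "L", "AAT": "N", "GGA": "G",
    "AAG": "K", "CCT": "P", "TTC": "F", "TGA": "*",
}


def translate(cds: str, table: dict = PARTIAL_CODON_TABLE) -> str:
    """Translate an in-frame nucleotide string codon by codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # A codon missing from the partial table raises KeyError by design.
    return "".join(table[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_tail = "TTGTTAAATGGAAAGAAGCCTTTATTCTGA"  # final line of the CDS above
protein_tail = "LLNGKKPLF"                   # final residues of the protein above

translated = translate(cds_tail)
# The CDS carries a stop codon that is not part of the protein sequence.
print(translated, translated == protein_tail + "*")
```

The same comparison explains why a CDS is three nucleotides longer than three times its protein length: the terminal stop codon (`TGA` here) is counted in the CDS but has no residue in the protein.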