CuGenDBv2

Carg24636 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg24636
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: AB hydrolase-1 domain-containing protein
Genome location: Carg_Chr06:6923252..6927658
RNA-Seq Expression: Carg24636
Synteny: Carg24636
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0008233 - peptidase activity (molecular function)
InterPro domains:
    IPR000073 - Alpha/beta hydrolase fold-1
    IPR029058 - Alpha/Beta hydrolase fold
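
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be split and the span length computed as follows. The helper name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Carg_Chr06:6923252..6927658")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene region covers 4407 bp on Carg_Chr06.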


Homology
GenBank top hits | e value | % identity
KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia] | 3.1e-289 | 95.53
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEG
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRIRNELENKFDV +AVKEG
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEG

Query:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
        CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
Subjt:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

KAG7028582.1 Proline iminopeptidase [Cucurbita argyrosperma subsp. argyrosperma] | 2.0e-304 | 100
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAV
        PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAV
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAV

Query:  TYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFER
        TYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFER
Subjt:  TYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFER

Query:  MHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFT
        MHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFT
Subjt:  MHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFT

Query:  GEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL
        GEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL
Subjt:  GEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL

Query:  LNGKKPLF
        LNGKKPLF
Subjt:  LNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata] | 1.3e-287 | 94.75
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEG
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRI NELENKFD TKAVKEG
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEG

Query:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
        CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
Subjt:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

XP_022974658.1 uncharacterized protein LOC111473341 isoform X1 [Cucurbita maxima] | 3.9e-284 | 93.98
Query:  MKALLFHSFP-SPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFP-SPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKE
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRIRNELENKFD TKAVKE
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKE

Query:  GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQV
        GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQV
Subjt:  GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQV

Query:  LDHLMGLLNGKKPLF
        LDHLMG LNGKKPLF
Subjt:  LDHLMGLLNGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo] | 2.4e-286 | 94.36
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSL VMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKE+ 
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDV+IVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEG
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRI NELENKFD TKAVKEG
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEG

Query:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
        CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
Subjt:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

TrEMBL top hits | e value | % identity
A0A0A0L423 AB hydrolase-1 domain-containing protein | 1.1e-247 | 82.23
Query:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPM
        L FHS P     LIPL   LSA HCR SVR  A MA       ASPP H +GTWYSVPELRLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVGKE+QPM
Subjt:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPM

Query:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYL++LQGGPGFEC RPTEASGWIQKACEEFR      RGTGLSTPL+PSSMSQFQ+++DLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL ENGGGV LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCP
        GFER+HYLFERVWDPI+V G+PKRIS+FFLNAI  WLSLDSNPLY L+HE+IYCQ               GASSRWSAQRI+NE+ENKFD  KAVKEGC 
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCP

Query:  VYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDH
        VYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+TASQIAGIRLWVTNEFMHSGLRD GPQVLDH
Subjt:  VYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDH

Query:  LMGLLNGKKPLF
        LMGLLNGKKPLF
Subjt:  LMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase | 6.1e-251 | 83.4
Query:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPM
        L FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPM
Subjt:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPM

Query:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYL+YLQGGPGFEC RP+EASGWIQKACEEFR      RGTGLSTPL+PSSMSQF++AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGGV LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCP
        GFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQ               GASSRWSAQRI+NE+ENKFD  KAVKEGCP
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCP

Query:  VYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDH
        VYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDH
Subjt:  VYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDH

Query:  LMGLLNGKKPLF
        LMGLLNGKKPLF
Subjt:  LMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase | 6.1e-251 | 83.4
Query:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPM
        L FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPM
Subjt:  LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPM

Query:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYL+YLQGGPGFEC RP+EASGWIQKACEEFR      RGTGLSTPL+PSSMSQF++AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGGV LP GGILTPKGLQTLGLSALG+ST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCP
        GFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQ               GASSRWSAQRI+NE+ENKFD  KAVKEGCP
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCP

Query:  VYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDH
        VYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDH
Subjt:  VYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDH

Query:  LMGLLNGKKPLF
        LMGLLNGKKPLF
Subjt:  LMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC111440690 | 6.2e-288 | 94.75
Query:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
        MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ
Subjt:  MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQ

Query:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
        PMPYL+YLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLPCGGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGS

Query:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEG
        STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRI NELENKFD TKAVKEG
Subjt:  STGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEG

Query:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
        CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL
Subjt:  CPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVL

Query:  DHLMGLLNGKKPLF
        DHLMGLLNGKKPLF
Subjt:  DHLMGLLNGKKPLF

A0A6J1II94 uncharacterized protein LOC111473341 isoform X1 | 1.9e-284 | 93.98
Query:  MKALLFHSFP-SPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFP-SPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG
Subjt:  YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALG

Query:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKE
        SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRIRNELENKFD TKAVKE
Subjt:  SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKE

Query:  GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQV
        GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQV
Subjt:  GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQV

Query:  LDHLMGLLNGKKPLF
        LDHLMG LNGKKPLF
Subjt:  LDHLMGLLNGKKPLF

SwissProt top hits | e value | % identity
A0A1L9WUM2 Proline iminopeptidase aneH | 4.8e-59 | 33.11
Query:  RLRDHYFSVPLNYSLDHSSPKISVYAREVVSV-GKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEE-------FRRGTGLSTPLSPSSMSQFQTAE
        R  +  F VPLN+S       + ++AR +  V G ++  +P+++YLQGGPG  C  P E + W+    E+         RGTG S+P++  +++Q    +
Subjt:  RLRDHYFSVPLNYSLDHSSPKISVYAREVVSV-GKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEE-------FRRGTGLSTPLSPSSMSQFQTAE

Query:  DLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP
          A+ LK FRADNIV D E +R  L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE YYK+YP
Subjt:  DLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP

Query:  QDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDS----NPLYGLMHES
        +DV  V  ++KYL+EN   +     G LTP+  Q LG+  LG   G + +H + +R  + + +        + FL A +  L  +S    N +Y L+ E 
Subjt:  QDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDS----NPLYGLMHES

Query:  IYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPV
        +YCQ               G +  W A + R + + +F +    +    ++FTGE IF  MF+    LK  K  A +LA   DW  LY+ A L  N+VPV
Subjt:  IYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPV

Query:  AAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL
          A   EDMYV++ L   TAS++  ++  V N + H  +     +V+  L  L
Subjt:  AAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL

P46547 Proline iminopeptidase | 5.4e-95 | 40.76
Query:  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQF
        Y +  +    H+F+VPL++        I+++ R +    + +  +P+L+YLQGGPGF  PRP+   GWI++A +EFR      RGTG STP+    ++  
Subjt:  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQF

Query:  QTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP
           +  A+YL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA ++++  +N  ++ R+P
Subjt:  QTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYP

Query:  QDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ
            I + +  +L+ +   V LP G  LT + LQ  GL  LG+S  FE ++YL E  +         ++++  FL  +      ++NP++ ++HE IYC+
Subjt:  QDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ

Query:  SELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAV
                       GA+S W+A+R+R E         A  +G    FTGEMIFPWMF++   L P K+AA++LAEK DW PLYD   L  NKVPVA AV
Subjt:  SELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAV

Query:  YYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL
        Y EDMYV F  + ET   ++  R W+TNE+ H+GLR  G Q+LD L+ L
Subjt:  YYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGL

Arabidopsis top hits | e value | % identity
AT3G61540.1 alpha/beta-Hydrolases superfamily protein | 2.9e-205 | 72.98
Query:  GASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGL
        G S  EH  G W+SVPELRLRDH F VPL+YS   SSPKI+V+ARE+V+VGKEEQ MPYL+YLQGGPGFE PRP+EASGWIQ+ACEEFR      RGTGL
Subjt:  GASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGL

Query:  STPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKI
        STPL+ SSM QF++A++LA+YL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD VY A FE++
Subjt:  STPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKI

Query:  IIQNEKYYKRYPQDVKIVHEVVKYL-EENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSN
          QNEKYYKR+PQD++IV E+V YL E  GGGVPLP GGILTPKGLQTLGLS LGSSTGFER+HY+ ERVWDPI+V GAPK IS FFLNA   W S D+N
Subjt:  IIQNEKYYKRYPQDVKIVHEVVKYL-EENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSN

Query:  PLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIA
        PLY L+HE+IYC+               GASS WSA R+R++ E KFD  KAVKE  PV FTGEMIFPWMFDEIHALKPFK AA++LA+KEDWPPLYD+ 
Subjt:  PLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIA

Query:  ALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
         L+NNKVPVAAAVYYEDMYVNFKL  ETAS I+GIRLWVTNEFMHSGLRD G Q++DHL+G++NGKKPLF
Subjt:  ALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
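
In the alignment blocks above, the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch of how an identity figure relates to that line, the snippet below counts identical columns for a single block. It uses the final block of the Cucurbita maxima GenBank hit as input. The function name `identity_from_midline` is hypothetical, and the result covers only that one block, so it is not expected to reproduce the whole-alignment % identity reported for the hit.

```python
def identity_from_midline(query: str, midline: str) -> tuple[int, int, float]:
    """Count identical columns in one alignment block.

    Letters on the midline mark identical residues; '+' and spaces do not.
    The midline is padded to the query length because trailing blanks are
    often lost when a page is copied as text.
    """
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    aligned = len(query)
    return identical, aligned, 100.0 * identical / aligned


# Final block of the XP_022974658.1 (Cucurbita maxima) alignment.
query = "LDHLMGLLNGKKPLF"
midline = "LDHLMG LNGKKPLF"
identical, aligned, percent = identity_from_midline(query, midline)
print(f"{identical}/{aligned} identical ({percent:.2f}%)")
```

To approximate the reported value for a hit, the same count would have to be accumulated over every block of that alignment, gap columns included.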


Sequences
CDS sequence
ATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTCCGTTCATTGGCAGT
CATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTCTCTGTGCCTC
TCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTGTATACTTACAAGGT
GGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTCGGGGAACAGGATTATCGACTCCTTTGTCTCCATCGTC
CATGTCACAATTCCAAACTGCAGAGGACTTGGCCAACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGCTGAATTCATTAGGACTCGTCTTGTTCCTGATG
CTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCT
CCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGATGTCAAAATCGT
CCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGGTTCCTCTTCCCTGTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGGCTTTCTGCTTTAGGAT
CTAGTACAGGTTTCGAGCGCATGCACTATTTGTTTGAGAGAGTATGGGATCCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTCAATGCTATCAGT
GGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTCGATATATTGCCAGAGTGAACTTTTTGCCACTTCCTGTTCCTGTTCCTGTTCCCCAAAGGG
CGCCTCGTCTCGTTGGTCTGCTCAAAGAATAAGGAATGAACTGGAGAACAAATTCGATGTAACTAAGGCTGTAAAAGAAGGATGTCCTGTGTATTTCACTGGAGAGATGA
TCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGACATTGCTGCTCTT
AAAAATAACAAGGTCCCGGTCGCAGCAGCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATAGCAGGAATCAGGCTGTGGGT
TACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
mRNA sequence
CGCCATTTTTGATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTCCGT
TCATTGGCAGTCATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTT
CTCTGTGCCTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTGTAT
ACTTACAAGGTGGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTCGGGGAACAGGATTATCGACTCCTTTG
TCTCCATCGTCCATGTCACAATTCCAAACTGCAGAGGACTTGGCCAACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGCTGAATTCATTAGGACTCGTCT
TGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTG
GAGGAATCCCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGAT
GTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGGTTCCTCTTCCCTGTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGGCTTTC
TGCTTTAGGATCTAGTACAGGTTTCGAGCGCATGCACTATTTGTTTGAGAGAGTATGGGATCCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTCA
ATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTCGATATATTGCCAGAGTGAACTTTTTGCCACTTCCTGTTCCTGTTCCTGT
TCCCCAAAGGGCGCCTCGTCTCGTTGGTCTGCTCAAAGAATAAGGAATGAACTGGAGAACAAATTCGATGTAACTAAGGCTGTAAAAGAAGGATGTCCTGTGTATTTCAC
TGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGACA
TTGCTGCTCTTAAAAATAACAAGGTCCCGGTCGCAGCAGCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATAGCAGGAATC
AGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGAGT
TTTTTTCTCAAAACTTTGTTGCTTTTCTCCATAATTATTGGATGCAGTCTTTGCCATGACCTTTTCATTTCCTCCTCAATAAGCTTTGTTTCCTTCCATCTCAATAATTG
CACCGCAGGGTTTTAGTTGCGTATTTGCATGTGTGCGTGTGCGTGGTGTGTTGCGTGTGAAAGAATTGTCTAGGGAATGATAATGATATTAAAATTATTTTT
Protein sequence
MKALLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQG
GPGFECPRPTEASGWIQKACEEFRRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIP
PIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIS
GWLSLDSNPLYGLMHESIYCQSELFATSCSCSCSPKGASSRWSAQRIRNELENKFDVTKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAAL
KNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF
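
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene (the record does not name a translation table). For brevity it translates only the first 21 and last 15 nucleotides of the CDS listed above. The `translate` helper is illustrative and stands in for what a library such as Biopython would normally provide.

```python
# Standard genetic code, indexed with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


first = translate("ATGAAGGCACTCCTCTTTCAC")  # start of the CDS
last = translate("AAGCCTTTATTCTGA")         # end of the CDS, including the stop codon
print(first, last)
```

The two fragments translate to `MKALLFH` and `KPLF*`, which agree with the start and end of the protein sequence listed above.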