; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh06G010210 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh06G010210
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationCma_Chr06:6898998..6903901
RNA-Seq ExpressionCmaCh06G010210
SyntenyCmaCh06G010210
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]3.4e-28092.83Show/hide
Query:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSV                   YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
Subjt:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP

Query:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK
        CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFD  +
Subjt:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK

Query:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
        AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
Subjt:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG

Query:  GPQVLDHLMGFLNGKK
        GPQVLDHLMG LNGKK
Subjt:  GPQVLDHLMGFLNGKK

KAG7028582.1 Proline iminopeptidase [Cucurbita argyrosperma subsp. argyrosperma]3.4e-28091.62Show/hide
Query:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRRGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCA
        QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRRGTGLSTPLSPSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCA
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRRGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCA

Query:  ITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILT
        +TYLSFAP+GLKQVLITGGIPPIGNGCTADSV                   YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILT
Subjt:  ITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILT

Query:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ---------------GASSRWSAQRIRNE
        PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ               GASSRWSAQRIRNE
Subjt:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ---------------GASSRWSAQRIRNE

Query:  LENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNE
        LENKFD TKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNE
Subjt:  LENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNE

Query:  FMHSGLRDGGPQVLDHLMGFLNGKK
        FMHSGLRDGGPQVLDHLMG LNGKK
Subjt:  FMHSGLRDGGPQVLDHLMGFLNGKK

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]2.6e-28092.83Show/hide
Query:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYL+YLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSV                   YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLP
Subjt:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP

Query:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK
        CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFDATK
Subjt:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK

Query:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
        AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
Subjt:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG

Query:  GPQVLDHLMGFLNGKK
        GPQVLDHLMG LNGKK
Subjt:  GPQVLDHLMGFLNGKK

XP_022974658.1 uncharacterized protein LOC111473341 isoform X1 [Cucurbita maxima]7.6e-28895.16Show/hide
Query:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
        YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSV                   YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
Subjt:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP

Query:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK
        CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK
Subjt:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK

Query:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
        AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
Subjt:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG

Query:  GPQVLDHLMGFLNGKK
        GPQVLDHLMGFLNGKK
Subjt:  GPQVLDHLMGFLNGKK

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo]5.9e-28092.83Show/hide
Query:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSL VMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKE+
Subjt:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
         PMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
        YGGFCA+TYLSFAPKGLKQVLITGGIPPIGNGCTADSV                   YRACFEKIIIQNEKYYKRYPQDV+IVHEVVKYLEENGGGVPLP
Subjt:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP

Query:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK
        CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFDATK
Subjt:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK

Query:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
        AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
Subjt:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG

Query:  GPQVLDHLMGFLNGKK
        GPQVLDHLMG LNGKK
Subjt:  GPQVLDHLMGFLNGKK

TrEMBL top hitse value%identityAlignment
A0A0A0L423 AB hydrolase-1 domain-containing protein4.3e-24481.32Show/hide
Query:  LLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQP
        L FHS P      LIPL   LSA HCR SVR  A MAG +    ASPP H +GTWYSVPELRLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVGKE+QP
Subjt:  LLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQP

Query:  MPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG
        MPYL++LQGGPGFEC RPTEASGWIQKACEEFR      RGTGLSTPL+PSSMSQFQS++DLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG
Subjt:  MPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG

Query:  GFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCG
        GFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSV                   YRACFEK+IIQNEKYYKRYPQD++IV EVVKYL ENGGGV LP G
Subjt:  GFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCG

Query:  GILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAV
        GILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+V G+PKRIS+FFLNAI  WLSLDSNPLY L+HE+IYCQGASSRWSAQRI+NE+ENKFDA KAV
Subjt:  GILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAV

Query:  KEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP
        KEGC VYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+TASQIAGIRLWVTNEFMHSGLRD GP
Subjt:  KEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP

Query:  QVLDHLMGFLNGKK
        QVLDHLMG LNGKK
Subjt:  QVLDHLMGFLNGKK

A0A1S3AUX5 proline iminopeptidase2.4e-24782.49Show/hide
Query:  LLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQP
        L FHS P      LIPL   LSA HCR SVR  A MAG +     SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QP
Subjt:  LLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQP

Query:  MPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG
        MPYL+YLQGGPGFEC RP+EASGWIQKACEEFR      RGTGLSTPL+PSSMSQF+SAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG
Subjt:  MPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG

Query:  GFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCG
        GFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSV                   YRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGGV LP G
Subjt:  GFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCG

Query:  GILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAV
        GILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI+NE+ENKFDA KAV
Subjt:  GILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAV

Query:  KEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP
        KEGCPVYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GP
Subjt:  KEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP

Query:  QVLDHLMGFLNGKK
        QVLDHLMG LNGKK
Subjt:  QVLDHLMGFLNGKK

A0A5D3D1Y5 Proline iminopeptidase2.4e-24782.49Show/hide
Query:  LLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQP
        L FHS P      LIPL   LSA HCR SVR  A MAG +     SPP H AGTWYSVPELRLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QP
Subjt:  LLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQP

Query:  MPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG
        MPYL+YLQGGPGFEC RP+EASGWIQKACEEFR      RGTGLSTPL+PSSMSQF+SAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG
Subjt:  MPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG

Query:  GFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCG
        GFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSV                   YRACFEK+IIQNEKYYKRYPQD++IV EVVKYL +NGGGV LP G
Subjt:  GFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCG

Query:  GILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAV
        GILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI+NE+ENKFDA KAV
Subjt:  GILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAV

Query:  KEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP
        KEGCPVYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GP
Subjt:  KEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP

Query:  QVLDHLMGFLNGKK
        QVLDHLMG LNGKK
Subjt:  QVLDHLMGFLNGKK

A0A6J1F4P5 uncharacterized protein LOC1114406901.3e-28092.83Show/hide
Query:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFP SPARSLIPLTRLLSAVHCRSSVRSLAVMA T PSN ASPPEH+AGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYL+YLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
        YGGFCA+TYLSFAP+GLKQVLITGGIPPIGNGCTADSV                   YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG+PLP
Subjt:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP

Query:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK
        CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRI NELENKFDATK
Subjt:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK

Query:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
        AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
Subjt:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG

Query:  GPQVLDHLMGFLNGKK
        GPQVLDHLMG LNGKK
Subjt:  GPQVLDHLMGFLNGKK

A0A6J1II94 uncharacterized protein LOC111473341 isoform X13.7e-28895.16Show/hide
Query:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
        MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE
Subjt:  MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEE

Query:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
        QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR      RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS
Subjt:  QPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQS

Query:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
        YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSV                   YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP
Subjt:  YGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLP

Query:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK
        CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK
Subjt:  CGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATK

Query:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
        AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG
Subjt:  AVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDG

Query:  GPQVLDHLMGFLNGKK
        GPQVLDHLMGFLNGKK
Subjt:  GPQVLDHLMGFLNGKK

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH1.4e-5833.19Show/hide
Query:  RLRDHYFSVPLNYSLDHSSPKISVYAREVVSV-GKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEE-------FRRGTGLSTPLSPSSMSQFQSAE
        R  +  F VPLN+S       + ++AR +  V G ++  +P+++YLQGGPG  C  P E + W+    E+         RGTG S+P++  +++Q    +
Subjt:  RLRDHYFSVPLNYSLDHSSPKISVYAREVVSV-GKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEE-------FRRGTGLSTPLSPSSMSQFQSAE

Query:  DLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMY
          A+ LK FRADNIV D E +R  L  DA    + W+++  S+GGFCAI+Y+S  P  L +V I GG  P+ N                 +  +   R++
Subjt:  DLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMY

Query:  RACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGW
             +    NE YYK+YP+DV  V  ++KYL+EN   +     G LTP+  Q LG+  LG   G + +H + +R  + + +        + FL A +  
Subjt:  RACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGW

Query:  LSLDS----NPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAVKE-GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKN
        L  +S    N +Y L+ E +YCQG +  W A + R     K D   ++ E    ++FTGE IF  MF+    LK  K  A +LA   DW  LY+ A L  
Subjt:  LSLDS----NPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAVKE-GCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKN

Query:  NKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHL
        N+VPV  A   EDMYV++ L   TAS++  ++  V N + H  +     +V+  L
Subjt:  NKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHL

P46547 Proline iminopeptidase1.0e-9340.35Show/hide
Query:  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQF
        Y +  +    H+F+VPL++        I+++ R +    + +  +P+L+YLQGGPGF  PRP+   GWI++A +EFR      RGTG STP+    ++  
Subjt:  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQF

Query:  QSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMY
           +  A+YL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD V                   Y
Subjt:  QSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMY

Query:  RACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGW
        RA ++++  +N  ++ R+P    I + +  +L+ +   V LP G  LT + LQ  GL  LG+S  FE ++YL E  +         ++++  FL  +   
Subjt:  RACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGW

Query:  LSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPV
           ++NP++ ++HE IYC+GA+S W+A+R+R E         A  +G    FTGEMIFPWMF++   L P K+AA++LAEK DW PLYD   L  NKVPV
Subjt:  LSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPV

Query:  AAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLM
        A AVY EDMYV F  + ET   ++  R W+TNE+ H+GLR  G Q+LD L+
Subjt:  AAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLM

Arabidopsis top hitse value%identityAlignment
AT3G61540.1 alpha/beta-Hydrolases superfamily protein2.0e-20168.29Show/hide
Query:  SFPSSPARSLIPLTRLLSAVHCRSSVRSLAVM--AGTIPSNEA--SPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPM
        S  ++P+R    +  L+     R   R +  M  AG++  + A  S  EH  G W+SVPELRLRDH F VPL+YS   SSPKI+V+ARE+V+VGKEEQ M
Subjt:  SFPSSPARSLIPLTRLLSAVHCRSSVRSLAVM--AGTIPSNEA--SPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPM

Query:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYL+YLQGGPGFE PRP+EASGWIQ+ACEEFR      RGTGLSTPL+ SSM QF+SA++LA+YL HFRADNIV DAEFIR RLVP A PWTILGQS+GG
Subjt:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFR------RGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYL-EENGGGVPLPCG
        FCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD V                   Y A FE++  QNEKYYKR+PQD++IV E+V YL E  GGGVPLP G
Subjt:  FCAITYLSFAPKGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYL-EENGGGVPLPCG

Query:  GILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAV
        GILTPKGLQTLGLS LGSSTGFER+HY+ ERVWDPI+V GAPK IS FFLNA   W S D+NPLY L+HE+IYC+GASS WSA R+R++ E KFDA KAV
Subjt:  GILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAV

Query:  KEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP
        KE  PV FTGEMIFPWMFDEIHALKPFK AA++LA+KEDWPPLYD+  L+NNKVPVAAAVYYEDMYVNFKL  ETAS I+GIRLWVTNEFMHSGLRD G 
Subjt:  KEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP

Query:  QVLDHLMGFLNGKK
        Q++DHL+G +NGKK
Subjt:  QVLDHLMGFLNGKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTTCGTTCATTGGC
AGTCATGGCCGGCACCATTCCTTCTAATGAAGCATCTCCGCCGGAGCACTCAGCTGGCACGTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTCTCTGTGC
CTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTGTATACTTACAA
GGTGGACCTGGATTTGAGTGTCCGCGACCTACTGAAGCAAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTCGGGGAACAGGATTATCGACTCCTTTGTCTCCATC
GTCCATGTCGCAATTCCAAAGTGCTGAGGACTTGGCCAACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGCTGAATTCATTAGGACTCGTCTTGTTCCTG
ATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAATTACGTATTTGAGTTTTGCACCAAAAGGATTGAAACAAGTCCTCATAACTGGAGGAATC
CCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATACTACAAGAGGATGTATAGAGCATGCTTTGA
AAAGATTATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGGTTCCTCTTC
CCTGTGGTGGTATCCTGACTCCTAAAGGGCTGCAAACTCTTGGGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAACGCATGCACTATTTGTTTGAGAGAGTATGGGAT
CCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTCAATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTC
AATATATTGCCAGGGCGCCTCGTCTCGCTGGTCTGCTCAAAGAATAAGGAATGAACTGGAGAACAAATTCGATGCAACTAAGGCTGTAAAAGAAGGATGTCCTGTGTATT
TCACTGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATAT
GACATTGCTGCTCTTAAAAATAACAAGGTCCCGGTCGCAGCAGCAGTTTACTATGAAGATATGTACGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATAGCAGG
AATCAGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTCTTAAATGGAAAGAAGTTCTGGTCTG
CGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTTCGTTCATTGGC
AGTCATGGCCGGCACCATTCCTTCTAATGAAGCATCTCCGCCGGAGCACTCAGCTGGCACGTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTCTCTGTGC
CTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTGTATACTTACAA
GGTGGACCTGGATTTGAGTGTCCGCGACCTACTGAAGCAAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTCGGGGAACAGGATTATCGACTCCTTTGTCTCCATC
GTCCATGTCGCAATTCCAAAGTGCTGAGGACTTGGCCAACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGCTGAATTCATTAGGACTCGTCTTGTTCCTG
ATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAATTACGTATTTGAGTTTTGCACCAAAAGGATTGAAACAAGTCCTCATAACTGGAGGAATC
CCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATACTACAAGAGGATGTATAGAGCATGCTTTGA
AAAGATTATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGGTTCCTCTTC
CCTGTGGTGGTATCCTGACTCCTAAAGGGCTGCAAACTCTTGGGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAACGCATGCACTATTTGTTTGAGAGAGTATGGGAT
CCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTCAATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTC
AATATATTGCCAGGGCGCCTCGTCTCGCTGGTCTGCTCAAAGAATAAGGAATGAACTGGAGAACAAATTCGATGCAACTAAGGCTGTAAAAGAAGGATGTCCTGTGTATT
TCACTGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATAT
GACATTGCTGCTCTTAAAAATAACAAGGTCCCGGTCGCAGCAGCAGTTTACTATGAAGATATGTACGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATAGCAGG
AATCAGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTCTTAAATGGAAAGAAGTTCTGGTCTG
CGTGA
Protein sequenceShow/hide protein sequence
MKALLFHSFPSSPARSLIPLTRLLSAVHCRSSVRSLAVMAGTIPSNEASPPEHSAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLVYLQ
GGPGFECPRPTEASGWIQKACEEFRRGTGLSTPLSPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAITYLSFAPKGLKQVLITGGI
PPIGNGCTADSVYRACFEKIIIQNEKYYKRMYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWD
PIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIRNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLY
DIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGFLNGKKFWSA