; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G021270 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G021270
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationCmo_Chr14:15378755..15387452
RNA-Seq ExpressionCmoCh14G021270
SyntenyCmoCh14G021270
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582635.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]7.1e-29389.46Show/hide
Query:  RRAAPPSSTWLPAFPDFPQILVSSVNSSYTTATKTECIKFAGNPALRIQSILNLVLPARESTCPMTGSQVHPGSIPSTTPNVSSVTPLPNLLSAAHCRSP
        R  +PP +T          +  S VNSSYTTATKTECIKFAGNPALRI                    QVHPGSI STTPNVSS+     LLSAAHCRSP
Subjt:  RRAAPPSSTWLPAFPDFPQILVSSVNSSYTTATKTECIKFAGNPALRIQSILNLVLPARESTCPMTGSQVHPGSIPSTTPNVSSVTPLPNLLSAAHCRSP

Query:  VPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKAC
        V SLAVMAGTNPPNG  PHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQ SSPKISVFAREAVSVGKEEQPM YLLYLQGGPGFECPRPT ASGWIQKAC
Subjt:  VPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKAC

Query:  DEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGN
        +EFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRA   VNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGN
Subjt:  DEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGN

Query:  GCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFF
        GCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGG+LTPRGLQTLGLSALGFS+GFERLHYLFERAWDPIIVPGAPKRISFF
Subjt:  GCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFF

Query:  FLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAA
        FLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEM FPWMFDEIHALKPFKDVANILANKEDWPPLYDIAA
Subjt:  FLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAA

Query:  LKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        LKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGK+PLF
Subjt:  LKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]1.5e-24784.98Show/hide
Query:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG
        S+ PL  LLSA HCRS V SLAVMA TNP NG SP  H AGTWYSVPELRLRDHYFSVPL+YSLD  SSPKISV+ARE VSVGKEEQPM YL+YLQGGPG
Subjt:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG

Query:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP
        FECPRPTEASGWIQKAC+EFRVILMDQRGTGLSTPL+PSSMSQFQTAEDLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCA+TYLSFAP
Subjt:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP

Query:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER
        QGLKQVLI+GGIPPIGNGC ADSVYRA  E+++IQNEKYYKRYPQDV+ V EVVKYLEENGGGVPLP GGILTP+GLQTLGLSALG STGFER+HYLFER
Subjt:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER

Query:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA
         WDPIIVPGAPKRIS+FFLN I  W+   SNPLY L++ES+YCQGASSRWSA+RIRNE+ +KFD  +AVK+G PVYFTGEM FPWMFDEIHALKPFKD A
Subjt:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA

Query:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        NILA KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKL METASQI G RLW+TNE MHSGLRD GPQV+D LMGLLNGK+PLF
Subjt:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

XP_022924281.1 uncharacterized protein LOC111431808 isoform X1 [Cucurbita moschata]8.1e-28999.59Show/hide
Query:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG
        S+ PLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG
Subjt:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG

Query:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP
        FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP
Subjt:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP

Query:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER
        QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER
Subjt:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER

Query:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA
        AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA
Subjt:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA

Query:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
Subjt:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]3.4e-24784.77Show/hide
Query:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG
        S+ PL  LLSA HCRS V SLAVMA TNP NG SP  H AGTWYSVPELRLRDHYFSVPL+YSLD  SSPKISV+ARE VSVGKEEQPM YLLYLQGGPG
Subjt:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG

Query:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP
        FECPRPTEASGWIQKAC+EFRVILMDQRGTGLSTPL+PSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCA+TYLSFAP
Subjt:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP

Query:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER
        QGLKQVLI+GGIPPIGNGC ADSVYRA  E+++IQNEKYYKRYPQDV+ V EVVKYLEENGGG+PLP GGILTP+GLQTLGLSALG STGFER+HYLFER
Subjt:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER

Query:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA
         WDPIIVPGAPKRIS+FFLN I  W+   SNPLY L++ES+YCQGASSRWSA+RI NE+ +KFDA KAVK+G PVYFTGEM FPWMFDEIHALKPFKD A
Subjt:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA

Query:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        NILA KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKL METASQI G RLW+TNE MHSGLRD GPQV+D LMGLLNGK+PLF
Subjt:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

XP_023526363.1 uncharacterized protein LOC111789877 isoform X2 [Cucurbita pepo subsp. pepo]5.8e-27197.41Show/hide
Query:  MAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKACDEFRVI
        MAGTNPPNGESP+VHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPM YLLYLQGGPGFECPRPTEASGWIQKAC+EFRVI
Subjt:  MAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKACDEFRVI

Query:  LMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGNGCMADS
        LMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPP GNGCMADS
Subjt:  LMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGNGCMADS

Query:  VYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEID
        VYRAGLEQV+IQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER WDPIIVPGAPKRISFFFLNEID
Subjt:  VYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEID

Query:  NWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKV
        NWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKA+K+GFPVYFTGEM FPWMFDEIH LKPFKDVANILANKEDWPPLYDIAALKNNKV
Subjt:  NWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKV

Query:  PVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        PVAAAVYYEDMFVNFKLT+ETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGK+PLF
Subjt:  PVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

TrEMBL top hitse value%identityAlignment
A0A1S3AUX5 proline iminopeptidase1.4e-24685.3Show/hide
Query:  PLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFEC
        PLPN LSAAHCR  V   A MAG   P   SP VHVAGTWYSVPELRLRDH+FSVPL+YSLDQ SS +ISVFARE VSVGKE+QPM YLLYLQGGPGFEC
Subjt:  PLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFEC

Query:  PRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGL
         RP+EASGWIQKAC+EFRVILMDQRGTGLSTPLTPSSMSQF++AEDLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCA+TYLSFAPQGL
Subjt:  PRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGL

Query:  KQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWD
        KQVLI+GGIPPIGNGC ADSVYRA  E+V+IQNEKYYKRYPQD+E V+EVVKYL +NGGGV LPSGGILTP+GLQTLGLSALG STGFERLHYLFER WD
Subjt:  KQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWD

Query:  PIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANIL
        PI+VPGAPKRISFFFLN IDNW+   SNPLY LL+ES+YCQGASSRWSA+RI+NEV +KFDANKAVK+G PVYFTGEM FPWMFDEIHAL+PFKD A+IL
Subjt:  PIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANIL

Query:  ANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        A+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKL METASQI G RLWITNE MHSGLRDAGPQV+D LMGLLNGK+PLF
Subjt:  ANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

A0A5D3D1Y5 Proline iminopeptidase1.4e-24685.3Show/hide
Query:  PLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFEC
        PLPN LSAAHCR  V   A MAG   P   SP VHVAGTWYSVPELRLRDH+FSVPL+YSLDQ SS +ISVFARE VSVGKE+QPM YLLYLQGGPGFEC
Subjt:  PLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFEC

Query:  PRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGL
         RP+EASGWIQKAC+EFRVILMDQRGTGLSTPLTPSSMSQF++AEDLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCA+TYLSFAPQGL
Subjt:  PRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGL

Query:  KQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWD
        KQVLI+GGIPPIGNGC ADSVYRA  E+V+IQNEKYYKRYPQD+E V+EVVKYL +NGGGV LPSGGILTP+GLQTLGLSALG STGFERLHYLFER WD
Subjt:  KQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWD

Query:  PIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANIL
        PI+VPGAPKRISFFFLN IDNW+   SNPLY LL+ES+YCQGASSRWSA+RI+NEV +KFDANKAVK+G PVYFTGEM FPWMFDEIHAL+PFKD A+IL
Subjt:  PIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANIL

Query:  ANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        A+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKL METASQI G RLWITNE MHSGLRDAGPQV+D LMGLLNGK+PLF
Subjt:  ANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

A0A6J1E946 uncharacterized protein LOC111431808 isoform X13.9e-28999.59Show/hide
Query:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG
        S+ PLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG
Subjt:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG

Query:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP
        FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP
Subjt:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP

Query:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER
        QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER
Subjt:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER

Query:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA
        AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA
Subjt:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA

Query:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
Subjt:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

A0A6J1F4P5 uncharacterized protein LOC1114406901.7e-24784.77Show/hide
Query:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG
        S+ PL  LLSA HCRS V SLAVMA TNP NG SP  H AGTWYSVPELRLRDHYFSVPL+YSLD  SSPKISV+ARE VSVGKEEQPM YLLYLQGGPG
Subjt:  SVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPG

Query:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP
        FECPRPTEASGWIQKAC+EFRVILMDQRGTGLSTPL+PSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGFCA+TYLSFAP
Subjt:  FECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAP

Query:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER
        QGLKQVLI+GGIPPIGNGC ADSVYRA  E+++IQNEKYYKRYPQDV+ V EVVKYLEENGGG+PLP GGILTP+GLQTLGLSALG STGFER+HYLFER
Subjt:  QGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFER

Query:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA
         WDPIIVPGAPKRIS+FFLN I  W+   SNPLY L++ES+YCQGASSRWSA+RI NE+ +KFDA KAVK+G PVYFTGEM FPWMFDEIHALKPFKD A
Subjt:  AWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVA

Query:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        NILA KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKL METASQI G RLW+TNE MHSGLRD GPQV+D LMGLLNGK+PLF
Subjt:  NILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

A0A6J1II94 uncharacterized protein LOC111473341 isoform X15.3e-24683.47Show/hide
Query:  SIPSTTPNVSSVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPML
        S PS+     S+ PL  LLSA HCRS V SLAVMAGT P N  SP  H AGTWYSVPELRLRDHYFSVPL+YSLD  SSPKISV+ARE VSVGKEEQPM 
Subjt:  SIPSTTPNVSSVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPML

Query:  YLLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGF
        YL+YLQGGPGFECPRPTEASGWIQKAC+EFRVILMDQRGTGLSTPL+PSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIR RLVPDAAPWTILGQSYGGF
Subjt:  YLLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGF

Query:  CALTYLSFAPQGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTG
        CA+TYLSFAP+GLKQVLI+GGIPPIGNGC ADSVYRA  E+++IQNEKYYKRYPQDV+ V EVVKYLEENGGGVPLP GGILTP+GLQTLGLSALG STG
Subjt:  CALTYLSFAPQGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTG

Query:  FERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEI
        FER+HYLFER WDPIIVPGAPKRIS+FFLN I  W+   SNPLY L++ES+YCQGASSRWSA+RIRNE+ +KFDA KAVK+G PVYFTGEM FPWMFDEI
Subjt:  FERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEI

Query:  HALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        HALKPFKD ANILA KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKL METASQI G RLW+TNE MHSGLRD GPQV+D LMG LNGK+PLF
Subjt:  HALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH4.4e-6434.78Show/hide
Query:  RLRDHYFSVPLSYSLDQVSSPKISVFAREAVSV-GKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKACDE-FRVILMDQRGTGLSTPLTPSSMSQFQTA
        R  +  F VPL++S  +     + +FAR    V G ++  + ++LYLQGGPG  C  P E + W+    ++ +RV+ +D+RGTG S+P+T  +++Q    
Subjt:  RLRDHYFSVPLSYSLDQVSSPKISVFAREAVSV-GKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKACDE-FRVILMDQRGTGLSTPLTPSSMSQFQTA

Query:  EDLANYLKHFRADNIVNDAEFIRIRLVPDA----APWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGN--GCMADSVYRAGLEQVLIQNEKYYK
        +  A+ LK FRADNIV D E +R  L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N  G +   ++         +NE YYK
Subjt:  EDLANYLKHFRADNIVNDAEFIRIRLVPDA----APWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGN--GCMADSVYRAGLEQVLIQNEKYYK

Query:  RYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESV
        +YP+DV  V+ ++KYL+EN   +   S G LTP   Q LG+  LG   G + +H + +R  + +      K ++   L+ I+N      N +Y+LL E +
Subjt:  RYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESV

Query:  YCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLT
        YCQG +  W A++ R +   +F  N+       ++FTGE  F  MF+    LK  K VA +LA   DW  LY+ A L  N+VPV  A   EDM+V++ L 
Subjt:  YCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLT

Query:  METASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGL
          TAS++   +  + N   H  +     +V+  L  L
Subjt:  METASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGL

P46547 Proline iminopeptidase1.8e-9743.91Show/hide
Query:  YSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQ
        Y +  +    H+F+VPL +         I++F R      + +  + +LLYLQGGPGF  PRP+   GWI++A  EFRV+L+DQRGTG STP+    ++ 
Subjt:  YSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQ

Query:  FQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRY
            +  A+YL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC+LTYLS  P  L +V ++GG+ PIG    AD VYRA  ++V  +N  ++ R+
Subjt:  FQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGNGCMADSVYRAGLEQVLIQNEKYYKRY

Query:  PQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYC
        P        +  +L+ +   V LP+G  LT   LQ  GL  LG S  FE L+YL E A+         ++++  FL ++    PF +NP++A+L+E +YC
Subjt:  PQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWIPFGSNPLYALLNESVYC

Query:  QGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTME
        +GA+S W+AER+R E    F A  A   G    FTGEM FPWMF++   L P K+ A++LA K DW PLYD   L  NKVPVA AVY EDM+V F  + E
Subjt:  QGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLTME

Query:  TASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGL
        T   +  +R WITNE  H+GLR  G Q++D+L+ L
Subjt:  TASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGL

Q9PD69 Proline iminopeptidase5.2e-0428.57Show/hide
Query:  LLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFC
        ++ L GGPG  C              D++R++L DQRG G STP               AN   +   D +V D E +R+ L      W + G S+G   
Subjt:  LLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFC

Query:  ALTYLSFAPQGLKQVLISG
        AL Y    P+   ++++ G
Subjt:  ALTYLSFAPQGLKQVLISG

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase3.1e-0430Show/hide
Query:  LLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRL-VPDAAPWTILGQSYGGF
        +++L GGPG      T  S       + +R++L DQRG G S   TP +  +  T  DL            VND E +R  L +P+   W + G S+G  
Subjt:  LLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRL-VPDAAPWTILGQSYGGF

Query:  CALTYLSFAPQGLKQVLISG
         AL Y    P  +  +++ G
Subjt:  CALTYLSFAPQGLKQVLISG

AT2G14260.2 proline iminopeptidase3.1e-0430Show/hide
Query:  LLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRL-VPDAAPWTILGQSYGGF
        +++L GGPG      T  S       + +R++L DQRG G S   TP +  +  T  DL            VND E +R  L +P+   W + G S+G  
Subjt:  LLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRL-VPDAAPWTILGQSYGGF

Query:  CALTYLSFAPQGLKQVLISG
         AL Y    P  +  +++ G
Subjt:  CALTYLSFAPQGLKQVLISG

AT3G61540.1 alpha/beta-Hydrolases superfamily protein9.1e-20674.34Show/hide
Query:  GESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTG
        GES   HV G W+SVPELRLRDH F VPL YS    SSPKI+VFARE V+VGKEEQ M YLLYLQGGPGFE PRP+EASGWIQ+AC+EFRV+L+DQRGTG
Subjt:  GESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRPTEASGWIQKACDEFRVILMDQRGTG

Query:  LSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGNGCMADSVYRAGLEQ
        LSTPLT SSM QF++A++LA+YL HFRADNIV DAEFIR+RLVP A PWTILGQS+GGFCALTYLSFAP+GLKQVLI+GGIPPIG  C AD VY AG EQ
Subjt:  LSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIGNGCMADSVYRAGLEQ

Query:  VLIQNEKYYKRYPQDVESVQEVVKYL-EENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWIPFGS
        V  QNEKYYKR+PQD+E V+E+V YL E  GGGVPLPSGGILTP+GLQTLGLS LG STGFERLHY+ ER WDPI+V GAPK IS FFLN  ++W  F +
Subjt:  VLIQNEKYYKRYPQDVESVQEVVKYL-EENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWIPFGS

Query:  NPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVY
        NPLYALL+E++YC+GASS WSA R+R++   KFDA KAVK+  PV FTGEM FPWMFDEIHALKPFK  A++LA KEDWPPLYD+  L+NNKVPVAAAVY
Subjt:  NPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVY

Query:  YEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF
        YEDM+VNFKL  ETAS I G RLW+TNE MHSGLRDAG Q++D L+G++NGK+PLF
Subjt:  YEDMFVNFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTGGGTTTTGTGGGAATTTTGGAGTTTGTTGACTAATGAAAAGAAATGGAATGGAATCAAGGCATGGATGACGTGTCACTCGCTGCTCCGCCGTGCCGCTCCTCC
CTCCTCGACTTGGCTCCCCGCCTTTCCCGACTTCCCTCAAATTCTCGTTAGTTCAGTCAATTCTAGCTACACAACGGCTACAAAGACCGAGTGCATTAAGTTTGCTGGTA
ATCCTGCATTACGCATCCAGTCTATCCTTAACCTTGTGCTACCAGCACGCGAGTCAACTTGTCCTATGACGGGCTCGCAGGTGCACCCTGGTAGTATACCATCTACGACT
CCTAACGTCTCCTCAGTAACTCCACTCCCGAACCTTCTCTCCGCTGCCCATTGCCGGAGCCCGGTCCCTTCGTTGGCAGTCATGGCCGGAACCAATCCCCCTAATGGAGA
ATCGCCGCATGTGCATGTGGCTGGCACATGGTACTCCGTGCCGGAGCTTCGTCTCCGGGACCATTACTTCTCCGTGCCTCTCAGCTACTCTCTAGATCAGGTTTCTTCTC
CCAAGATCTCCGTTTTTGCCCGGGAAGCTGTTTCAGTGGGGAAAGAGGAGCAACCAATGCTATACCTTTTATACTTACAAGGAGGACCCGGGTTCGAGTGCCCCCGACCA
ACCGAAGCCAGTGGATGGATACAAAAAGCATGTGATGAATTTCGTGTTATATTGATGGATCAGCGAGGAACAGGATTATCAACTCCCTTGACTCCATCGTCCATGTCACA
ATTTCAAACCGCAGAGGATTTAGCCAATTATTTGAAACATTTTCGAGCTGATAACATAGTTAATGATGCTGAATTTATTAGGATTCGTCTTGTTCCTGATGCTGCTCCTT
GGACCATTTTGGGTCAGAGCTATGGGGGTTTTTGTGCACTTACGTATTTGAGCTTTGCACCACAAGGATTGAAACAAGTCCTCATAAGTGGTGGGATCCCTCCAATTGGG
AATGGATGCATGGCAGATTCTGTATATAGAGCAGGCCTTGAACAGGTTCTAATCCAAAATGAAAAGTACTACAAGAGATACCCTCAGGATGTTGAATCAGTCCAGGAAGT
TGTGAAATACTTGGAAGAGAATGGCGGCGGGGTTCCTCTTCCCTCTGGTGGTATCTTGACACCCAGAGGACTGCAAACTCTTGGTCTTTCTGCTTTGGGGTTCAGCACTG
GTTTTGAGCGCTTGCACTATCTGTTTGAGAGAGCATGGGATCCTATAATTGTGCCTGGAGCACCGAAACGAATCAGTTTTTTCTTCCTCAACGAGATTGATAACTGGATA
CCATTCGGTTCGAATCCTCTTTATGCTCTCTTGAATGAATCTGTTTACTGCCAGGGCGCCTCATCTCGTTGGTCTGCTGAAAGAATAAGGAATGAAGTGGGTAGCAAATT
TGATGCAAATAAGGCTGTAAAGGATGGATTTCCTGTGTATTTCACTGGAGAGATGAGCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGTGG
CTAATATATTGGCTAATAAGGAGGATTGGCCTCCTCTATATGACATTGCTGCTCTTAAAAATAACAAGGTTCCAGTCGCAGCTGCAGTATATTACGAAGATATGTTTGTC
AACTTCAAGCTGACCATGGAGACAGCTTCCCAAATAGGAGGAACAAGGCTGTGGATTACTAATGAATTAATGCATTCTGGTCTGCGTGATGCGGGGCCCCAAGTTGTGGA
TCAGTTGATGGGATTGTTAAATGGAAAGAGGCCTTTATTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTGGGTTTTGTGGGAATTTTGGAGTTTGTTGACTAATGAAAAGAAATGGAATGGAATCAAGGCATGGATGACGTGTCACTCGCTGCTCCGCCGTGCCGCTCCTCC
CTCCTCGACTTGGCTCCCCGCCTTTCCCGACTTCCCTCAAATTCTCGTTAGTTCAGTCAATTCTAGCTACACAACGGCTACAAAGACCGAGTGCATTAAGTTTGCTGGTA
ATCCTGCATTACGCATCCAGTCTATCCTTAACCTTGTGCTACCAGCACGCGAGTCAACTTGTCCTATGACGGGCTCGCAGGTGCACCCTGGTAGTATACCATCTACGACT
CCTAACGTCTCCTCAGTAACTCCACTCCCGAACCTTCTCTCCGCTGCCCATTGCCGGAGCCCGGTCCCTTCGTTGGCAGTCATGGCCGGAACCAATCCCCCTAATGGAGA
ATCGCCGCATGTGCATGTGGCTGGCACATGGTACTCCGTGCCGGAGCTTCGTCTCCGGGACCATTACTTCTCCGTGCCTCTCAGCTACTCTCTAGATCAGGTTTCTTCTC
CCAAGATCTCCGTTTTTGCCCGGGAAGCTGTTTCAGTGGGGAAAGAGGAGCAACCAATGCTATACCTTTTATACTTACAAGGAGGACCCGGGTTCGAGTGCCCCCGACCA
ACCGAAGCCAGTGGATGGATACAAAAAGCATGTGATGAATTTCGTGTTATATTGATGGATCAGCGAGGAACAGGATTATCAACTCCCTTGACTCCATCGTCCATGTCACA
ATTTCAAACCGCAGAGGATTTAGCCAATTATTTGAAACATTTTCGAGCTGATAACATAGTTAATGATGCTGAATTTATTAGGATTCGTCTTGTTCCTGATGCTGCTCCTT
GGACCATTTTGGGTCAGAGCTATGGGGGTTTTTGTGCACTTACGTATTTGAGCTTTGCACCACAAGGATTGAAACAAGTCCTCATAAGTGGTGGGATCCCTCCAATTGGG
AATGGATGCATGGCAGATTCTGTATATAGAGCAGGCCTTGAACAGGTTCTAATCCAAAATGAAAAGTACTACAAGAGATACCCTCAGGATGTTGAATCAGTCCAGGAAGT
TGTGAAATACTTGGAAGAGAATGGCGGCGGGGTTCCTCTTCCCTCTGGTGGTATCTTGACACCCAGAGGACTGCAAACTCTTGGTCTTTCTGCTTTGGGGTTCAGCACTG
GTTTTGAGCGCTTGCACTATCTGTTTGAGAGAGCATGGGATCCTATAATTGTGCCTGGAGCACCGAAACGAATCAGTTTTTTCTTCCTCAACGAGATTGATAACTGGATA
CCATTCGGTTCGAATCCTCTTTATGCTCTCTTGAATGAATCTGTTTACTGCCAGGGCGCCTCATCTCGTTGGTCTGCTGAAAGAATAAGGAATGAAGTGGGTAGCAAATT
TGATGCAAATAAGGCTGTAAAGGATGGATTTCCTGTGTATTTCACTGGAGAGATGAGCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGTGG
CTAATATATTGGCTAATAAGGAGGATTGGCCTCCTCTATATGACATTGCTGCTCTTAAAAATAACAAGGTTCCAGTCGCAGCTGCAGTATATTACGAAGATATGTTTGTC
AACTTCAAGCTGACCATGGAGACAGCTTCCCAAATAGGAGGAACAAGGCTGTGGATTACTAATGAATTAATGCATTCTGGTCTGCGTGATGCGGGGCCCCAAGTTGTGGA
TCAGTTGATGGGATTGTTAAATGGAAAGAGGCCTTTATTCTAA
Protein sequenceShow/hide protein sequence
MEWVLWEFWSLLTNEKKWNGIKAWMTCHSLLRRAAPPSSTWLPAFPDFPQILVSSVNSSYTTATKTECIKFAGNPALRIQSILNLVLPARESTCPMTGSQVHPGSIPSTT
PNVSSVTPLPNLLSAAHCRSPVPSLAVMAGTNPPNGESPHVHVAGTWYSVPELRLRDHYFSVPLSYSLDQVSSPKISVFAREAVSVGKEEQPMLYLLYLQGGPGFECPRP
TEASGWIQKACDEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRIRLVPDAAPWTILGQSYGGFCALTYLSFAPQGLKQVLISGGIPPIG
NGCMADSVYRAGLEQVLIQNEKYYKRYPQDVESVQEVVKYLEENGGGVPLPSGGILTPRGLQTLGLSALGFSTGFERLHYLFERAWDPIIVPGAPKRISFFFLNEIDNWI
PFGSNPLYALLNESVYCQGASSRWSAERIRNEVGSKFDANKAVKDGFPVYFTGEMSFPWMFDEIHALKPFKDVANILANKEDWPPLYDIAALKNNKVPVAAAVYYEDMFV
NFKLTMETASQIGGTRLWITNELMHSGLRDAGPQVVDQLMGLLNGKRPLF