; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G16116 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G16116
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionProline iminopeptidase
Genome locationctg2194:179024..184348
RNA-Seq ExpressionCucsat.G16116
SyntenyCucsat.G16116
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0004177 - aminopeptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR005944 - Proline iminopeptidase
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044299.1 proline iminopeptidase [Cucumis melo var. makuwa]5.33e-29296.71Show/hide
Query:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
        MSLLGLCPNNS SPLFSF SN H RLPVPRL N CCL GAKG VFTAQLGYKSD QSEFQPKDLMAGEKEISGI RNPYPPIEPYSTGFLKVSDLHTIYW
Subjt:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW

Query:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
        EQSGNPTGHPVVFLHGGPGGGTA GNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
Subjt:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK

Query:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
        VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FSLAF
Subjt:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF

Query:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        ARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_008454439.1 PREDICTED: proline iminopeptidase [Cucumis melo]3.91e-29497.22Show/hide
Query:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
        MSLLGLCPNNS SPLFSF SN H RLPVPRL N CCL GAKG VFTAQLGYKSD QSEFQPKDLMAGEKEISGI RNPYPPIEPYSTGFLKVSDLHTIYW
Subjt:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW

Query:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
        EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
Subjt:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK

Query:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
        VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
Subjt:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF

Query:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        ARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_011653440.1 proline iminopeptidase [Cucumis sativus]2.48e-30499.75Show/hide
Query:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
        MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
Subjt:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW

Query:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
        EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
Subjt:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK

Query:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
        VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
Subjt:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF

Query:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        ARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_023552784.1 proline iminopeptidase [Cucurbita pepo subsp. pepo]2.53e-27490.02Show/hide
Query:  MSLLGLCPNNS-SSPLFSFFSNSHLR-----LPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSD
        M LLGLCPNNS ++PL SF SNSH R      PV R+SN  C+SG KG V  AQ GYKSDSQSEFQ KDLMAGEKEI GI + PYPPIEPYSTG LKVSD
Subjt:  MSLLGLCPNNS-SSPLFSFFSNSHLR-----LPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHL+IPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
        Q+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
Subjt:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD

Query:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
         FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPG+AAELVAANEKLKNILQKNG
Subjt:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

XP_038905843.1 proline iminopeptidase isoform X1 [Benincasa hispida]8.38e-28693.48Show/hide
Query:  MSLLGLCPNNSSSPLFSFFSNSHLR-----LPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNSSSPLFSF SN H R      PVPR+SN CC+ G KG   TA  GYKSDSQSEFQPKDLMAGEKEISGI RNPYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFFSNSHLR-----LPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGT PGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRG+DDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIIS+AGHSANEPGIAAELVAANEKLKNILQKNG
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

TrEMBL top hitse value%identityAlignment
A0A0A0KW74 Proline iminopeptidase1.20e-30499.75Show/hide
Query:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
        MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
Subjt:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW

Query:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
        EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
Subjt:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK

Query:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
        VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
Subjt:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF

Query:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        ARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A1S3BYQ6 Proline iminopeptidase1.89e-29497.22Show/hide
Query:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
        MSLLGLCPNNS SPLFSF SN H RLPVPRL N CCL GAKG VFTAQLGYKSD QSEFQPKDLMAGEKEISGI RNPYPPIEPYSTGFLKVSDLHTIYW
Subjt:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW

Query:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
        EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
Subjt:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK

Query:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
        VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
Subjt:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF

Query:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        ARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5A7TSV1 Proline iminopeptidase2.58e-29296.71Show/hide
Query:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
        MSLLGLCPNNS SPLFSF SN H RLPVPRL N CCL GAKG VFTAQLGYKSD QSEFQPKDLMAGEKEISGI RNPYPPIEPYSTGFLKVSDLHTIYW
Subjt:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW

Query:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
        EQSGNPTGHPVVFLHGGPGGGTA GNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
Subjt:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK

Query:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
        VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FSLAF
Subjt:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF

Query:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        ARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5D3E1M3 Proline iminopeptidase1.89e-29497.22Show/hide
Query:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW
        MSLLGLCPNNS SPLFSF SN H RLPVPRL N CCL GAKG VFTAQLGYKSD QSEFQPKDLMAGEKEISGI RNPYPPIEPYSTGFLKVSDLHTIYW
Subjt:  MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYW

Query:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
        EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK
Subjt:  EQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEK

Query:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
        VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF
Subjt:  VTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAF

Query:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        ARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  ARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A6J1E5K8 Proline iminopeptidase3.64e-27189.03Show/hide
Query:  MSLLGLCPNNS-SSPLFSFFSNSHLR-----LPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSD
        M LLGLCPNNS ++PL SF SN H R      PV R+SN  C+SG KG V  AQ GYKSDSQSEFQ KDLMAGEKEI GI + PYPPIEPYSTG LKVSD
Subjt:  MSLLGLCPNNS-SSPLFSFFSNSHLR-----LPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LI DIEKLREHL+IPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
        Q+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
Subjt:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD

Query:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
         FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIR INAVIVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPG+AAELVA NEKLKNILQKNG
Subjt:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

SwissProt top hitse value%identityAlignment
O32449 Proline iminopeptidase6.4e-10454.52Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YPP+  Y +G+L   D H IYWE SGNP G P VF+HGGPGGG +P +R+ FDP+ Y+++LFDQRG G+S PHA L++NTTW+L+ DIE+LRE   +
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
         +W VFGGSWGSTLALAY+Q+HPE+V+ +VLRGIF LRK+ + W+Y+ GA+  +P+ WE    ++ + ER   + AY +RL S D + Q  AA+ W+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T  LLP+ E+   GEDD F+LAFARIENHYF + GF  SD  LL N+  IR I AVIV GRYD+ C + +AWDL K WPEAEL I+  AGHS +EPGI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANEK
          +L+ A ++
Subjt:  AAELVAANEK

O83041 Probable proline iminopeptidase2.9e-11260.97Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YP I PY +G L VS LHTIY+EQSGNP G PVVFLHGGPGGGT P  R++FDP  +RIILFDQRGAGKSTPHA L +NTTW+L+ DIEKLR HL I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          W VFGGSWGSTL+LAYSQ+HP++  GL+LRGIFLLR+KEI WFY+ GA+ I+PDAWE + + IP  ER   + AY +RL SKD E +  AA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L+ +     +  DD F+ AFARIE HYF+N+GFF +D  LL N D+I  I  VIVQGRYDV CPM SAW LHK  PE+EL ++ DAGHS  E GI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANEK
         + L+ A ++
Subjt:  AAELVAANEK

P93732 Proline iminopeptidase1.1e-16483.86Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKW
        PEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKW
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKW

Query:  EMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPG
        EMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS LLDN+DKIR I   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPG
Subjt:  EMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPG

Query:  IAAELVAANEKLKNIL
        I+AELV ANEK+K ++
Subjt:  IAAELVAANEKLKNIL

Q87DF8 Proline iminopeptidase2.7e-10256.96Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YP + P+  G L V D H +Y+EQ GNP G PVV LHGGPGGG     RRF DPD YRI+LFDQRGAG+S PHA L +NTTW+L+ DIEKLR  L I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +   IP  ER   + A+ +RL S D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  +  LL +  +I  I  VIV GRYDV CP+ +AWDLHK WP+A LKI   AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Q9PD69 Proline iminopeptidase4.2e-10357.28Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YP + P+  G L V D H +Y+EQ GNP G PVV LHGGPG G     RRF DPD YRI+LFDQRGAG+STPHA L +NTTW+L+ DIEKLR  L I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +  +IP  ER   + A+ +RL S+D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  ++ LL +  +I  I  VIV GRYDV CP+ +AWDLHKVWP+A LKI   AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase7.9e-16683.86Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKW
        PEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKW
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKW

Query:  EMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPG
        EMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS LLDN+DKIR I   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPG
Subjt:  EMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPG

Query:  IAAELVAANEKLKNIL
        I+AELV ANEK+K ++
Subjt:  IAAELVAANEKLKNIL

AT2G14260.2 proline iminopeptidase7.9e-16683.86Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKW
        PEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKW
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKW

Query:  EMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPG
        EMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS LLDN+DKIR I   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPG
Subjt:  EMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPG

Query:  IAAELVAANEKLKNIL
        I+AELV ANEK+K ++
Subjt:  IAAELVAANEKLKNIL

AT3G61540.1 alpha/beta-Hydrolases superfamily protein3.7e-0628.57Show/hide
Query:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---------------LEDNTTWNLIDDIEKLREHL--EIPEWQVFGGSWGSTL
        +++L GGPG  G  P     +     + +R++L DQRG G STP  C               L      N++ D E +R  L  +   W + G S+G   
Subjt:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---------------LEDNTTWNLIDDIEKLREHL--EIPEWQVFGGSWGSTL

Query:  ALAYSQSHPEKVTGLVLRG
        AL Y    PE +  +++ G
Subjt:  ALAYSQSHPEKVTGLVLRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTTGTTAGGCCTTTGCCCTAACAACTCCTCTTCTCCTCTTTTCTCCTTCTTCTCCAATTCTCATCTTCGTCTCCCCGTCCCTCGCCTTTCTAACCGTTGTTGCCT
TTCAGGGGCAAAAGGTTCGGTCTTTACAGCACAGTTGGGTTATAAAAGCGACAGTCAGAGTGAGTTCCAACCAAAGGATTTGATGGCTGGAGAAAAGGAAATTTCAGGAA
TATACAGAAACCCTTACCCACCTATTGAGCCATACAGTACTGGATTTTTGAAAGTGTCGGATCTTCATACTATTTATTGGGAGCAATCTGGGAATCCCACTGGTCATCCA
GTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGATCAGCGAGGTGCAGGGAA
AAGTACACCACATGCTTGCTTGGAGGATAATACCACGTGGAACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCAGAGTGGCAGGTATTTGGAGGTT
CCTGGGGTAGTACGTTGGCTCTTGCTTATAGTCAATCTCATCCAGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAGGAAATTGATTGGTTT
TATGAAGGTGGTGCTGCCGCTATATATCCTGATGCCTGGGAGTCTTTTAGGGATCTAATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAGTAAAAGATTGAA
TTCGAAGGATATGGAAACCCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACTGCTCATCTTTTGCCAAATGAAGAGAACATTAAAAGAGGGGAAGATG
ATAATTTTTCATTGGCATTTGCAAGGATTGAAAACCATTACTTCGTAAATAAGGGGTTTTTCCCTTCTGATTCCTTTCTGCTAGATAATATTGACAAAATACGACGTATC
AATGCTGTAATTGTACAGGGAAGATATGACGTTTGCTGTCCAATGATGTCTGCTTGGGATCTTCATAAAGTGTGGCCCGAGGCGGAATTAAAGATCATTTCCGACGCAGG
CCATTCCGCTAATGAGCCTGGAATAGCTGCCGAGCTCGTGGCTGCGAATGAGAAGCTGAAGAACATCCTCCAGAAGAATGGACCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCTTGTTAGGCCTTTGCCCTAACAACTCCTCTTCTCCTCTTTTCTCCTTCTTCTCCAATTCTCATCTTCGTCTCCCCGTCCCTCGCCTTTCTAACCGTTGTTGCCT
TTCAGGGGCAAAAGGTTCGGTCTTTACAGCACAGTTGGGTTATAAAAGCGACAGTCAGAGTGAGTTCCAACCAAAGGATTTGATGGCTGGAGAAAAGGAAATTTCAGGAA
TATACAGAAACCCTTACCCACCTATTGAGCCATACAGTACTGGATTTTTGAAAGTGTCGGATCTTCATACTATTTATTGGGAGCAATCTGGGAATCCCACTGGTCATCCA
GTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGATCAGCGAGGTGCAGGGAA
AAGTACACCACATGCTTGCTTGGAGGATAATACCACGTGGAACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCAGAGTGGCAGGTATTTGGAGGTT
CCTGGGGTAGTACGTTGGCTCTTGCTTATAGTCAATCTCATCCAGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAGGAAATTGATTGGTTT
TATGAAGGTGGTGCTGCCGCTATATATCCTGATGCCTGGGAGTCTTTTAGGGATCTAATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAGTAAAAGATTGAA
TTCGAAGGATATGGAAACCCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACTGCTCATCTTTTGCCAAATGAAGAGAACATTAAAAGAGGGGAAGATG
ATAATTTTTCATTGGCATTTGCAAGGATTGAAAACCATTACTTCGTAAATAAGGGGTTTTTCCCTTCTGATTCCTTTCTGCTAGATAATATTGACAAAATACGACGTATC
AATGCTGTAATTGTACAGGGAAGATATGACGTTTGCTGTCCAATGATGTCTGCTTGGGATCTTCATAAAGTGTGGCCCGAGGCGGAATTAAAGATCATTTCCGACGCAGG
CCATTCCGCTAATGAGCCTGGAATAGCTGCCGAGCTCGTGGCTGCGAATGAGAAGCTGAAGAACATCCTCCAGAAGAATGGACCATAG
Protein sequenceShow/hide protein sequence
MSLLGLCPNNSSSPLFSFFSNSHLRLPVPRLSNRCCLSGAKGSVFTAQLGYKSDSQSEFQPKDLMAGEKEISGIYRNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHP
VVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWF
YEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRRI
NAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP