; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0027967 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0027967
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionProline iminopeptidase
Genome locationchr07:160888..167720
RNA-Seq ExpressionPI0027967
SyntenyPI0027967
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0004177 - aminopeptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR005944 - Proline iminopeptidase
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044299.1 proline iminopeptidase [Cucumis melo var. makuwa]4.8e-19184.85Show/hide
Query:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
        MSLLGLCP N P  LFS S+  F F  P      HCCL GAKG V TAQLGYKSD QSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
Subjt:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY

Query:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
        WEQSGNPTGHPVVFLHGGPGGGTA GNRRFFDP+FYRIILFDQ    +STPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
Subjt:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE

Query:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII
                       KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLP    +K G+     + 
Subjt:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII

Query:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_008454439.1 PREDICTED: proline iminopeptidase [Cucumis melo]5.7e-19285.1Show/hide
Query:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
        MSLLGLCP N P  LFS S+  F F  P      HCCL GAKG V TAQLGYKSD QSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
Subjt:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY

Query:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
        WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDP+FYRIILFDQ    +STPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
Subjt:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE

Query:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII
                       KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLP    +K G+     + 
Subjt:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII

Query:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_011653440.1 proline iminopeptidase [Cucumis sativus]4.0e-19384.29Show/hide
Query:  MSLLGLCPNIPLLLFSPSSPIFIFLS------PSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSD
        MSLLGLCPN      + SSP+F F S      P    S  CCLSGAKGSV TAQLGYKSDSQSEFQPKDLMAGEKEISGI RNPYPPIEPYSTGFLKVSD
Subjt:  MSLLGLCPNIPLLLFSPSSPIFIFLS------PSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDP+FYRIILFDQ    +STPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK--
        QSHPE               KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLP    +K G+  
Subjt:  QSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK--

Query:  --MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
           + F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
Subjt:  --MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

XP_023552784.1 proline iminopeptidase [Cucurbita pepo subsp. pepo]2.7e-18179.8Show/hide
Query:  MSLLGLCPN----IPLLLFSPSSPI--FIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSD
        M LLGLCPN     PLL F  +S       L P    S H C+SG KG VL AQ GYKSDSQSEFQ KDLMAGEKEI GIN+ PYPPIEPYSTG LKVSD
Subjt:  MSLLGLCPN----IPLLLFSPSSPI--FIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDP+FYRIILFDQ    +STPHACLEDNTTW+LIDDIEKLREHL+IPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK--
        Q+HPE               KEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLP    +K G+  
Subjt:  QSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK--

Query:  --MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
           + F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPG+AAELVAANEKLKNILQKNG
Subjt:  --MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

XP_038905843.1 proline iminopeptidase isoform X1 [Benincasa hispida]8.5e-18881.23Show/hide
Query:  MSLLGLCPNIPLLLFSPSSPIFIFLS-----------PSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGF
        MSLLGLCPN      + SSP+F F+S           P    S HCC+ G KG  LTA  GYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGF
Subjt:  MSLLGLCPNIPLLLFSPSSPIFIFLS-----------PSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGF

Query:  LKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTL
        LKVSDLHTIYWEQSGNP GHPVVFLHGGPGGGT PGNRRFFDP+FYRIILFDQ    +STPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTL
Subjt:  LKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTL

Query:  ALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLK
        ALAYSQSHPE               KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNSKDMETQYAAARAWTKWEMMTAHLLP    +K
Subjt:  ALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLK

Query:  EGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNI
         G      + F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIIS+AGHSANEPGIAAELVAANEKLKNI
Subjt:  EGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNI

Query:  LQKNG
        LQKNG
Subjt:  LQKNG

TrEMBL top hitse value%identityAlignment
A0A0A0KW74 Proline iminopeptidase1.9e-19384.29Show/hide
Query:  MSLLGLCPNIPLLLFSPSSPIFIFLS------PSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSD
        MSLLGLCPN      + SSP+F F S      P    S  CCLSGAKGSV TAQLGYKSDSQSEFQPKDLMAGEKEISGI RNPYPPIEPYSTGFLKVSD
Subjt:  MSLLGLCPNIPLLLFSPSSPIFIFLS------PSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDP+FYRIILFDQ    +STPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK--
        QSHPE               KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLP    +K G+  
Subjt:  QSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK--

Query:  --MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
           + F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
Subjt:  --MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

A0A1S3BYQ6 Proline iminopeptidase2.8e-19285.1Show/hide
Query:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
        MSLLGLCP N P  LFS S+  F F  P      HCCL GAKG V TAQLGYKSD QSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
Subjt:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY

Query:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
        WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDP+FYRIILFDQ    +STPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
Subjt:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE

Query:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII
                       KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLP    +K G+     + 
Subjt:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII

Query:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5A7TSV1 Proline iminopeptidase2.3e-19184.85Show/hide
Query:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
        MSLLGLCP N P  LFS S+  F F  P      HCCL GAKG V TAQLGYKSD QSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
Subjt:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY

Query:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
        WEQSGNPTGHPVVFLHGGPGGGTA GNRRFFDP+FYRIILFDQ    +STPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
Subjt:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE

Query:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII
                       KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLP    +K G+     + 
Subjt:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII

Query:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5D3E1M3 Proline iminopeptidase2.8e-19285.1Show/hide
Query:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
        MSLLGLCP N P  LFS S+  F F  P      HCCL GAKG V TAQLGYKSD QSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY
Subjt:  MSLLGLCP-NIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIY

Query:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
        WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDP+FYRIILFDQ    +STPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE
Subjt:  WEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE

Query:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII
                       KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLP    +K G+     + 
Subjt:  ---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MII

Query:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A6J1E5K8 Proline iminopeptidase1.2e-17977.59Show/hide
Query:  MSLLGLCPNIPLLLFSPSSPIFIFLS-----------PSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGF
        M LLGLCPN      S ++P+  F+S           P    S H C+SG KG VL AQ GYKSDSQSEFQ KDLMAGEKEI GIN+ PYPPIEPYSTG 
Subjt:  MSLLGLCPNIPLLLFSPSSPIFIFLS-----------PSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGF

Query:  LKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTL
        LKVSDLHTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDP+FYRIILFDQ    +STPHACLE+NTTW+LI DIEKLREHL+IPEWQVFGGSWGSTL
Subjt:  LKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTL

Query:  ALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLK
        ALAYSQ+HPE               KEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLP    +K
Subjt:  ALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLK

Query:  EGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNI
         G+     + F  IENHYF+NKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPG+AAELVA NEKLKNI
Subjt:  EGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNI

Query:  LQKNGP
        LQKNGP
Subjt:  LQKNGP

SwissProt top hitse value%identityAlignment
O32449 Proline iminopeptidase2.8e-8548.22Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YPP+  Y +G+L   D H IYWE SGNP G P VF+HGGPGGG +P +R+ FDP  Y+++LFDQ     S PHA L++NTTW+L+ DIE+LRE   +
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEK---------------EIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
         +W VFGGSWGSTLALAY+Q+HPE+                + W+Y+ GA+  +P+ WE    ++ + ER   + AY +RL S D + Q  AA+ W+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEK---------------EIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPMRRTLKEGK---MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIA
          T  LLP R +   G+    + F  IENHYF + GF  SD  LL N+  IRHI AVIV GRYD+ C + +AWDL K WPEAEL I+  AGHS +EPGI 
Subjt:  MMTAHLLPMRRTLKEGK---MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIA

Query:  AELVAANEK
         +L+ A ++
Subjt:  AELVAANEK

O83041 Probable proline iminopeptidase8.2e-9354.84Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YP I PY +G L VS LHTIY+EQSGNP G PVVFLHGGPGGGT P  R++FDP+ +RIILFDQ    +STPHA L +NTTW+L+ DIEKLR HL I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          W VFGGSWGSTL+LAYSQ+HP+               KEI WFY+ GA+ I+PDAWE + + IP  ER   + AY +RL SKD E +  AA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLL---PMRRTLKEGKMI-IFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L+    ++    + +    F  IE HYFIN+GFF +D  LL N D+I HI  VIVQGRYDV CPM SAW LHK  PE+EL ++ DAGHS  E GI
Subjt:  MMTAHLL---PMRRTLKEGKMI-IFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANEK
         + L+ A ++
Subjt:  AAELVAANEK

P93732 Proline iminopeptidase3.4e-13970.62Show/hide
Query:  QSEFQPKDLMAGEKEISGIN-RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACL
        +SE    D M   +  + +N R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP FYRI+LFDQ    +STPHACL
Subjt:  QSEFQPKDLMAGEKEISGIN-RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACL

Query:  EDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYS
        E+NTTW+L++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+               KEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY 
Subjt:  EDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYS

Query:  KRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHK
        KRLNS D+E QYAAARAWTKWEMMTA+L P    +++ +     + F  IENHYF+NKGFFPSDS LLDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK
Subjt:  KRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHK

Query:  VWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL
         WPEAELKI+ DAGHSANEPGI+AELV ANEK+K ++
Subjt:  VWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL

Q87DF8 Proline iminopeptidase3.2e-8149.51Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YP + P+  G L V D H +Y+EQ GNP G PVV LHGGPGGG     RRF DP+ YRI+LFDQ     S PHA L +NTTW+L+ DIEKLR  L I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEK---------------EIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+               E++WFY+ GA+ ++PDAW+ +   IP  ER   + A+ +RL S D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEK---------------EIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L   +  +   +     + F  IENHYF+N GFF  +  LL +  +I +I  VIV GRYDV CP+ +AWDLHK WP+A LKI   AGHSA EP  
Subjt:  MMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Q9PD69 Proline iminopeptidase5.0e-8249.84Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YP + P+  G L V D H +Y+EQ GNP G PVV LHGGPG G     RRF DP+ YRI+LFDQ     STPHA L +NTTW+L+ DIEKLR  L I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEK---------------EIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+               E++WFY+ GA+ ++PDAW+ +  +IP  ER   + A+ +RL S+D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEK---------------EIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L   +  +   +     + F  IENHYF+N GFF  ++ LL +  +I +I  VIV GRYDV CP+ +AWDLHKVWP+A LKI   AGHSA EP  
Subjt:  MMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase2.4e-14070.62Show/hide
Query:  QSEFQPKDLMAGEKEISGIN-RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACL
        +SE    D M   +  + +N R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP FYRI+LFDQ    +STPHACL
Subjt:  QSEFQPKDLMAGEKEISGIN-RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACL

Query:  EDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYS
        E+NTTW+L++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+               KEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY 
Subjt:  EDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYS

Query:  KRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHK
        KRLNS D+E QYAAARAWTKWEMMTA+L P    +++ +     + F  IENHYF+NKGFFPSDS LLDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK
Subjt:  KRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHK

Query:  VWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL
         WPEAELKI+ DAGHSANEPGI+AELV ANEK+K ++
Subjt:  VWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL

AT2G14260.2 proline iminopeptidase1.2e-13973.73Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI
        R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP FYRI+LFDQ    +STPHACLE+NTTW+L++DIEKLREHL+I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQ---VESTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKW
        PEW VFGGSWGSTLALAYSQSHP+               KEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKW
Subjt:  PEWQVFGGSWGSTLALAYSQSHPE---------------KEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKW

Query:  EMMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPG
        EMMTA+L P    +++ +     + F  IENHYF+NKGFFPSDS LLDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPG
Subjt:  EMMTAHLLPMRRTLKEGK----MIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPG

Query:  IAAELVAANEKLKNIL
        I+AELV ANEK+K ++
Subjt:  IAAELVAANEKLKNIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTTGTTAGGCCTTTGCCCTAACATTCCTCTTCTCCTCTTTTCTCCTTCTTCTCCAATTTTCATTTTCCTCTCCCCGTCCCTCGCGTTTTCCACCCATTGCTGCCT
TTCAGGGGCAAAAGGTTCCGTCTTAACAGCACAATTGGGTTATAAGAGCGACAGTCAGAGTGAGTTCCAACCAAAGGATTTGATGGCTGGAGAAAAGGAAATTTCAGGAA
TAAACAGAAACCCTTACCCACCTATTGAGCCATACAGTACTGGTTTTTTGAAAGTGTCGGATCTTCATACTATTTATTGGGAGCAATCTGGGAATCCCACTGGTCATCCA
GTGGTCTTTCTACATGGGGGACCAGGGGGAGGTACTGCTCCAGGCAATAGAAGATTCTTTGACCCAAATTTTTATAGAATTATTTTGTTTGATCAGGTGGAAAGTACACC
ACATGCTTGCTTGGAGGATAATACCACATGGAACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCAGAGTGGCAGGTCTTTGGAGGTTCCTGGGGTA
GTACGTTGGCTCTTGCTTATAGTCAATCTCATCCGGAAAAGGAAATTGATTGGTTTTATGAAGGTGGTGCTGCCGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGAT
CTCATTCCCGAAAGTGAGAGGGGATGTTTTGTTGATGCTTATAGTAAAAGATTGAATTCAAAGGATATGGAAACCCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGA
AATGATGACTGCTCATCTTTTGCCAATGAGGAGAACATTAAAAGAGGGGAAGATGATAATTTTTCGTTGGATTGAAAACCATTACTTCATAAATAAGGGGTTTTTCCCTT
CTGATTCCTTTCTGCTAGATAATATTGACAAGATACGACATATCAATGCTGTAATTGTACAGGGAAGATATGACGTTTGCTGTCCAATGATGTCTGCTTGGGATCTTCAT
AAAGTGTGGCCCGAGGCGGAATTAAAGATCATTTCCGACGCAGGACACTCTGCTAATGAGCCTGGAATAGCTGCTGAGCTCGTGGCTGCAAACGAGAAGCTGAAGAACAT
CCTCCAGAAGAATGGACCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCTTGTTAGGCCTTTGCCCTAACATTCCTCTTCTCCTCTTTTCTCCTTCTTCTCCAATTTTCATTTTCCTCTCCCCGTCCCTCGCGTTTTCCACCCATTGCTGCCT
TTCAGGGGCAAAAGGTTCCGTCTTAACAGCACAATTGGGTTATAAGAGCGACAGTCAGAGTGAGTTCCAACCAAAGGATTTGATGGCTGGAGAAAAGGAAATTTCAGGAA
TAAACAGAAACCCTTACCCACCTATTGAGCCATACAGTACTGGTTTTTTGAAAGTGTCGGATCTTCATACTATTTATTGGGAGCAATCTGGGAATCCCACTGGTCATCCA
GTGGTCTTTCTACATGGGGGACCAGGGGGAGGTACTGCTCCAGGCAATAGAAGATTCTTTGACCCAAATTTTTATAGAATTATTTTGTTTGATCAGGTGGAAAGTACACC
ACATGCTTGCTTGGAGGATAATACCACATGGAACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCAGAGTGGCAGGTCTTTGGAGGTTCCTGGGGTA
GTACGTTGGCTCTTGCTTATAGTCAATCTCATCCGGAAAAGGAAATTGATTGGTTTTATGAAGGTGGTGCTGCCGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGAT
CTCATTCCCGAAAGTGAGAGGGGATGTTTTGTTGATGCTTATAGTAAAAGATTGAATTCAAAGGATATGGAAACCCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGA
AATGATGACTGCTCATCTTTTGCCAATGAGGAGAACATTAAAAGAGGGGAAGATGATAATTTTTCGTTGGATTGAAAACCATTACTTCATAAATAAGGGGTTTTTCCCTT
CTGATTCCTTTCTGCTAGATAATATTGACAAGATACGACATATCAATGCTGTAATTGTACAGGGAAGATATGACGTTTGCTGTCCAATGATGTCTGCTTGGGATCTTCAT
AAAGTGTGGCCCGAGGCGGAATTAAAGATCATTTCCGACGCAGGACACTCTGCTAATGAGCCTGGAATAGCTGCTGAGCTCGTGGCTGCAAACGAGAAGCTGAAGAACAT
CCTCCAGAAGAATGGACCATAG
Protein sequenceShow/hide protein sequence
MSLLGLCPNIPLLLFSPSSPIFIFLSPSLAFSTHCCLSGAKGSVLTAQLGYKSDSQSEFQPKDLMAGEKEISGINRNPYPPIEPYSTGFLKVSDLHTIYWEQSGNPTGHP
VVFLHGGPGGGTAPGNRRFFDPNFYRIILFDQVESTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKEIDWFYEGGAAAIYPDAWESFRD
LIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPMRRTLKEGKMIIFRWIENHYFINKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLH
KVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP