; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012360 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012360
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProline iminopeptidase
Genome locationscaffold63:198179..204186
RNA-Seq ExpressionMS012360
SyntenyMS012360
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0004177 - aminopeptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR005944 - Proline iminopeptidase
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008454439.1 PREDICTED: proline iminopeptidase [Cucumis melo]5.8e-20292.72Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        G KG V +A  GYKSDRQSEFQ +DLMA EKE S +NRNPYPPIEPYS GFLKVSD+HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDNFSLAFARIENHYF+NKGFFPSDSFLLDN+DKIRHINA+IVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
        YDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANEKLKNILQKN P
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP

XP_011653440.1 proline iminopeptidase [Cucumis sativus]1.9e-20092.16Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        G KG V +A  GYKSD QSEFQ +DLMA EKE S + RNPYPPIEPYS GFLKVSD+HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDNFSLAFARIENHYF+NKGFFPSDSFLLDN+DKIRHINA+IVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
        YDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANEKLKNILQKN P
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP

XP_022149275.1 proline iminopeptidase [Momordica charantia]7.8e-21599.72Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        GGKGLVLSAHFGYKSDR SEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
        YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP

XP_038905843.1 proline iminopeptidase isoform X1 [Benincasa hispida]2.4e-20393.52Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        GGKGL L+AHFGYKSD QSEFQ +DLMA EKE S +NRNPYPPIEPYS GFLKVSD+HTIYWEQSGNPAGHPVVFLHGGPGGGT PGNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHL+PNEENIKRG+DDNFSLAFARIENHYF+NKGFFPSDSFLLDN+DKIRHINA+IVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKN
        YDVCCPMMSAWDLHKVWPEAELKII NAGHSANEPGIAAELVAANEKLKNILQKN
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKN

XP_038905844.1 proline iminopeptidase isoform X2 [Benincasa hispida]1.2e-20293.5Show/hide
Query:  GKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIIL
        GKGL L+AHFGYKSD QSEFQ +DLMA EKE S +NRNPYPPIEPYS GFLKVSD+HTIYWEQSGNPAGHPVVFLHGGPGGGT PGNRRFFDPDFYRIIL
Subjt:  GKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIIL

Query:  FDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDL
        FDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRDL
Subjt:  FDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDL

Query:  IPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRY
        IPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHL+PNEENIKRG+DDNFSLAFARIENHYF+NKGFFPSDSFLLDN+DKIRHINA+IVQGRY
Subjt:  IPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRY

Query:  DVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKN
        DVCCPMMSAWDLHKVWPEAELKII NAGHSANEPGIAAELVAANEKLKNILQKN
Subjt:  DVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKN

TrEMBL top hitse value%identityAlignment
A0A0A0KW74 Proline iminopeptidase9.1e-20192.16Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        G KG V +A  GYKSD QSEFQ +DLMA EKE S + RNPYPPIEPYS GFLKVSD+HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDNFSLAFARIENHYF+NKGFFPSDSFLLDN+DKIRHINA+IVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
        YDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANEKLKNILQKN P
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP

A0A1S3BYQ6 Proline iminopeptidase2.8e-20292.72Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        G KG V +A  GYKSDRQSEFQ +DLMA EKE S +NRNPYPPIEPYS GFLKVSD+HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDNFSLAFARIENHYF+NKGFFPSDSFLLDN+DKIRHINA+IVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
        YDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANEKLKNILQKN P
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP

A0A5A7TSV1 Proline iminopeptidase1.2e-20092.16Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        G KG V +A  GYKSDRQSEFQ +DLMA EKE S +NRNPYPPIEPYS GFLKVSD+HTIYWEQSGNP GHPVVFLHGGPGGGTA GNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDD FSLAFARIENHYF+NKGFFPSDSFLLDN+DKIRHINA+IVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
        YDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANEKLKNILQKN P
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP

A0A5D3E1M3 Proline iminopeptidase2.8e-20292.72Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        G KG V +A  GYKSDRQSEFQ +DLMA EKE S +NRNPYPPIEPYS GFLKVSD+HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAY QSHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDNFSLAFARIENHYF+NKGFFPSDSFLLDN+DKIRHINA+IVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
        YDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANEKLKNILQKN P
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP

A0A6J1D5A1 Proline iminopeptidase3.8e-21599.72Show/hide
Query:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
        GGKGLVLSAHFGYKSDR SEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII
Subjt:  GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRII

Query:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
        LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD
Subjt:  LFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRD

Query:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
        LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR
Subjt:  LIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGR

Query:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
        YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP
Subjt:  YDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNILQKNEP

SwissProt top hitse value%identityAlignment
O32449 Proline iminopeptidase7.6e-10454.52Show/hide
Query:  RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  YPP+  Y +G+L   D H IYWE SGNP G P VF+HGGPGGG +P +R+ FDP+ Y+++LFDQRG G+S PHA L++NTTW L+ DIE+LRE   +
Subjt:  RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE
         +W VFGGSWGSTLALAY Q+HPE+V+ +VLRGIF LRK+ + W+Y+ GA+  +P+ WE    ++ + ER   + AY +RL S D + Q  AA+ W+ WE
Subjt:  PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE

Query:  MMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGI
          T  L+P+ E+   GEDD F+LAFARIENHYF + GF  SD  LL NV  IRHI A+IV GRYD+ C + +AWDL K WPEAEL I++ AGHS +EPGI
Subjt:  MMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGI

Query:  AAELVAANEK
          +L+ A ++
Subjt:  AAELVAANEK

O83041 Probable proline iminopeptidase3.8e-11160Show/hide
Query:  RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  YP I PY +G L VS +HTIY+EQSGNP G PVVFLHGGPGGGT P  R++FDP  +RIILFDQRGAGKSTPHA L +NTTWDL+ DIEKLR HL I
Subjt:  RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE
          W VFGGSWGSTL+LAY Q+HP++  GL+LRGIFLLR+KE+ WFY+ GA+ I+PDAWE + + IP  ER   + AY +RL S D E +  AA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE

Query:  MMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGI
          T+ L+ +     +  DD F+ AFARIE HYFIN+GFF +D  LL N D+I HI  +IVQGRYDV CPM SAW LHK  PE+EL ++ +AGHS  E GI
Subjt:  MMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGI

Query:  AAELVAANEK
         + L+ A ++
Subjt:  AAELVAANEK

P93732 Proline iminopeptidase7.0e-16680.12Show/hide
Query:  QSEFQTEDLMAREKESSEVN-RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACL
        +SE    D M   +  + VN R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACL
Subjt:  QSEFQTEDLMAREKESSEVN-RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACL

Query:  EDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYS
        E+NTTWDL++DIEKLREHL+IPEW VFGGSWGSTLALAY QSHP+KVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY 
Subjt:  EDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYS

Query:  KRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHK
        KRLNS+D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYF+NKGFFPSDS LLDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK
Subjt:  KRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHK

Query:  VWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNIL
         WPEAELKI+ +AGHSANEPGI+AELV ANEK+K ++
Subjt:  VWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNIL

Q87DF8 Proline iminopeptidase3.8e-10356.63Show/hide
Query:  RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  YP + P+ +G L V D H +Y+EQ GNP G PVV LHGGPGGG     RRF DPD YRI+LFDQRGAG+S PHA L +NTTWDL+ DIEKLR  L I
Subjt:  RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +   IP  ER   + A+ +RL S+D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE

Query:  MMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYF+N GFF  +  LL +  +I +I  +IV GRYDV CP+ +AWDLHK WP+A LKI   AGHSA EP  
Subjt:  MMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Q9PD69 Proline iminopeptidase1.3e-10356.96Show/hide
Query:  RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  YP + P+ +G L V D H +Y+EQ GNP G PVV LHGGPG G     RRF DPD YRI+LFDQRGAG+STPHA L +NTTWDL+ DIEKLR  L I
Subjt:  RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +  +IP  ER   + A+ +RL S D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE

Query:  MMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYF+N GFF  ++ LL +  +I +I  +IV GRYDV CP+ +AWDLHKVWP+A LKI   AGHSA EP  
Subjt:  MMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase5.0e-16780.12Show/hide
Query:  QSEFQTEDLMAREKESSEVN-RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACL
        +SE    D M   +  + VN R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACL
Subjt:  QSEFQTEDLMAREKESSEVN-RNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACL

Query:  EDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYS
        E+NTTWDL++DIEKLREHL+IPEW VFGGSWGSTLALAY QSHP+KVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY 
Subjt:  EDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYS

Query:  KRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHK
        KRLNS+D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYF+NKGFFPSDS LLDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK
Subjt:  KRLNSNDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHK

Query:  VWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNIL
         WPEAELKI+ +AGHSANEPGI+AELV ANEK+K ++
Subjt:  VWPEAELKIIQNAGHSANEPGIAAELVAANEKLKNIL

AT2G14260.2 proline iminopeptidase2.5e-16682.1Show/hide
Query:  EKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIE
        E E+    R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTWDL++DIE
Subjt:  EKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIE

Query:  KLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSNDMETQYA
        KLREHL+IPEW VFGGSWGSTLALAY QSHP+KVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS+D+E QYA
Subjt:  KLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSNDMETQYA

Query:  AARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNA
        AARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYF+NKGFFPSDS LLDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ +A
Subjt:  AARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNA

Query:  GHSANEPGIAAELVAANEKLKNIL
        GHSANEPGI+AELV ANEK+K ++
Subjt:  GHSANEPGIAAELVAANEKLKNIL

AT3G61540.1 alpha/beta-Hydrolases superfamily protein1.3e-0529.41Show/hide
Query:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---LEDNTTWDLID------------DIEKLREHL--EIPEWQVFGGSWGSTL
        +++L GGPG  G  P     +     + +R++L DQRG G STP  C   L+  +  +L D            D E +R  L  +   W + G S+G   
Subjt:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---LEDNTTWDLID------------DIEKLREHL--EIPEWQVFGGSWGSTL

Query:  ALAYGQSHPEKVTGLVLRG
        AL Y    PE +  +++ G
Subjt:  ALAYGQSHPEKVTGLVLRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGGGGAAAGGGCTTGGTCTTGAGTGCACATTTTGGTTATAAGAGCGATAGACAGAGTGAGTTCCAAACGGAGGATTTGATGGCTCGAGAAAAGGAATCTTCAGAAGTAAA
CAGAAACCCTTACCCACCAATAGAGCCTTACAGTAATGGTTTTTTGAAGGTGTCAGATATTCATACTATTTACTGGGAGCAATCGGGGAATCCCGCTGGCCATCCAGTGG
TGTTTCTACACGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGACCAGCGAGGTGCAGGGAAAAGT
ACCCCGCATGCTTGCTTGGAGGATAATACCACGTGGGACCTCATTGATGACATTGAGAAGCTAAGAGAACACTTGGAAATTCCAGAGTGGCAGGTCTTTGGAGGTTCCTG
GGGTAGTACGCTGGCTCTTGCTTATGGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGTAAAAAAGAAGTTGATTGGTTCTATG
AAGGTGGTGCTGCTGCTATATATCCTGACGCTTGGGAGTCTTTTAGAGACCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAGTAAGAGACTAAATTCA
AATGACATGGAAACCCAGTATGCTGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACTGCTCATCTTATGCCAAACGAGGAGAATATTAAGAGAGGGGAGGATGATAA
TTTTTCATTGGCGTTTGCAAGGATTGAAAACCATTACTTTATAAATAAGGGTTTTTTCCCTTCTGATTCCTTTCTGTTAGATAATGTTGACAAGATACGACATATCAATG
CTATAATTGTACAGGGAAGATATGATGTTTGCTGCCCAATGATGTCGGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCTGAGTTAAAGATCATTCAAAACGCAGGCCAT
TCTGCCAATGAACCTGGAATAGCTGCAGAGCTCGTGGCTGCGAACGAGAAGCTGAAGAACATCCTCCAGAAGAACGAACCA
mRNA sequenceShow/hide mRNA sequence
GGGGGAAAGGGCTTGGTCTTGAGTGCACATTTTGGTTATAAGAGCGATAGACAGAGTGAGTTCCAAACGGAGGATTTGATGGCTCGAGAAAAGGAATCTTCAGAAGTAAA
CAGAAACCCTTACCCACCAATAGAGCCTTACAGTAATGGTTTTTTGAAGGTGTCAGATATTCATACTATTTACTGGGAGCAATCGGGGAATCCCGCTGGCCATCCAGTGG
TGTTTCTACACGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGACCAGCGAGGTGCAGGGAAAAGT
ACCCCGCATGCTTGCTTGGAGGATAATACCACGTGGGACCTCATTGATGACATTGAGAAGCTAAGAGAACACTTGGAAATTCCAGAGTGGCAGGTCTTTGGAGGTTCCTG
GGGTAGTACGCTGGCTCTTGCTTATGGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGTAAAAAAGAAGTTGATTGGTTCTATG
AAGGTGGTGCTGCTGCTATATATCCTGACGCTTGGGAGTCTTTTAGAGACCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAGTAAGAGACTAAATTCA
AATGACATGGAAACCCAGTATGCTGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACTGCTCATCTTATGCCAAACGAGGAGAATATTAAGAGAGGGGAGGATGATAA
TTTTTCATTGGCGTTTGCAAGGATTGAAAACCATTACTTTATAAATAAGGGTTTTTTCCCTTCTGATTCCTTTCTGTTAGATAATGTTGACAAGATACGACATATCAATG
CTATAATTGTACAGGGAAGATATGATGTTTGCTGCCCAATGATGTCGGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCTGAGTTAAAGATCATTCAAAACGCAGGCCAT
TCTGCCAATGAACCTGGAATAGCTGCAGAGCTCGTGGCTGCGAACGAGAAGCTGAAGAACATCCTCCAGAAGAACGAACCA
Protein sequenceShow/hide protein sequence
GGKGLVLSAHFGYKSDRQSEFQTEDLMAREKESSEVNRNPYPPIEPYSNGFLKVSDIHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKS
TPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYGQSHPEKVTGLVLRGIFLLRKKEVDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNS
NDMETQYAAARAWTKWEMMTAHLMPNEENIKRGEDDNFSLAFARIENHYFINKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIQNAGH
SANEPGIAAELVAANEKLKNILQKNEP