; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029920 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029920
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProline iminopeptidase
Genome locationtig00153554:1140899..1148787
RNA-Seq ExpressionSgr029920
SyntenySgr029920
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0004177 - aminopeptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR005944 - Proline iminopeptidase
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014918.1 Proline iminopeptidase [Cucurbita argyrosperma subsp. argyrosperma]1.1e-21788.94Show/hide
Query:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT
        LG CP NS++ P  +F SN HYRHC  LFPVSR+SNH C+SGGKGLVL+A FGYKSD+QSEF+ +DLMA E+E    N+ PYPPIEPYSTG LKVSDLHT
Subjt:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT

Query:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWE+SGNP GHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        LAFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPG+AAELVAANE LKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

XP_022149275.1 proline iminopeptidase [Momordica charantia]7.6e-21990.25Show/hide
Query:  MRLGFCPKNSYSPPFFAFSNSH-YRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDL
        M LGFCP  S S PF + SNSH  RHC+ L  VSR+SNH  +SGGKGLVLSAHFGYKSD  SEF++EDLMARE+E SE NRNPYPPIEPYS GFLKVSD+
Subjt:  MRLGFCPKNSYSPPFFAFSNSH-YRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDL

Query:  HTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWE+SGNP GHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAY Q
Subjt:  HTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        FSLAFARIENHYF+NKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANE LKNILQKN P
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

XP_022922976.1 proline iminopeptidase [Cucurbita moschata]3.2e-21788.69Show/hide
Query:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT
        LG CP NS++ P  +F SN HYRHC  LFPVSR+SNH C+SGGKGLVL+A FGYKSD+QSEF+ +DLMA E+E    N+ PYPPIEPYSTG LKVSDLHT
Subjt:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT

Query:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWE+SGNP GHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        LAFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPG+AAELVA NE LKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

XP_023552784.1 proline iminopeptidase [Cucurbita pepo subsp. pepo]1.0e-21889.2Show/hide
Query:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT
        LG CP NS++ P  +F SNSHYRHC  LFPVSR+SNH C+SGGKGLVL+A FGYKSD+QSEF+ +DLMA E+E    N+ PYPPIEPYSTG LKVSDLHT
Subjt:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT

Query:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWE+SGNP GHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLIDDIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        LAFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPG+AAELVAANE LKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

XP_038905843.1 proline iminopeptidase isoform X1 [Benincasa hispida]1.3e-21890.15Show/hide
Query:  LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTI
        LG CP NS SP F   SN H+RHCL LFPV R+SNH C+ GGKGL L+AHFGYKSD+QSEF+ +DLMA E+E S  NRNPYPPIEPYSTGFLKVSDLHTI
Subjt:  LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTI

Query:  YWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
        YWE+SGNP GHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
Subjt:  YWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP

Query:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
        EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRG+DDNFSL
Subjt:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL

Query:  AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNG
        AFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANE LKNILQKNG
Subjt:  AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNG

TrEMBL top hitse value%identityAlignment
A0A0A0KW74 Proline iminopeptidase6.1e-21489.42Show/hide
Query:  LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTI
        LG CP NS SP F  FSNSH R      PV RLSN  C+SG KG V +A  GYKSD+QSEF+ +DLMA E+E S   RNPYPPIEPYSTGFLKVSDLHTI
Subjt:  LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTI

Query:  YWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
        YWE+SGNPTGHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
Subjt:  YWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP

Query:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
        EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
Subjt:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL

Query:  AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        AFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPGIAAELVAANE LKNILQKNGP
Subjt:  AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

A0A5D3E1M3 Proline iminopeptidase1.4e-21389.17Show/hide
Query:  LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTI
        LG CP NS S P F+FSN H+R      PV RL NH C+ G KG V +A  GYKSD QSEF+ +DLMA E+E S  NRNPYPPIEPYSTGFLKVSDLHTI
Subjt:  LGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTI

Query:  YWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
        YWE+SGNPTGHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
Subjt:  YWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP

Query:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
        EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
Subjt:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL

Query:  AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        AFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPGIAAELVAANE LKNILQKNGP
Subjt:  AFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

A0A6J1D5A1 Proline iminopeptidase3.7e-21990.25Show/hide
Query:  MRLGFCPKNSYSPPFFAFSNSH-YRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDL
        M LGFCP  S S PF + SNSH  RHC+ L  VSR+SNH  +SGGKGLVLSAHFGYKSD  SEF++EDLMARE+E SE NRNPYPPIEPYS GFLKVSD+
Subjt:  MRLGFCPKNSYSPPFFAFSNSH-YRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDL

Query:  HTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWE+SGNP GHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAY Q
Subjt:  HTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        FSLAFARIENHYF+NKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKII +AGHSANEPGIAAELVAANE LKNILQKN P
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

A0A6J1E5K8 Proline iminopeptidase1.6e-21788.69Show/hide
Query:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT
        LG CP NS++ P  +F SN HYRHC  LFPVSR+SNH C+SGGKGLVL+A FGYKSD+QSEF+ +DLMA E+E    N+ PYPPIEPYSTG LKVSDLHT
Subjt:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT

Query:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWE+SGNP GHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        LAFARIENHYFVNKGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPG+AAELVA NE LKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

A0A6J1J3D7 Proline iminopeptidase2.2e-21688.19Show/hide
Query:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT
        LG CP NS++ P  +F SNSHYRHC  LFPVSR+ N  C+SGGKGLVL+A FGYKSD+QS+F+ +DLMA E+E   TN+ PYPPIEPYSTG LKVSDLHT
Subjt:  LGFCPKNSYSPPFFAF-SNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHT

Query:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWE+SGNP GHPVVFLHGGPGGGT+PGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP
        LAFARIENHYFV+KGFFPSDSFLLDN+DKIRHINA+IVQGRYDVCCPMMSAWDLHK WPEAELKIIPDAGHSANEPG+AAELVAANE LKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP

SwissProt top hitse value%identityAlignment
O32449 Proline iminopeptidase5.9e-10555.45Show/hide
Query:  ETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREH
        E  R  YPP+  Y +G+L   D H IYWE SGNP G P VF+HGGPGGG SP +R+ FDP+ Y+++LFDQRG G+S PHA L+NNTTW L+ DIE+LRE 
Subjt:  ETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREH

Query:  LEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWT
          + +W VFGGSWGSTLALAY+Q+HPE+V+ +VLRGIF LRK+ + W+Y+ GA+  +P+ WE    ++ + ER   + AY +RL S D + Q  AA+ W+
Subjt:  LEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWT

Query:  KWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANE
         WE  T  LLP+ E+   GEDD F+LAFARIENHYF + GF  SD  LL NV  IRHI A+IV GRYD+ C + +AWDL K WPEAEL I+  AGHS +E
Subjt:  KWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANE

Query:  PGIAAELVAANE
        PGI  +L+ A +
Subjt:  PGIAAELVAANE

O83041 Probable proline iminopeptidase5.9e-11361.17Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEI
        R  YP I PY +G L VS LHTIY+E+SGNP G PVVFLHGGPGGGT P  R++FDP  +RIILFDQRGAGKSTPHA L  NTTWDL+ DIEKLR HL I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE
          W VFGGSWGSTL+LAYSQ+HP++  GL+LRGIFLLR+KEI WFY+ GA+ I+PDAWE + + IP  ER   + AY +RL S D E +  AA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI
          T+ L+ +     +  DD F+ AFARIE HYF+N+GFF +D  LL N D+I HI  +IVQGRYDV CPM SAW LHK  PE+EL ++PDAGHS  E GI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI

Query:  AAELVAANE
         + L+ A +
Subjt:  AAELVAANE

P93732 Proline iminopeptidase9.2e-16774.86Show/hide
Query:  CLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGG
        C+  FP +  + ++   G + + +S   G KS+     KS+ +   E E     R  Y PIEPYS+G LKVSD+HT+YWE+SG P GHPVVFLHGGPGGG
Subjt:  CLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGG

Query:  TSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY
        T+P NRRFFDP+FYRI+LFDQRGAGKSTPHACLE NTTWDL++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFY
Subjt:  TSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY

Query:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL
        EGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS+D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS L
Subjt:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL

Query:  LDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNIL
        LDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPGI+AELV ANE +K ++
Subjt:  LDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNIL

Q87DF8 Proline iminopeptidase1.0e-10457.1Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEI
        R  YP + P+  G L V D H +Y+E+ GNP G PVV LHGGPGGG +   RRF DPD YRI+LFDQRGAG+S PHA L NNTTWDL+ DIEKLR  L I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +   IP  ER   + A+ +RL S+D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  +  LL +  +I +I  +IV GRYDV CP+ +AWDLHK WP+A LKI P AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI

Query:  AAELVAANEN
           LV A ++
Subjt:  AAELVAANEN

Q9PD69 Proline iminopeptidase3.4e-10557.42Show/hide
Query:  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEI
        R  YP + P+  G L V D H +Y+E+ GNP G PVV LHGGPG G +   RRF DPD YRI+LFDQRGAG+STPHA L NNTTWDL+ DIEKLR  L I
Subjt:  RNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +  +IP  ER   + A+ +RL S D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  ++ LL +  +I +I  +IV GRYDV CP+ +AWDLHKVWP+A LKI P AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGI

Query:  AAELVAANEN
           LV A ++
Subjt:  AAELVAANEN

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase6.6e-16874.86Show/hide
Query:  CLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGG
        C+  FP +  + ++   G + + +S   G KS+     KS+ +   E E     R  Y PIEPYS+G LKVSD+HT+YWE+SG P GHPVVFLHGGPGGG
Subjt:  CLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGG

Query:  TSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY
        T+P NRRFFDP+FYRI+LFDQRGAGKSTPHACLE NTTWDL++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFY
Subjt:  TSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY

Query:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL
        EGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS+D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS L
Subjt:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL

Query:  LDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNIL
        LDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPGI+AELV ANE +K ++
Subjt:  LDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNIL

AT2G14260.2 proline iminopeptidase8.0e-16682.41Show/hide
Query:  EQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIE
        E E     R  Y PIEPYS+G LKVSD+HT+YWE+SG P GHPVVFLHGGPGGGT+P NRRFFDP+FYRI+LFDQRGAGKSTPHACLE NTTWDL++DIE
Subjt:  EQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNPTGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIE

Query:  KLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSNDMETQYA
        KLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS+D+E QYA
Subjt:  KLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSNDMETQYA

Query:  AARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDA
        AARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS LLDNVDKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DA
Subjt:  AARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDA

Query:  GHSANEPGIAAELVAANENLKNIL
        GHSANEPGI+AELV ANE +K ++
Subjt:  GHSANEPGIAAELVAANENLKNIL

AT3G61540.1 alpha/beta-Hydrolases superfamily protein8.4e-0626.92Show/hide
Query:  DNQSEFKSEDLMAREQEFSETNRNPYPPIEP--YSTGFLKVS----DLHTIYWEESGNPTGHPVVFLHGGPG-GGTSPGNRRFFDP---DFYRIILFDQR
        D   E KSE +  +     E     +  I P  YS    K++    ++  +  EE   P    +++L GGPG  G  P     +     + +R++L DQR
Subjt:  DNQSEFKSEDLMAREQEFSETNRNPYPPIEP--YSTGFLKVS----DLHTIYWEESGNPTGHPVVFLHGGPG-GGTSPGNRRFFDP---DFYRIILFDQR

Query:  GAGKSTPHAC---LENNTTWDLID------------DIEKLREHL--EIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRG
        G G STP  C   L+  +  +L D            D E +R  L  +   W + G S+G   AL Y    PE +  +++ G
Subjt:  GAGKSTPHAC---LENNTTWDLID------------DIEKLREHL--EIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTTAGGCTTTTGCCCTAAGAATTCATATTCTCCTCCGTTTTTCGCCTTCTCCAATTCGCACTATCGTCACTGCCTCTGTCTCTTCCCCGTCTCTCGTCTTTCCAA
CCATATCTGCATCTCAGGGGGAAAAGGTTTGGTCTTAAGTGCTCATTTTGGTTATAAGAGCGATAATCAGAGTGAGTTCAAATCAGAGGACTTGATGGCTCGAGAACAGG
AATTTTCAGAGACAAACAGAAACCCTTACCCACCTATAGAACCATACAGTACTGGTTTTTTGAAGGTGTCGGATCTTCATACTATTTATTGGGAGGAATCAGGGAATCCC
ACTGGTCATCCGGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTTCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGATCAGCG
AGGTGCAGGGAAAAGTACCCCACATGCTTGCTTGGAGAATAATACCACATGGGACCTCATTGATGACATTGAGAAGCTAAGAGAACACTTGGAAATTCCAGAGTGGCAGG
TCTTTGGAGGTTCCTGGGGTAGTACGCTGGCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAAGAA
ATTGATTGGTTCTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAG
TAAGAGATTAAATTCAAATGACATGGAAACCCAATATGCAGCCGCAAGAGCATGGACCAAATGGGAAATGATGACTGCTCATCTATTGCCAAATGAGGAGAACATTAAGA
GAGGGGAAGATGATAATTTTTCATTGGCATTTGCAAGGATCGAAAACCATTACTTTGTAAATAAGGGGTTTTTCCCTTCTGATTCCTTTCTGCTAGATAATGTTGACAAG
ATACGACATATCAATGCTATAATTGTACAGGGAAGATATGATGTTTGCTGCCCCATGATGTCAGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCTGAGTTAAAGATCAT
TCCAGACGCAGGCCATTCAGCTAATGAACCTGGAATAGCTGCAGAGCTTGTCGCCGCAAATGAGAATCTGAAGAACATTCTCCAGAAGAATGGACCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTTAGGCTTTTGCCCTAAGAATTCATATTCTCCTCCGTTTTTCGCCTTCTCCAATTCGCACTATCGTCACTGCCTCTGTCTCTTCCCCGTCTCTCGTCTTTCCAA
CCATATCTGCATCTCAGGGGGAAAAGGTTTGGTCTTAAGTGCTCATTTTGGTTATAAGAGCGATAATCAGAGTGAGTTCAAATCAGAGGACTTGATGGCTCGAGAACAGG
AATTTTCAGAGACAAACAGAAACCCTTACCCACCTATAGAACCATACAGTACTGGTTTTTTGAAGGTGTCGGATCTTCATACTATTTATTGGGAGGAATCAGGGAATCCC
ACTGGTCATCCGGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTTCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGATCAGCG
AGGTGCAGGGAAAAGTACCCCACATGCTTGCTTGGAGAATAATACCACATGGGACCTCATTGATGACATTGAGAAGCTAAGAGAACACTTGGAAATTCCAGAGTGGCAGG
TCTTTGGAGGTTCCTGGGGTAGTACGCTGGCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAAGAA
ATTGATTGGTTCTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAG
TAAGAGATTAAATTCAAATGACATGGAAACCCAATATGCAGCCGCAAGAGCATGGACCAAATGGGAAATGATGACTGCTCATCTATTGCCAAATGAGGAGAACATTAAGA
GAGGGGAAGATGATAATTTTTCATTGGCATTTGCAAGGATCGAAAACCATTACTTTGTAAATAAGGGGTTTTTCCCTTCTGATTCCTTTCTGCTAGATAATGTTGACAAG
ATACGACATATCAATGCTATAATTGTACAGGGAAGATATGATGTTTGCTGCCCCATGATGTCAGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCTGAGTTAAAGATCAT
TCCAGACGCAGGCCATTCAGCTAATGAACCTGGAATAGCTGCAGAGCTTGTCGCCGCAAATGAGAATCTGAAGAACATTCTCCAGAAGAATGGACCATAA
Protein sequenceShow/hide protein sequence
MRLGFCPKNSYSPPFFAFSNSHYRHCLCLFPVSRLSNHICISGGKGLVLSAHFGYKSDNQSEFKSEDLMAREQEFSETNRNPYPPIEPYSTGFLKVSDLHTIYWEESGNP
TGHPVVFLHGGPGGGTSPGNRRFFDPDFYRIILFDQRGAGKSTPHACLENNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKE
IDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSNDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNVDK
IRHINAIIVQGRYDVCCPMMSAWDLHKVWPEAELKIIPDAGHSANEPGIAAELVAANENLKNILQKNGP