; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018734 (gene) of Snake gourd v1 genome

Gene IDTan0018734
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProline iminopeptidase
Genome locationLG01:5398930..5408194
RNA-Seq ExpressionTan0018734
SyntenyTan0018734
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0004177 - aminopeptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR005944 - Proline iminopeptidase
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576891.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]3.1e-22091.44Show/hide
Query:  GLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTI
        GLCPNNS ++PL SFVSN H RHC RLFP SRVSN+FC SGGKGLVL AQFGYKSDSQ+ FQ KDLMA EK+   +N+ PYPPIEPYSTGLLKVSDLHTI
Subjt:  GLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTI

Query:  YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
        YWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+HP
Subjt:  YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP

Query:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
        EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FSL
Subjt:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL

Query:  AFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        AFARIENHYFVNKGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA LKII DAGHSANEPG+AAELVAANEKLKNILQKNGP
Subjt:  AFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

KAG7014918.1 Proline iminopeptidase [Cucurbita argyrosperma subsp. argyrosperma]1.1e-22091.46Show/hide
Query:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT
        LGLCPNNS ++PL SFVSN H RHC RLFP SRVSN+FC SGGKGLVL AQFGYKSDSQ+ FQ KDLMA EK+   +N+ PYPPIEPYSTGLLKVSDLHT
Subjt:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT

Query:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        LAFARIENHYFVNKGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA LKII DAGHSANEPG+AAELVAANEKLKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_022922976.1 proline iminopeptidase [Cucurbita moschata]3.1e-22091.21Show/hide
Query:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT
        LGLCPNNS ++PL SFVSN H RHC RLFP SRVSN+FC SGGKGLVL AQFGYKSDSQ+ FQ KDLMA EK+   +N+ PYPPIEPYSTGLLKVSDLHT
Subjt:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT

Query:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        LAFARIENHYFVNKGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA LKII DAGHSANEPG+AAELVA NEKLKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_023552784.1 proline iminopeptidase [Cucurbita pepo subsp. pepo]6.7e-22392.21Show/hide
Query:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT
        LGLCPNNS ++PL SFVSNSH RHC RLFP SRVSN+FC SGGKGLVL AQFGYKSDSQ+ FQ KDLMA EK+   +N+ PYPPIEPYSTGLLKVSDLHT
Subjt:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT

Query:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        LAFARIENHYFVNKGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA LKII DAGHSANEPG+AAELVAANEKLKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_038905843.1 proline iminopeptidase isoform X1 [Benincasa hispida]1.1e-22292.42Show/hide
Query:  LGLCPNNSSSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTI
        LGLCPNNSSSPLFSFVSN H RHC+RLFP  RVSN+ C  GGKGL LTA FGYKSDSQ+ FQPKDLMA EK+ S +NRNPYPPIEPYSTG LKVSDLHTI
Subjt:  LGLCPNNSSSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTI

Query:  YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
        YWEQSGNP GHPVVFLHGGPGGGT PGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
Subjt:  YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP

Query:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
        EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRG+DDNFSL
Subjt:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL

Query:  AFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNG
        AFARIENHYFVNKGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA LKII +AGHSANEPGIAAELVAANEKLKNILQKNG
Subjt:  AFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNG

TrEMBL top hitse value%identityAlignment
A0A0A0KW74 Proline iminopeptidase9.1e-21891.44Show/hide
Query:  LGLCPNNSSSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTI
        LGLCPNNSSSPLFSF SNSH R      P  R+SN  C SG KG V TAQ GYKSDSQ+ FQPKDLMA EK+ S + RNPYPPIEPYSTG LKVSDLHTI
Subjt:  LGLCPNNSSSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTI

Query:  YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
        YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
Subjt:  YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP

Query:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
        EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
Subjt:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL

Query:  AFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        AFARIENHYFVNKGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA LKII DAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  AFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5D3E1M3 Proline iminopeptidase2.1e-21490.43Show/hide
Query:  LGLCPNNSSSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTI
        LGLCPNNS SPLFSF SN H R      P  R+ N+ C  G KG V TAQ GYKSD Q+ FQPKDLMA EK+ S +NRNPYPPIEPYSTG LKVSDLHTI
Subjt:  LGLCPNNSSSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTI

Query:  YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
        YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP
Subjt:  YWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHP

Query:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
        EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL
Subjt:  EKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSL

Query:  AFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        AFARIENHYFVNKGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA LKII DAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  AFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A6J1D5A1 Proline iminopeptidase1.7e-21690Show/hide
Query:  MSLGLCPNNSSSPLFSFVSNSHS-RHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDL
        MSLG CPN SSS  FS VSNSHS RHCIRL   SRVSN+F  SGGKGLVL+A FGYKSD  + FQ +DLMA EK+ SE+NRNPYPPIEPYS G LKVSD+
Subjt:  MSLGLCPNNSSSPLFSFVSNSHS-RHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDL

Query:  HTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAY Q
Subjt:  HTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKE+DWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNS+DMETQYAAARAWTKWEMMTAHL+PNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYF+NKGFF SDSFLLDN+DKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEA LKII +AGHSANEPGIAAELVAANEKLKNILQKN P
Subjt:  FSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A6J1E5K8 Proline iminopeptidase1.5e-22091.21Show/hide
Query:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT
        LGLCPNNS ++PL SFVSN H RHC RLFP SRVSN+FC SGGKGLVL AQFGYKSDSQ+ FQ KDLMA EK+   +N+ PYPPIEPYSTGLLKVSDLHT
Subjt:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT

Query:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        LAFARIENHYFVNKGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHKVWPEA LKII DAGHSANEPG+AAELVA NEKLKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A6J1J3D7 Proline iminopeptidase2.8e-21990.7Show/hide
Query:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT
        LGLCPNNS ++PL SF+SNSH RHC RLFP SRV N FC SGGKGLVL AQFGYKSDSQ+ FQ KDLMA EK+    N+ PYPPIEPYSTGLLKVSDLHT
Subjt:  LGLCPNNS-SSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHT

Query:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH
        IYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTWDLI DIEKLREHL+IPEWQVFGGSWGSTLALAYSQ+H
Subjt:  IYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSH

Query:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS
        PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD FS
Subjt:  PEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFS

Query:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP
        LAFARIENHYFV+KGFF SDSFLLDNIDKIRHINA+IVQGRYDVCCPMMSAWDLHK WPEA LKII DAGHSANEPG+AAELVAANEKLKNILQKNGP
Subjt:  LAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP

SwissProt top hitse value%identityAlignment
O32449 Proline iminopeptidase1.4e-10354.19Show/hide
Query:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  YPP+  Y +G L   D H IYWE SGNP G P VF+HGGPGGG +P +R+ FDP+ Y+++LFDQRG G+S PHA L++NTTW L+ DIE+LRE   +
Subjt:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWE
         +W VFGGSWGSTLALAY+Q+HPE+V+ +VLRGIF LRK+ + W+Y+ GA+  +P+ WE    ++ + ER   + AY +RL S+D + Q  AA+ W+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGI
          T  LLP+ E+   GEDD F+LAFARIENHYF + GF  SD  LL N+  IRHI A+IV GRYD+ C + +AWDL K WPEA L I+  AGHS +EPGI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGI

Query:  AAELVAANEK
          +L+ A ++
Subjt:  AAELVAANEK

O83041 Probable proline iminopeptidase1.7e-11260.65Show/hide
Query:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  YP I PY +G+L VS LHTIY+EQSGNP G PVVFLHGGPGGGT P  R++FDP  +RIILFDQRGAGKSTPHA L +NTTWDL+ DIEKLR HL I
Subjt:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWE
          W VFGGSWGSTL+LAYSQ+HP++  GL+LRGIFLLR+KEI WFY+ GA+ I+PDAWE + + IP  ER   + AY +RL S D E +  AA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGI
          T+ L+ +     +  DD F+ AFARIE HYF+N+GFF +D  LL N D+I HI  +IVQGRYDV CPM SAW LHK  PE+ L ++ DAGHS  E GI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGI

Query:  AAELVAANEK
         + L+ A ++
Subjt:  AAELVAANEK

P93732 Proline iminopeptidase5.4e-16774.59Show/hide
Query:  CIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGG
        C+R FP +  + N    G + + ++   G KS+   V +   +   E +     R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGG
Subjt:  CIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGG

Query:  TAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY
        TAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTWDL++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFY
Subjt:  TAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY

Query:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFL
        EGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFF SDS L
Subjt:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFL

Query:  LDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNIL
        LDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEA LKI++DAGHSANEPGI+AELV ANEK+K ++
Subjt:  LDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNIL

Q87DF8 Proline iminopeptidase5.5e-10356.96Show/hide
Query:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  YP + P+  G+L V D H +Y+EQ GNP G PVV LHGGPGGG     RRF DPD YRI+LFDQRGAG+S PHA L +NTTWDL+ DIEKLR  L I
Subjt:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +   IP  ER   + A+ +RL S D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  +  LL +  +I +I  +IV GRYDV CP+ +AWDLHK WP+A+LKI   AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Q9PD69 Proline iminopeptidase1.4e-10357.28Show/hide
Query:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  YP + P+  G+L V D H +Y+EQ GNP G PVV LHGGPG G     RRF DPD YRI+LFDQRGAG+STPHA L +NTTWDL+ DIEKLR  L I
Subjt:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +  +IP  ER   + A+ +RL S D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  ++ LL +  +I +I  +IV GRYDV CP+ +AWDLHKVWP+A+LKI   AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase3.8e-16874.59Show/hide
Query:  CIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGG
        C+R FP +  + N    G + + ++   G KS+   V +   +   E +     R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGG
Subjt:  CIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGG

Query:  TAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY
        TAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTWDL++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFY
Subjt:  TAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY

Query:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFL
        EGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFF SDS L
Subjt:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFL

Query:  LDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNIL
        LDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEA LKI++DAGHSANEPGI+AELV ANEK+K ++
Subjt:  LDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNIL

AT2G14260.2 proline iminopeptidase8.0e-16683.86Show/hide
Query:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI
        R  Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTWDL++DIEKLREHL+I
Subjt:  RNPYPPIEPYSTGLLKVSDLHTIYWEQSGNPTGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYCKRLNSSDMETQYAAARAWTKW
        PEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKW
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYCKRLNSSDMETQYAAARAWTKW

Query:  EMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPG
        EMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFF SDS LLDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEA LKI++DAGHSANEPG
Subjt:  EMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDKIRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPG

Query:  IAAELVAANEKLKNIL
        I+AELV ANEK+K ++
Subjt:  IAAELVAANEKLKNIL

AT3G61540.1 alpha/beta-Hydrolases superfamily protein8.4e-0629.41Show/hide
Query:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---LEDNTTWDLID------------DIEKLREHL--EIPEWQVFGGSWGSTL
        +++L GGPG  G  P     +     + +R++L DQRG G STP  C   L+  +  +L D            D E +R  L  +   W + G S+G   
Subjt:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---LEDNTTWDLID------------DIEKLREHL--EIPEWQVFGGSWGSTL

Query:  ALAYSQSHPEKVTGLVLRG
        AL Y    PE +  +++ G
Subjt:  ALAYSQSHPEKVTGLVLRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTTAGGCCTCTGCCCTAACAATTCCTCTTCTCCTCTATTCTCCTTCGTCTCCAATTCGCATTCTCGTCACTGCATCCGCCTCTTCCCCGATTCTCGCGTTTCCAA
CAATTTCTGCGCATCAGGGGGAAAAGGTTTGGTCTTAACAGCGCAGTTTGGTTATAAAAGCGATAGTCAGACTGTGTTCCAACCAAAGGACTTGATGGCTGCAGAAAAGG
ATTTTTCAGAATTAAACAGAAACCCTTACCCACCTATTGAGCCATACAGTACTGGTCTGTTGAAAGTGTCAGATCTTCATACTATTTACTGGGAGCAATCTGGGAATCCT
ACCGGTCATCCAGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATTTTGTTTGATCAGCG
AGGTGCAGGGAAAAGTACCCCTCATGCTTGCTTGGAGGATAATACCACATGGGACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCGGAGTGGCAGG
TCTTTGGAGGTTCCTGGGGCAGTACGCTAGCTCTTGCTTACAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGAATCTTTCTTCTGCGGAAAAAAGAA
ATTGATTGGTTTTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGGTGTTTTGTTGATGCTTATTG
TAAGAGATTGAATTCAAGTGATATGGAAACCCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACTGCTCATCTTTTGCCAAATGAAGAGAACATTAAGA
GAGGCGAAGATGATAATTTTTCATTGGCATTTGCAAGGATCGAAAACCATTACTTTGTAAATAAGGGGTTTTTCTCTTCTGATTCCTTTCTCCTAGACAATATTGACAAG
ATACGACATATCAATGCTATAATTGTACAGGGAAGATATGATGTTTGCTGCCCTATGATGTCTGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCTACATTAAAGATCAT
TCACGATGCAGGCCATTCGGCCAACGAGCCTGGAATAGCTGCAGAGCTCGTGGCTGCAAACGAGAAACTGAAGAACATCCTCCAGAAGAATGGACCATAA
mRNA sequenceShow/hide mRNA sequence
CGCCACCATTGACGATGAGCTTAGGCCTCTGCCCTAACAATTCCTCTTCTCCTCTATTCTCCTTCGTCTCCAATTCGCATTCTCGTCACTGCATCCGCCTCTTCCCCGAT
TCTCGCGTTTCCAACAATTTCTGCGCATCAGGGGGAAAAGGTTTGGTCTTAACAGCGCAGTTTGGTTATAAAAGCGATAGTCAGACTGTGTTCCAACCAAAGGACTTGAT
GGCTGCAGAAAAGGATTTTTCAGAATTAAACAGAAACCCTTACCCACCTATTGAGCCATACAGTACTGGTCTGTTGAAAGTGTCAGATCTTCATACTATTTACTGGGAGC
AATCTGGGAATCCTACCGGTCATCCAGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGAATTATT
TTGTTTGATCAGCGAGGTGCAGGGAAAAGTACCCCTCATGCTTGCTTGGAGGATAATACCACATGGGACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAAT
TCCGGAGTGGCAGGTCTTTGGAGGTTCCTGGGGCAGTACGCTAGCTCTTGCTTACAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGAATCTTTCTTC
TGCGGAAAAAAGAAATTGATTGGTTTTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGGTGTTTT
GTTGATGCTTATTGTAAGAGATTGAATTCAAGTGATATGGAAACCCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACTGCTCATCTTTTGCCAAATGA
AGAGAACATTAAGAGAGGCGAAGATGATAATTTTTCATTGGCATTTGCAAGGATCGAAAACCATTACTTTGTAAATAAGGGGTTTTTCTCTTCTGATTCCTTTCTCCTAG
ACAATATTGACAAGATACGACATATCAATGCTATAATTGTACAGGGAAGATATGATGTTTGCTGCCCTATGATGTCTGCTTGGGATCTTCATAAAGTGTGGCCAGAGGCT
ACATTAAAGATCATTCACGATGCAGGCCATTCGGCCAACGAGCCTGGAATAGCTGCAGAGCTCGTGGCTGCAAACGAGAAACTGAAGAACATCCTCCAGAAGAATGGACC
ATAATAACACACTCCACAAGCTTCACCCCAATGCTCCCCAAAGAGACATTGTATTGTATTTCCAATGCCAGCTCTGGGAGACATCAAAGTTTTTATTTATTATGCTTTCT
AGTTGGAGGTTCGAACCTAAGATCTTGGAAACTTTCCTTCAAATTTCTCGAGTCCTGCCCTTCGAAGACTTTGTAACACCCTAGGGAGGGACCTTTTTTTTTTTTTAGGA
AACACAAACCATTTGGTTTTGTTTTGTTTGACCTTTATGAATGAAAGTACCAAAATAAGGGCTTGAAATCAAGAAATAAACAAAAGGTGCAAAAGGAAAATTGTCTCTCT
TTATATACATAGGGAGATTTTGTCAAAGATAAGATCCATACTCCCTTTCAAGGAAGAGAGATCTTAATTGATAAAGTTGGTAGAGATCAATGAGACTATTGGTTAGACCA
AAGACTAAAAAAATTTTTTACTCCCGTTTTTAAACATTCATGAATTTCTTGCTCTCGCTCTCCTGACTTAATTTTATCTTCTTTTTTTTCCGATCTTTTCTTCTTCAATC
TTACTCTCGGTCTTGCTCTTCTCTAACTTAATATTACCTTTTTCTTTCTTTTCGATCTGTCTTCTTCAATCTCACTCTTAACTTAATGTTATGTTCTTTCTTTCTTTTTT
TTCAATTATTTCTTCTTCAATCTTACTCTTAATCCTTGCGATTATGAGCGAGAGTGAGCAAGACTGCCTCATATTGTCCTCCTGGATAGTTTTGTATGCATTGATACGCC
AGAATGGTTATATATCATTAATTACAGGAAGTGGATCAAATGACATTAAAGTGTGAAAAATCCCAATATTGAAGATATTTAGTAAGTCTTTTTTTTAGCTCAACAGCAGG
TGAAGGTGAAGGTGAGAACTTGAACTTTTAATCTTTTGATTGATAATATAATACTTTAATCAATTGAATTATGCTCAGGTTAGTAATGTGAATGTAGCCATACTAATCAG
GTTTTATTAGTTATCTTACTTGGGAATGCGTACTATGAAAAACTAGCGCCTTCCATGTAAGTTGGACTCGCACCGCACGTGCAACCCAACCATAGTTGTTGAGACATCTG
AACATCATATCATCAACAAAGCATATGACTGCATCAAACTTCATAAGATGAACTAGCATGTTGGGGCTAAGGACTATCCCACTAATGCCATGTACAACTTCAGAGGAAGT
TTAGAAATCTGATTTTGAGTCACTTTATCTAGTTTATGCATGATGTACTAAGACTAATAATATTAATATTGAAAGTGTTCGTACGAAACATGTTAATTGTATTGGATAAA
GCACAGGCAAAATCTGTTATCAAGCTAAATTACGATATTTTGTCTGAAGAATGCAACAAGAAAGTACTGTATGTAATGGTATTAGTATCAAGATCACGTTCTTGAATAAT
TGGTTCAGAAGTAAAGAGAAAAAAACTAAGAC
Protein sequenceShow/hide protein sequence
MSLGLCPNNSSSPLFSFVSNSHSRHCIRLFPDSRVSNNFCASGGKGLVLTAQFGYKSDSQTVFQPKDLMAAEKDFSELNRNPYPPIEPYSTGLLKVSDLHTIYWEQSGNP
TGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWDLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKE
IDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYCKRLNSSDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFSSDSFLLDNIDK
IRHINAIIVQGRYDVCCPMMSAWDLHKVWPEATLKIIHDAGHSANEPGIAAELVAANEKLKNILQKNGP