CuGenDBv2

Cla97C10G192760 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C10G192760
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Proline iminopeptidase
Genome location: Cla97Chr10:20656892..20664380
RNA-Seq Expression: Cla97C10G192760
Synteny: Cla97C10G192760
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0004177 - aminopeptidase activity (molecular function)
InterPro domains:
IPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR005944 - Proline iminopeptidase
IPR029058 - Alpha/Beta hydrolase fold
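
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the genomic span of the gene could be computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Cla97Chr10:20656892..20664380")
# Assumption: 1-based, inclusive coordinates, hence the +1.
span = end - start + 1
print(chrom, span)  # Cla97Chr10 7489
```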


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044299.1 proline iminopeptidase [Cucumis melo var. makuwa] (e value: 4.2e-225; identity: 94%)
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD QSEFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTA GNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD 
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_008454439.1 PREDICTED: proline iminopeptidase [Cucumis melo] (e value: 1.0e-226; identity: 94.5%)
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD QSEFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_011653440.1 proline iminopeptidase [Cucumis sativus] (e value: 2.4e-228; identity: 95%)
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNSSSPLFSF SN H R      PVPR+SN CC+SG KG V TAQ GYKSDSQSEFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_023552784.1 proline iminopeptidase [Cucurbita pepo subsp. pepo] (e value: 3.2e-225; identity: 93.02%)
Query:  MSLLGLCPNNS-SSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSD
        M LLGLCPNNS ++PL SFVSN H+RHC RLFPV RVSNH CVSG KGLVL AQFGYKSDSQSEFQ KDLMA EKEI GI ++PYPPIEPYSTG LKVSD
Subjt:  MSLLGLCPNNS-SSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHL+IPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
        Q+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
Subjt:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD

Query:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
         FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPG+AAELVAANEKLKNILQKNG
Subjt:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

XP_038905843.1 proline iminopeptidase isoform X1 [Benincasa hispida] (e value: 1.2e-235; identity: 97.24%)
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCV G KGL LTA FGYKSDSQSEFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNPAGHPVVFLHGGPGGGT PGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRG+DDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIIS+AGHSANEPGIAAELVAANEKLKNILQKNG
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KW74 Proline iminopeptidase (e value: 1.2e-228; identity: 95%)
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNSSSPLFSF SN H R      PVPR+SN CC+SG KG V TAQ GYKSDSQSEFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A1S3BYQ6 Proline iminopeptidase (e value: 4.8e-227; identity: 94.5%)
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD QSEFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5A7TSV1 Proline iminopeptidase (e value: 2.0e-225; identity: 94%)
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD QSEFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTA GNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD 
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5D3E1M3 Proline iminopeptidase (e value: 4.8e-227; identity: 94.5%)
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD QSEFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A6J1E5K8 Proline iminopeptidase (e value: 2.9e-224; identity: 92.52%)
Query:  MSLLGLCPNNS-SSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSD
        M LLGLCPNNS ++PL SFVSNLH+RHC RLFPV RVSNH CVSG KGLVL AQFGYKSDSQSEFQ KDLMA EKEI GI ++PYPPIEPYSTG LKVSD
Subjt:  MSLLGLCPNNS-SSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LI DIEKLREHL+IPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
        Q+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS DMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
Subjt:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDD

Query:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
         FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPG+AAELVA NEKLKNILQKNG
Subjt:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

SwissProt top hits (e value, % identity, alignment)
O32449 Proline iminopeptidase (e value: 1.7e-104; identity: 54.84%)
Query:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YPP+  Y +G+L   D H IYWE SGNP G P VF+HGGPGGG +P +R+ FDP+ Y+++LFDQRG G+S PHA L++NTTW+L+ DIE+LRE   +
Subjt:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
         +W VFGGSWGSTLALAY+Q+HPE+V+ +VLRGIF LRK+ + W+Y+ GA+  +P+ WE    ++ + ER   + AY +RL S D + Q  AA+ W+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T  LLP+ E+   GEDD F+LAFARIENHYF + GF  SD  LL N+  IRHI AVIV GRYD+ C + +AWDL K WPEAEL I+  AGHS +EPGI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANEK
          +L+ A ++
Subjt:  AAELVAANEK

O83041 Probable proline iminopeptidase (e value: 7.7e-113; identity: 61.29%)
Query:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YP I PY +G L VS LHTIY+EQSGNP G PVVFLHGGPGGGT P  R++FDP  +RIILFDQRGAGKSTPHA L +NTTW+L+ DIEKLR HL I
Subjt:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          W VFGGSWGSTL+LAYSQ+HP++  GL+LRGIFLLR+KEI WFY+ GA+ I+PDAWE + + IP  ER   + AY +RL SKD E +  AA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L+ +     +  DD F+ AFARIE HYF+N+GFF +D  LL N D+I HI  VIVQGRYDV CPM SAW LHK  PE+EL ++ DAGHS  E GI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANEK
         + L+ A ++
Subjt:  AAELVAANEK

P93732 Proline iminopeptidase (e value: 5.4e-167; identity: 75.6%)
Query:  CLRLFPVPRVSNHCCVSGTKGLVLTAQ--FGYKSDSQSEFQPKDLM-AEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGP
        C+R FP    SNH        L+   Q         +SE    D M   E E    KR+ Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGP
Subjt:  CLRLFPVPRVSNHCCVSGTKGLVLTAQ--FGYKSDSQSEFQPKDLM-AEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGP

Query:  GGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEID
        GGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEID
Subjt:  GGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEID

Query:  WFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSD
        WFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSD
Subjt:  WFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSD

Query:  SFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL
        S LLDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPGI+AELV ANEK+K ++
Subjt:  SFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL

Q87DF8 Proline iminopeptidase (e value: 2.1e-102; identity: 56.96%)
Query:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R+ YP + P+  G L V D H +Y+EQ GNP G PVV LHGGPGGG     RRF DPD YRI+LFDQRGAG+S PHA L +NTTW+L+ DIEKLR  L I
Subjt:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +   IP  ER   + A+ +RL S D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  +  LL +  +I +I  VIV GRYDV CP+ +AWDLHK WP+A LKI   AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Q9PD69 Proline iminopeptidase (e value: 3.2e-103; identity: 57.28%)
Query:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R+ YP + P+  G L V D H +Y+EQ GNP G PVV LHGGPG G     RRF DPD YRI+LFDQRGAG+STPHA L +NTTW+L+ DIEKLR  L I
Subjt:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +  +IP  ER   + A+ +RL S+D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  ++ LL +  +I +I  VIV GRYDV CP+ +AWDLHKVWP+A LKI   AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Arabidopsis top hits (e value, % identity, alignment)
AT2G14260.1 proline iminopeptidase (e value: 3.8e-168; identity: 75.6%)
Query:  CLRLFPVPRVSNHCCVSGTKGLVLTAQ--FGYKSDSQSEFQPKDLM-AEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGP
        C+R FP    SNH        L+   Q         +SE    D M   E E    KR+ Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGP
Subjt:  CLRLFPVPRVSNHCCVSGTKGLVLTAQ--FGYKSDSQSEFQPKDLM-AEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGP

Query:  GGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEID
        GGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEID
Subjt:  GGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEID

Query:  WFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSD
        WFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSD
Subjt:  WFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSD

Query:  SFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL
        S LLDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPGI+AELV ANEK+K ++
Subjt:  SFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL

AT2G14260.2 proline iminopeptidase (e value: 4.3e-167; identity: 84.23%)
Query:  KRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLE
        KR+ Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+
Subjt:  KRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLE

Query:  IPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTK
        IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D+E QYAAARAWTK
Subjt:  IPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMETQYAAARAWTK

Query:  WEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEP
        WEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS LLDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEP
Subjt:  WEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEP

Query:  GIAAELVAANEKLKNIL
        GI+AELV ANEK+K ++
Subjt:  GIAAELVAANEKLKNIL

AT3G61540.1 alpha/beta-Hydrolases superfamily protein (e value: 3.8e-06; identity: 28.57%)
Query:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---------------LEDNTTWNLIDDIEKLREHL--EIPEWQVFGGSWGSTL
        +++L GGPG  G  P     +     + +R++L DQRG G STP  C               L      N++ D E +R  L  +   W + G S+G   
Subjt:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---------------LEDNTTWNLIDDIEKLREHL--EIPEWQVFGGSWGSTL

Query:  ALAYSQSHPEKVTGLVLRG
        AL Y    PE +  +++ G
Subjt:  ALAYSQSHPEKVTGLVLRG


Sequences
CDS sequence
ATGAGCTTGTTAGGCCTATGCCCTAACAATTCCTCTTCTCCTCTGTTCTCATTCGTCTCCAATTTGCATTTTCGTCACTGCCTTCGCCTCTTCCCCGTCCCTCGCGTTTC
CAACCATTGCTGTGTATCAGGGACAAAAGGTTTGGTCTTAACAGCGCAATTTGGTTATAAGAGCGATAGTCAGAGTGAGTTCCAACCAAAGGATTTGATGGCTGAAGAAA
AGGAAATTTCAGGAATAAAAAGAAGTCCTTACCCACCTATTGAGCCATACAGTACTGGTTTCTTGAAAGTGTCGGATCTTCACACTATTTACTGGGAGCAATCTGGGAAT
CCCGCCGGTCATCCGGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGGATTATTTTGTTTGATCA
GCGAGGTGCAGGGAAAAGTACCCCACATGCTTGCTTGGAGGATAATACCACGTGGAACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCAGAGTGGC
AGGTCTTTGGAGGTTCCTGGGGAAGTACGTTGGCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAA
GAAATTGATTGGTTTTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTA
TAGTAAGAGATTGAATTCAAAGGATATGGAAACCCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACTGCTCATCTTTTGCCAAATGAGGAGAACATTA
AAAGAGGGGAAGATGATAATTTTTCATTGGCATTTGCAAGGATTGAAAACCATTACTTCGTAAATAAGGGGTTTTTCCCTTCAGATTCCTTTCTGCTAGATAATATTGAC
AAGATACGACATATCAATGCTGTAATTGTACAGGGAAGATATGACGTTTGCTGCCCAATGATGTCCGCTTGGGATCTTCATAAAGTGTGGCCCGAGGCTGAATTAAAGAT
CATTTCTGACGCAGGCCATTCTGCTAACGAGCCAGGAATAGCTGCTGAGCTCGTGGCTGCGAATGAGAAGCTGAAGAACATCCTCCAGAAGAATGGACCATAG
mRNA sequence
ATGAGCTTGTTAGGCCTATGCCCTAACAATTCCTCTTCTCCTCTGTTCTCATTCGTCTCCAATTTGCATTTTCGTCACTGCCTTCGCCTCTTCCCCGTCCCTCGCGTTTC
CAACCATTGCTGTGTATCAGGGACAAAAGGTTTGGTCTTAACAGCGCAATTTGGTTATAAGAGCGATAGTCAGAGTGAGTTCCAACCAAAGGATTTGATGGCTGAAGAAA
AGGAAATTTCAGGAATAAAAAGAAGTCCTTACCCACCTATTGAGCCATACAGTACTGGTTTCTTGAAAGTGTCGGATCTTCACACTATTTACTGGGAGCAATCTGGGAAT
CCCGCCGGTCATCCGGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGGATTATTTTGTTTGATCA
GCGAGGTGCAGGGAAAAGTACCCCACATGCTTGCTTGGAGGATAATACCACGTGGAACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCAGAGTGGC
AGGTCTTTGGAGGTTCCTGGGGAAGTACGTTGGCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAA
GAAATTGATTGGTTTTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAGTCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTA
TAGTAAGAGATTGAATTCAAAGGATATGGAAACCCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACTGCTCATCTTTTGCCAAATGAGGAGAACATTA
AAAGAGGGGAAGATGATAATTTTTCATTGGCATTTGCAAGGATTGAAAACCATTACTTCGTAAATAAGGGGTTTTTCCCTTCAGATTCCTTTCTGCTAGATAATATTGAC
AAGATACGACATATCAATGCTGTAATTGTACAGGGAAGATATGACGTTTGCTGCCCAATGATGTCCGCTTGGGATCTTCATAAAGTGTGGCCCGAGGCTGAATTAAAGAT
CATTTCTGACGCAGGCCATTCTGCTAACGAGCCAGGAATAGCTGCTGAGCTCGTGGCTGCGAATGAGAAGCTGAAGAACATCCTCCAGAAGAATGGACCATAG
Protein sequence
MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQSEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGN
PAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKK
EIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMETQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNID
KIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
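
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first five and last four codons of the CDS listing are used as short examples rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped listings
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First five codons of the CDS listing.
print(translate("ATGAGCTTGTTAGGC"))  # MSLLG
# Last four codons of the CDS listing, including the stop codon.
print(translate("AATGGACCATAG"))     # NGP*
```

Pasting the full CDS into `translate` and comparing the result (minus the trailing `*`) with the protein listing is one way to confirm that the two records agree.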