; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G194920 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G194920
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProline iminopeptidase
Genome locationCicolChr10:19540133..19547839
RNA-Seq ExpressionCcUC10G194920
SyntenyCcUC10G194920
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0004177 - aminopeptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR005944 - Proline iminopeptidase
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044299.1 proline iminopeptidase [Cucumis melo var. makuwa]2.1e-22493.5Show/hide
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD Q EFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTA GNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDD 
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_008454439.1 PREDICTED: proline iminopeptidase [Cucumis melo]5.0e-22694Show/hide
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD Q EFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_011653440.1 proline iminopeptidase [Cucumis sativus]1.2e-22794.5Show/hide
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNSSSPLFSF SN H R      PVPR+SN CC+SG KG V TAQ GYKSDSQ EFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

XP_023552784.1 proline iminopeptidase [Cucurbita pepo subsp. pepo]1.6e-22492.52Show/hide
Query:  MSLLGLCPNNS-SSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSD
        M LLGLCPNNS ++PL SFVSN H+RHC RLFPV RVSNH CVSG KGLVL AQFGYKSDSQ EFQ KDLMA EKEI GI ++PYPPIEPYSTG LKVSD
Subjt:  MSLLGLCPNNS-SSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTW+LIDDIEKLREHL+IPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
        Q+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS DM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
Subjt:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDD

Query:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
         FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPG+AAELVAANEKLKNILQKNG
Subjt:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

XP_038905843.1 proline iminopeptidase isoform X1 [Benincasa hispida]5.8e-23596.74Show/hide
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCV G KGL LTA FGYKSDSQ EFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNPAGHPVVFLHGGPGGGT PGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAY KRLNSKDM+TQYAAARAWTKWEMMTAHLLPNEENIKRG+DDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIIS+AGHSANEPGIAAELVAANEKLKNILQKNG
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

TrEMBL top hitse value%identityAlignment
A0A0A0KW74 Proline iminopeptidase5.7e-22894.5Show/hide
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNSSSPLFSF SN H R      PVPR+SN CC+SG KG V TAQ GYKSDSQ EFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A1S3BYQ6 Proline iminopeptidase2.4e-22694Show/hide
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD Q EFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5A7TSV1 Proline iminopeptidase1.0e-22493.5Show/hide
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD Q EFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTA GNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDD 
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A5D3E1M3 Proline iminopeptidase2.4e-22694Show/hide
Query:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL
        MSLLGLCPNNS SPLFSF SN HFR      PVPR+ NHCC+ G KG V TAQ GYKSD Q EFQPKDLMA EKEISGI R+PYPPIEPYSTGFLKVSDL
Subjt:  MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDL

Query:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
        HTIYWEQSGNP GHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ
Subjt:  HTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQ

Query:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
        SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN
Subjt:  SHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDN

Query:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
        FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP
Subjt:  FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP

A0A6J1E5K8 Proline iminopeptidase1.5e-22392.02Show/hide
Query:  MSLLGLCPNNS-SSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSD
        M LLGLCPNNS ++PL SFVSNLH+RHC RLFPV RVSNH CVSG KGLVL AQFGYKSDSQ EFQ KDLMA EKEI GI ++PYPPIEPYSTG LKVSD
Subjt:  MSLLGLCPNNS-SSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSD

Query:  LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS
        LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLE+NTTW+LI DIEKLREHL+IPEWQVFGGSWGSTLALAYS
Subjt:  LHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYS

Query:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
        Q+HPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE+FRDLIPESERGCFVDAY KRLNS DM+TQYAAARAWTKWEMMTAHLLPNEENIKRGEDD
Subjt:  QSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDD

Query:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG
         FSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKII DAGHSANEPG+AAELVA NEKLKNILQKNG
Subjt:  NFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNG

Query:  P
        P
Subjt:  P

SwissProt top hitse value%identityAlignment
O32449 Proline iminopeptidase1.7e-10454.84Show/hide
Query:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YPP+  Y +G+L   D H IYWE SGNP G P VF+HGGPGGG +P +R+ FDP+ Y+++LFDQRG G+S PHA L++NTTW+L+ DIE+LRE   +
Subjt:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWE
         +W VFGGSWGSTLALAY+Q+HPE+V+ +VLRGIF LRK+ + W+Y+ GA+  +P+ WE    ++ + ER   + AY +RL S D   Q  AA+ W+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T  LLP+ E+   GEDD F+LAFARIENHYF + GF  SD  LL N+  IRHI AVIV GRYD+ C + +AWDL K WPEAEL I+  AGHS +EPGI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANEK
          +L+ A ++
Subjt:  AAELVAANEK

O83041 Probable proline iminopeptidase1.3e-11260.97Show/hide
Query:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R  YP I PY +G L VS LHTIY+EQSGNP G PVVFLHGGPGGGT P  R++FDP  +RIILFDQRGAGKSTPHA L +NTTW+L+ DIEKLR HL I
Subjt:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWE
          W VFGGSWGSTL+LAYSQ+HP++  GL+LRGIFLLR+KEI WFY+ GA+ I+PDAWE + + IP  ER   + AY +RL SKD + +  AA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L+ +     +  DD F+ AFARIE HYF+N+GFF +D  LL N D+I HI  VIVQGRYDV CPM SAW LHK  PE+EL ++ DAGHS  E GI
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANEK
         + L+ A ++
Subjt:  AAELVAANEK

P93732 Proline iminopeptidase1.2e-16674.59Show/hide
Query:  CLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGG
        C+R FP    + +    G + + ++   G KS+     +   +   E E    KR+ Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGG
Subjt:  CLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGG

Query:  TAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY
        TAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFY
Subjt:  TAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY

Query:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL
        EGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D++ QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS L
Subjt:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL

Query:  LDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL
        LDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPGI+AELV ANEK+K ++
Subjt:  LDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL

Q87DF8 Proline iminopeptidase2.1e-10256.96Show/hide
Query:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R+ YP + P+  G L V D H +Y+EQ GNP G PVV LHGGPGGG     RRF DPD YRI+LFDQRGAG+S PHA L +NTTW+L+ DIEKLR  L I
Subjt:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +   IP  ER   + A+ +RL S D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  +  LL +  +I +I  VIV GRYDV CP+ +AWDLHK WP+A LKI   AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Q9PD69 Proline iminopeptidase3.2e-10357.28Show/hide
Query:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI
        R+ YP + P+  G L V D H +Y+EQ GNP G PVV LHGGPG G     RRF DPD YRI+LFDQRGAG+STPHA L +NTTW+L+ DIEKLR  L I
Subjt:  RSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEI

Query:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWE
          WQVFGGSWGSTLALAY+Q+HPE+ T LVLRGIF+LR+ E++WFY+ GA+ ++PDAW+ +  +IP  ER   + A+ +RL S+D  T+ AAA+AW+ WE
Subjt:  PEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWE

Query:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI
          T+ L  +++ I   E+ +F+LAFARIENHYFVN GFF  ++ LL +  +I +I  VIV GRYDV CP+ +AWDLHKVWP+A LKI   AGHSA EP  
Subjt:  MMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGI

Query:  AAELVAANE
           LV A +
Subjt:  AAELVAANE

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase8.6e-16874.59Show/hide
Query:  CLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGG
        C+R FP    + +    G + + ++   G KS+     +   +   E E    KR+ Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGG
Subjt:  CLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGG

Query:  TAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY
        TAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFY
Subjt:  TAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFY

Query:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL
        EGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D++ QYAAARAWTKWEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS L
Subjt:  EGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFL

Query:  LDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL
        LDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEPGI+AELV ANEK+K ++
Subjt:  LDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNIL

AT2G14260.2 proline iminopeptidase5.6e-16783.91Show/hide
Query:  KRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLE
        KR+ Y PIEPYS+G LKVSD+HT+YWEQSG P GHPVVFLHGGPGGGTAP NRRFFDP+FYRI+LFDQRGAGKSTPHACLE+NTTW+L++DIEKLREHL+
Subjt:  KRSPYPPIEPYSTGFLKVSDLHTIYWEQSGNPAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLE

Query:  IPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMDTQYAAARAWTK
        IPEW VFGGSWGSTLALAYSQSHP+KVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWE FRDLIPE+ERG   VDAY KRLNS D++ QYAAARAWTK
Subjt:  IPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWESFRDLIPESERG-CFVDAYSKRLNSKDMDTQYAAARAWTK

Query:  WEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEP
        WEMMTA+L PN EN+++ EDD FSLAFARIENHYFVNKGFFPSDS LLDN+DKIRHI   IVQGRYDVCCPMMSAWDLHK WPEAELKI+ DAGHSANEP
Subjt:  WEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNIDKIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEP

Query:  GIAAELVAANEKLKNIL
        GI+AELV ANEK+K ++
Subjt:  GIAAELVAANEKLKNIL

AT3G61540.1 alpha/beta-Hydrolases superfamily protein3.8e-0628.57Show/hide
Query:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---------------LEDNTTWNLIDDIEKLREHL--EIPEWQVFGGSWGSTL
        +++L GGPG  G  P     +     + +R++L DQRG G STP  C               L      N++ D E +R  L  +   W + G S+G   
Subjt:  VVFLHGGPG-GGTAPGNRRFFDP---DFYRIILFDQRGAGKSTPHAC---------------LEDNTTWNLIDDIEKLREHL--EIPEWQVFGGSWGSTL

Query:  ALAYSQSHPEKVTGLVLRG
        AL Y    PE +  +++ G
Subjt:  ALAYSQSHPEKVTGLVLRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTTGTTAGGCCTATGCCCTAACAATTCCTCTTCTCCTCTATTCTCATTCGTCTCCAATTTGCATTTTCGTCACTGCCTTCGCCTCTTCCCCGTCCCTCGCGTTTC
CAACCATTGCTGTGTATCAGGGACAAAAGGTTTGGTCTTAACAGCGCAATTTGGTTATAAGAGCGATAGTCAGATTGAGTTCCAACCAAAGGATTTGATGGCTGAAGAAA
AGGAAATTTCAGGAATAAAAAGAAGTCCTTACCCACCTATTGAGCCATACAGTACTGGTTTCTTGAAAGTGTCGGATCTTCACACTATTTACTGGGAGCAATCTGGGAAT
CCCGCTGGTCATCCGGTGGTCTTTCTACATGGGGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGGATTATTTTGTTTGATCA
GCGAGGTGCAGGGAAAAGTACCCCACATGCTTGCTTGGAGGATAATACCACGTGGAACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCAGAGTGGC
AGGTCTTTGGAGGTTCCTGGGGAAGTACGTTGGCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAA
GAAATTGATTGGTTTTATGAAGGTGGTGCTGCTGCTATATATCCTGATGCTTGGGAATCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTA
TAGTAAGAGATTGAATTCAAAGGATATGGATACTCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACCGCTCATCTTTTGCCAAATGAGGAGAACATTA
AAAGAGGGGAAGATGATAATTTTTCATTGGCATTTGCAAGGATTGAAAACCATTACTTCGTAAATAAGGGGTTTTTCCCTTCAGATTCCTTTCTGCTAGATAATATTGAC
AAGATACGACATATCAATGCTGTAATTGTACAGGGAAGATATGACGTTTGCTGCCCAATGATGTCTGCTTGGGATCTTCATAAAGTGTGGCCCGAGGCTGAATTAAAGAT
CATTTCTGACGCAGGCCATTCTGCTAACGAGCCAGGAATAGCTGCTGAGCTCGTGGCTGCGAATGAGAAGCTGAAGAACATCCTCCAGAAGAATGGACCATAG
mRNA sequenceShow/hide mRNA sequence
ATAAATAATAGGAGTAATAATATTAATTGTCTTACTGTTAGACTGGGGAGCTCAGGATAATGGTCTTCACCATTGACGATGAGCTTGTTAGGCCTATGCCCTAACAATTC
CTCTTCTCCTCTATTCTCATTCGTCTCCAATTTGCATTTTCGTCACTGCCTTCGCCTCTTCCCCGTCCCTCGCGTTTCCAACCATTGCTGTGTATCAGGGACAAAAGGTT
TGGTCTTAACAGCGCAATTTGGTTATAAGAGCGATAGTCAGATTGAGTTCCAACCAAAGGATTTGATGGCTGAAGAAAAGGAAATTTCAGGAATAAAAAGAAGTCCTTAC
CCACCTATTGAGCCATACAGTACTGGTTTCTTGAAAGTGTCGGATCTTCACACTATTTACTGGGAGCAATCTGGGAATCCCGCTGGTCATCCGGTGGTCTTTCTACATGG
GGGACCAGGGGGAGGAACTGCTCCAGGCAATAGAAGATTCTTTGACCCAGATTTTTATAGGATTATTTTGTTTGATCAGCGAGGTGCAGGGAAAAGTACCCCACATGCTT
GCTTGGAGGATAATACCACGTGGAACCTCATTGATGACATTGAGAAGCTAAGGGAACACTTGGAAATTCCAGAGTGGCAGGTCTTTGGAGGTTCCTGGGGAAGTACGTTG
GCTCTTGCTTATAGTCAATCTCATCCTGAAAAGGTTACGGGATTAGTTCTTAGAGGGATCTTTCTTCTGCGGAAAAAAGAAATTGATTGGTTTTATGAAGGTGGTGCTGC
TGCTATATATCCTGATGCTTGGGAATCTTTTAGAGATCTCATTCCCGAAAGTGAGAGAGGATGTTTTGTTGATGCTTATAGTAAGAGATTGAATTCAAAGGATATGGATA
CTCAATATGCAGCTGCAAGAGCGTGGACCAAATGGGAAATGATGACCGCTCATCTTTTGCCAAATGAGGAGAACATTAAAAGAGGGGAAGATGATAATTTTTCATTGGCA
TTTGCAAGGATTGAAAACCATTACTTCGTAAATAAGGGGTTTTTCCCTTCAGATTCCTTTCTGCTAGATAATATTGACAAGATACGACATATCAATGCTGTAATTGTACA
GGGAAGATATGACGTTTGCTGCCCAATGATGTCTGCTTGGGATCTTCATAAAGTGTGGCCCGAGGCTGAATTAAAGATCATTTCTGACGCAGGCCATTCTGCTAACGAGC
CAGGAATAGCTGCTGAGCTCGTGGCTGCGAATGAGAAGCTGAAGAACATCCTCCAGAAGAATGGACCATAGTAACACTCCCAAGTATATTTCTCATAATTCTTACTCATG
GAATAAGTTTTCAATATCTGAAATAAATTGGGGATTCCAAAGTATTGTTTAGTGGCATGTGTTAATGCTCGAACATTGGCCATTTCTGAAATGCAAAAGGTTAGAGA
Protein sequenceShow/hide protein sequence
MSLLGLCPNNSSSPLFSFVSNLHFRHCLRLFPVPRVSNHCCVSGTKGLVLTAQFGYKSDSQIEFQPKDLMAEEKEISGIKRSPYPPIEPYSTGFLKVSDLHTIYWEQSGN
PAGHPVVFLHGGPGGGTAPGNRRFFDPDFYRIILFDQRGAGKSTPHACLEDNTTWNLIDDIEKLREHLEIPEWQVFGGSWGSTLALAYSQSHPEKVTGLVLRGIFLLRKK
EIDWFYEGGAAAIYPDAWESFRDLIPESERGCFVDAYSKRLNSKDMDTQYAAARAWTKWEMMTAHLLPNEENIKRGEDDNFSLAFARIENHYFVNKGFFPSDSFLLDNID
KIRHINAVIVQGRYDVCCPMMSAWDLHKVWPEAELKIISDAGHSANEPGIAAELVAANEKLKNILQKNGP