; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021158 (gene) of Snake gourd v1 genome

Gene IDTan0021158
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationLG10:1895089..1900887
RNA-Seq ExpressionTan0021158
SyntenyTan0021158
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004177 - aminopeptidase activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR002410 - Peptidase S33
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]7.2e-27091.75Show/hide
Query:  LLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPM
        LLFH FP  ARSLIPL +LLSA HCRSSVRS AVMA TNP N ASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  SS KISV+AREVVSVGKEEQPM
Subjt:  LLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPM

Query:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPI NGC+ADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL EN GGV LP GGILTPKGLQTLGLSALGSST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDE
        GFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQ+I++E+ENKFD   AVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus]1.4e-27088.89Show/hide
Query:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS
        MFAART APP     LL FH  PCR   LIPL   LSAAHCR SVR SA MAG   P AASPP HV+GTWYSVPELRLRDH+FSVPLNYSL+Q S ++IS
Subjt:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS

Query:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL
        VFAREVVSVGKE+QPMPYL++LQGGPGFEC RPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQ+++DLANYLKHFRADNIVNDAEFIRTRL
Subjt:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPI NGC+ADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAEN GGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC
        PKGLQTLGLSALG+STGFER+HYLFERVWDPI+V G+PKRIS+FFLNAID+WLSLDSNPLYVLLHE+IYCQGASSRWSAQ+IK+E+ENKFDAN AVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
         VYFTGEMIFPWMFDEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo]2.4e-27390.25Show/hide
Query:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS
        MFA RT APP     LL FH  P R   LIPLP  LSAAHCR SVR SA MAG   P A SPP HVAGTWYSVPELRLRDH+FSVPLNYSLDQ SS++IS
Subjt:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS

Query:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL
        VFAREVVSVGKE+QPMPYL+YLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQF++AEDLANYLKHFRADNIVNDAEFIRTRL
Subjt:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPI NGC+ADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+N GGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC
        PKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHESIYCQGASSRWSAQ+IK+E+ENKFDAN AVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
        PVYFTGEMIFPWMFDEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]7.9e-26991.15Show/hide
Query:  LLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPM
        LLFH FP  ARSLIPL +LLSA HCRSSVRS AVMA TNP N ASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  SS KISV+AREVVSVGKEEQPM
Subjt:  LLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPM

Query:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYL+YLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPI NGC+ADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL EN GG+ LP GGILTPKGLQTLGLSALGSST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDE
        GFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQ+I +E+ENKFDA  AVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo]1.8e-26891.15Show/hide
Query:  LLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPM
        LLFH FP  ARSLIPL +LLSA HCRSSVRS  VMA TNP N ASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  SS KISV+AREVVSVGKE+ PM
Subjt:  LLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPM

Query:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAP+GLKQVLITGGIPPI NGC+ADSVYRACFEK+IIQNEKYYKRYPQDVEIV EVVKYL EN GGV LP GGILTPKGLQTLGLSALGSST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDE
        GFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQ+I +E+ENKFDA  AVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

TrEMBL top hitse value%identityAlignment
A0A0A0L423 AB hydrolase-1 domain-containing protein7.0e-27188.89Show/hide
Query:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS
        MFAART APP     LL FH  PCR   LIPL   LSAAHCR SVR SA MAG   P AASPP HV+GTWYSVPELRLRDH+FSVPLNYSL+Q S ++IS
Subjt:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS

Query:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL
        VFAREVVSVGKE+QPMPYL++LQGGPGFEC RPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQ+++DLANYLKHFRADNIVNDAEFIRTRL
Subjt:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPI NGC+ADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAEN GGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC
        PKGLQTLGLSALG+STGFER+HYLFERVWDPI+V G+PKRIS+FFLNAID+WLSLDSNPLYVLLHE+IYCQGASSRWSAQ+IK+E+ENKFDAN AVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
         VYFTGEMIFPWMFDEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase1.2e-27390.25Show/hide
Query:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS
        MFA RT APP     LL FH  P R   LIPLP  LSAAHCR SVR SA MAG   P A SPP HVAGTWYSVPELRLRDH+FSVPLNYSLDQ SS++IS
Subjt:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS

Query:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL
        VFAREVVSVGKE+QPMPYL+YLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQF++AEDLANYLKHFRADNIVNDAEFIRTRL
Subjt:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPI NGC+ADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+N GGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC
        PKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHESIYCQGASSRWSAQ+IK+E+ENKFDAN AVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
        PVYFTGEMIFPWMFDEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase1.2e-27390.25Show/hide
Query:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS
        MFA RT APP     LL FH  P R   LIPLP  LSAAHCR SVR SA MAG   P A SPP HVAGTWYSVPELRLRDH+FSVPLNYSLDQ SS++IS
Subjt:  MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKIS

Query:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL
        VFAREVVSVGKE+QPMPYL+YLQGGPGFEC RP+EASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQF++AEDLANYLKHFRADNIVNDAEFIRTRL
Subjt:  VFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPI NGC+ADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+N GGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC
        PKGLQTLGLSALG+STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAID+WLSLDSNPLYVLLHESIYCQGASSRWSAQ+IK+E+ENKFDAN AVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
        PVYFTGEMIFPWMFDEIHALRPFKDAAHILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC1114406903.8e-26991.15Show/hide
Query:  LLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPM
        LLFH FP  ARSLIPL +LLSA HCRSSVRS AVMA TNP N ASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  SS KISV+AREVVSVGKEEQPM
Subjt:  LLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPM

Query:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
        PYL+YLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQ+AEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG
Subjt:  PYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGG

Query:  FCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSST
        FCAVTYLSFAPQGLKQVLITGGIPPI NGC+ADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL EN GG+ LP GGILTPKGLQTLGLSALGSST
Subjt:  FCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSST

Query:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDE
        GFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQ+I +E+ENKFDA  AVKEGCPVYFTGEMIFPWMFDE
Subjt:  GFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDE

Query:  IHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  IHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A6J1II94 uncharacterized protein LOC111473341 isoform X11.6e-26790.96Show/hide
Query:  LLFHPFPCR-ARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQP
        LLFH FP   ARSLIPL +LLSA HCRSSVRS AVMAGT P N ASPPEH AGTWYSVPELRLRDHYFSVPLNYSLD  SS KISV+AREVVSVGKEEQP
Subjt:  LLFHPFPCR-ARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQP

Query:  MPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG
        MPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG
Subjt:  MPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYG

Query:  GFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSS
        GFCA+TYLSFAP+GLKQVLITGGIPPI NGC+ADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL EN GGV LP GGILTPKGLQTLGLSALGSS
Subjt:  GFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGVLLPSGGILTPKGLQTLGLSALGSS

Query:  TGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFD
        TGFERMHYLFERVWDPIIVPGAPKRISYFFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQ+I++E+ENKFDA  AVKEGCPVYFTGEMIFPWMFD
Subjt:  TGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFD

Query:  EIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        EIHAL+PFKDAA+ILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMG LNGKKPLF
Subjt:  EIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH1.5e-6835.48Show/hide
Query:  RLRDHYFSVPLNYSLDQVSSSKISVFAREVVSV-GKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEE-FRVILMDQRGTGLSTPLTPSSMSQFQTA
        R  +  F VPLN+S  +     + +FAR +  V G ++  +P+++YLQGGPG  C  P E + W+    E+ +RV+ +D+RGTG S+P+T  +++Q    
Subjt:  RLRDHYFSVPLNYSLDQVSSSKISVFAREVVSV-GKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEE-FRVILMDQRGTGLSTPLTPSSMSQFQTA

Query:  EDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRY
        +  A+ LK FRADNIV D E +R  L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE YYK+Y
Subjt:  EDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRY

Query:  PQDVEIVREVVKYLAENGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQ
        P+DV  V+ ++KYL EN   L  S G LTP+  Q LG+  LG   G + +H + +R  + +      K ++   L+ I++   +  N +Y LL E +YCQ
Subjt:  PQDVEIVREVVKYLAENGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQ

Query:  GASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMET
        G +  W A K + + + +F  N   +    ++FTGE IF  MF+    L+  K  A +LA   DW  LY+ A L  N+VPV  A   EDM+V++ L   T
Subjt:  GASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMET

Query:  ASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        AS++  ++  + N + H  +     +V+  L  L
Subjt:  ASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase4.6e-10244.24Show/hide
Query:  YSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQ
        Y +  +    H+F+VPL++         I++F R +    + +  +P+L+YLQGGPGF  PRP+   GWI++A +EFRV+L+DQRGTG STP+    ++ 
Subjt:  YSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQ

Query:  FQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRY
            +  A+YL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PI  G SAD VYRA +++V  +N  ++ R+
Subjt:  FQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRY

Query:  PQDVEIVREVVKYLAENGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQ
        P    I   +  +L +   V LP+G  LT + LQ  GL  LG+S  FE ++YL E  +         ++++  FL  + +    ++NP++ +LHE IYC+
Subjt:  PQDVEIVREVVKYLAENGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQ

Query:  GASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMET
        GA+S W+A++++ E    F A +A  +G    FTGEMIFPWMF++   L P K+AAH+LAEK DW PLYD   L  NKVPVA AVY EDM+V F  + ET
Subjt:  GASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMET

Query:  ASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
           ++  R WITNE+ H+GLR  G Q+LD L+ L
Subjt:  ASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

P93732 Proline iminopeptidase9.8e-0430.83Show/hide
Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +V+L GGPG      T  S       E +R++L DQRG G S   TP +  +  T  DL            VND E +R  L +P+   W + G S+G  
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

Arabidopsis top hitse value%identityAlignment
AT2G14260.1 proline iminopeptidase7.0e-0530.83Show/hide
Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +V+L GGPG      T  S       E +R++L DQRG G S   TP +  +  T  DL            VND E +R  L +P+   W + G S+G  
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT2G14260.2 proline iminopeptidase7.0e-0530.83Show/hide
Query:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        +V+L GGPG      T  S       E +R++L DQRG G S   TP +  +  T  DL            VND E +R  L +P+   W + G S+G  
Subjt:  LVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT3G61540.1 alpha/beta-Hydrolases superfamily protein2.1e-21176.65Show/hide
Query:  SPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLS
        S  EHV G W+SVPELRLRDH F VPL+YS    SS KI+VFARE+V+VGKEEQ MPYL+YLQGGPGFE PRP+EASGWIQ+ACEEFRV+L+DQRGTGLS
Subjt:  SPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLS

Query:  TPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVI
        TPLT SSM QF++A++LA+YL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIPPI   C+AD VY A FE+V 
Subjt:  TPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVI

Query:  IQNEKYYKRYPQDVEIVREVVKYLAEN--GGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNP
         QNEKYYKR+PQD+EIVRE+V YLAE+  GGV LPSGGILTPKGLQTLGLS LGSSTGFER+HY+ ERVWDPI+V GAPK IS FFLNA +SW S D+NP
Subjt:  IQNEKYYKRYPQDVEIVREVVKYLAEN--GGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIDSWLSLDSNP

Query:  LYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYE
        LY LLHE+IYC+GASS WSA +++D+ E KFDA  AVKE  PV FTGEMIFPWMFDEIHAL+PFK AA +LA+KEDWPPLYD+  L+NNKVPVAAAVYYE
Subjt:  LYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYDIAALKNNKVPVAAAVYYE

Query:  DMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        DM+VNFKL  ETAS I+GIRLW+TNEFMHSGLRDAG Q++DHL+G++NGKKPLF
Subjt:  DMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCAGCTCGCACAGTAGCGCCGCCATTTTTTATAAAGCCACTCCTTCTCTTCCACCCTTTCCCCTGTCGCGCCCGTTCGTTAATTCCACTCCCTAAACTTCTTTC
CGCCGCCCATTGCCGGAGCTCGGTCCGTTCATCGGCAGTCATGGCCGGAACCAATCCTCCTAATGCAGCATCGCCGCCGGAGCACGTAGCTGGTACGTGGTACTCCGTGC
CGGAGCTCCGGCTCCGGGACCATTACTTCTCTGTGCCTCTCAATTACTCTCTAGATCAGGTTTCTTCTTCTAAGATCTCCGTTTTCGCGCGGGAAGTTGTTTCAGTGGGG
AAAGAAGAGCAACCAATGCCATACCTTGTATACTTACAAGGTGGACCTGGGTTTGAGTGTCCCCGACCAACTGAAGCCAGTGGATGGATACAAAAAGCATGCGAAGAATT
TCGTGTTATATTGATGGATCAGCGAGGAACAGGATTATCGACTCCTTTAACTCCATCATCCATGTCGCAATTTCAAACTGCTGAGGACTTGGCCAACTACTTGAAACACT
TTCGAGCTGATAACATAGTTAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGTTTTTGTGCAGTT
ACGTATTTAAGTTTTGCACCACAAGGATTGAAGCAAGTCCTCATAACTGGAGGAATCCCTCCAATTGAGAATGGATGCTCTGCAGATTCTGTATATAGAGCATGCTTTGA
AAAGGTTATAATTCAAAATGAAAAGTACTACAAGAGGTATCCTCAGGATGTTGAAATCGTCCGTGAAGTTGTGAAATACTTGGCAGAGAATGGAGGCGTTCTTCTTCCCT
CTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGCCTTTCTGCTTTAGGATCTAGTACAGGTTTTGAGCGCATGCACTATCTGTTTGAGAGAGTATGGGATCCT
ATAATAGTTCCCGGAGCACCGAAACGAATCAGTTATTTCTTCCTCAATGCTATTGATAGCTGGCTTTCTCTTGATTCAAATCCTCTTTATGTTCTCTTGCACGAATCGAT
ATATTGCCAGGGTGCCTCGTCTCGTTGGTCTGCTCAAAAAATAAAGGATGAAATGGAGAACAAATTCGATGCAAATATGGCTGTAAAAGAAGGATGTCCTGTGTATTTCA
CTGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAGACCGTTCAAAGACGCTGCCCATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGAC
ATTGCTGCTCTTAAAAATAACAAGGTTCCGGTCGCAGCTGCAGTTTATTACGAAGATATGTTCGTAAACTTCAAGCTGGCCATGGAAACAGCTTCCCAAATAGCAGGAAT
AAGGCTGTGGATTACTAATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCACAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
mRNA sequenceShow/hide mRNA sequence
GAACATTACTTTTTTACTTTTTTCCCCATAATTCGCCCCCCACCATAGTTAAGCTATTCCACTCCACGTCAAAAAATGTTCGCAGCTCGCACAGTAGCGCCGCCATTTTT
TATAAAGCCACTCCTTCTCTTCCACCCTTTCCCCTGTCGCGCCCGTTCGTTAATTCCACTCCCTAAACTTCTTTCCGCCGCCCATTGCCGGAGCTCGGTCCGTTCATCGG
CAGTCATGGCCGGAACCAATCCTCCTAATGCAGCATCGCCGCCGGAGCACGTAGCTGGTACGTGGTACTCCGTGCCGGAGCTCCGGCTCCGGGACCATTACTTCTCTGTG
CCTCTCAATTACTCTCTAGATCAGGTTTCTTCTTCTAAGATCTCCGTTTTCGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTGTATACTT
ACAAGGTGGACCTGGGTTTGAGTGTCCCCGACCAACTGAAGCCAGTGGATGGATACAAAAAGCATGCGAAGAATTTCGTGTTATATTGATGGATCAGCGAGGAACAGGAT
TATCGACTCCTTTAACTCCATCATCCATGTCGCAATTTCAAACTGCTGAGGACTTGGCCAACTACTTGAAACACTTTCGAGCTGATAACATAGTTAATGATGCTGAATTT
ATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGTTTTTGTGCAGTTACGTATTTAAGTTTTGCACCACAAGGATTGAAGCA
AGTCCTCATAACTGGAGGAATCCCTCCAATTGAGAATGGATGCTCTGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAATGAAAAGTACTACAAGA
GGTATCCTCAGGATGTTGAAATCGTCCGTGAAGTTGTGAAATACTTGGCAGAGAATGGAGGCGTTCTTCTTCCCTCTGGTGGTATCTTGACACCTAAAGGGCTGCAAACT
CTTGGCCTTTCTGCTTTAGGATCTAGTACAGGTTTTGAGCGCATGCACTATCTGTTTGAGAGAGTATGGGATCCTATAATAGTTCCCGGAGCACCGAAACGAATCAGTTA
TTTCTTCCTCAATGCTATTGATAGCTGGCTTTCTCTTGATTCAAATCCTCTTTATGTTCTCTTGCACGAATCGATATATTGCCAGGGTGCCTCGTCTCGTTGGTCTGCTC
AAAAAATAAAGGATGAAATGGAGAACAAATTCGATGCAAATATGGCTGTAAAAGAAGGATGTCCTGTGTATTTCACTGGAGAGATGATCTTCCCGTGGATGTTTGACGAG
ATTCATGCCTTGAGACCGTTCAAAGACGCTGCCCATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGACATTGCTGCTCTTAAAAATAACAAGGTTCCGGTCGC
AGCTGCAGTTTATTACGAAGATATGTTCGTAAACTTCAAGCTGGCCATGGAAACAGCTTCCCAAATAGCAGGAATAAGGCTGTGGATTACTAATGAATTTATGCATTCTG
GTCTGCGTGATGCAGGGCCACAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGAGGTTTTTCTGATCTTTGTTGCTTTTCTTCATTTTCT
CCTCAATAATATTGTTAGTTGTTGGTGTATGAGATCAGGAGATTGTAAATTGATTGATCCTTTCTCCTAGATAAGTTATTTGCCATTAATTATAACAATGAAATAAATAA
AAATAAACAAAATTTAGAAAAAAATAAAAATCTTTAGTTCAG
Protein sequenceShow/hide protein sequence
MFAARTVAPPFFIKPLLLFHPFPCRARSLIPLPKLLSAAHCRSSVRSSAVMAGTNPPNAASPPEHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSSKISVFAREVVSVG
KEEQPMPYLVYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQTAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAV
TYLSFAPQGLKQVLITGGIPPIENGCSADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGVLLPSGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDP
IIVPGAPKRISYFFLNAIDSWLSLDSNPLYVLLHESIYCQGASSRWSAQKIKDEMENKFDANMAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILAEKEDWPPLYD
IAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF