; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013125 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013125
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationChr01:27102404..27106625
RNA-Seq ExpressionHG10013125
SyntenyHG10013125
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048938.1 proline iminopeptidase [Cucumis melo var. makuwa]9.9e-23787.9Show/hide
Query:  MAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVI
        MAG   P   SPPVHVAGTWYSVPELRLRDH+F VPLNYSLDQ SS +ISVFAREVVSVGKE+QPMPYLLYLQGGPGFECARP+EASGWIQKAC+EFRVI
Subjt:  MAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVI

Query:  LMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
        LMDQ    L T L+                              F  TRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
Subjt:  LMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS

Query:  VYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID
        VYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPSGGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAID
Subjt:  VYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID

Query:  NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
        NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
Subjt:  NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV

Query:  PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]3.7e-23682.59Show/hide
Query:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYL
        HS      SLIPL +L SA HC  SVR  AVMA TNP NGASPP H AGTWYSVPELRLRDHYF VPLNYSLD  SSPKISV+AREVVSVGKEEQPMPYL
Subjt:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYL

Query:  LYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCA
        +YLQGGPGFEC RPTEASGWIQKACEEFRVILMDQ    L T LS                              F  TRLVPDAAPWTILGQSYGGFCA
Subjt:  LYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCA

Query:  VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFE
        VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGGV LP GGILTPKGLQTLGLSALGSSTGFE
Subjt:  VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFE

Query:  RLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHA
        R+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI+NE+ENKFD  +AVKEGCPVYFTGEMIFPWMFDEIHA
Subjt:  RLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHA

Query:  LRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        L+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  LRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus]2.4e-25185.19Show/hide
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFAARTAAPPL    LLH HSL CR L LIPL    SAAHC RSVRLSA MAG   P  ASPPVHV+GTWYSVPELRLRDH+F VPLNYSL+Q S  +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL
        VFAREVVSVGKE+QPMPYLL+LQGGPGFECARPTEASGWIQKACEEFRVILMDQ    L T L+                              F  TRL
Subjt:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAENGGGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
        PKGLQTLGLSALG+STGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
         VYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo]4.0e-25486.35Show/hide
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFA RTAAPPL    LLH HSL  R L LIPLP   SAAHC RSVRLSA MAG   P   SPPVHVAGTWYSVPELRLRDH+F VPLNYSLDQ SS +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL
        VFAREVVSVGKE+QPMPYLLYLQGGPGFECARP+EASGWIQKACEEFRVILMDQ    L T L+                              F  TRL
Subjt:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
        PKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
        PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]3.4e-23783Show/hide
Query:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYL
        HS      SLIPL +L SA HC  SVR  AVMA TNP NGASPP H AGTWYSVPELRLRDHYF VPLNYSLD  SSPKISV+AREVVSVGKEEQPMPYL
Subjt:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYL

Query:  LYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCA
        LYLQGGPGFEC RPTEASGWIQKACEEFRVILMDQ    L T LS                              F  TRLVPDAAPWTILGQSYGGFCA
Subjt:  LYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCA

Query:  VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFE
        VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALGSSTGFE
Subjt:  VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFE

Query:  RLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHA
        R+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGEMIFPWMFDEIHA
Subjt:  RLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHA

Query:  LRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        L+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  LRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

TrEMBL top hitse value%identityAlignment
A0A0A0L423 AB hydrolase-1 domain-containing protein1.2e-25185.19Show/hide
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFAARTAAPPL    LLH HSL CR L LIPL    SAAHC RSVRLSA MAG   P  ASPPVHV+GTWYSVPELRLRDH+F VPLNYSL+Q S  +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL
        VFAREVVSVGKE+QPMPYLL+LQGGPGFECARPTEASGWIQKACEEFRVILMDQ    L T L+                              F  TRL
Subjt:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAENGGGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
        PKGLQTLGLSALG+STGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
         VYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase1.9e-25486.35Show/hide
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFA RTAAPPL    LLH HSL  R L LIPLP   SAAHC RSVRLSA MAG   P   SPPVHVAGTWYSVPELRLRDH+F VPLNYSLDQ SS +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL
        VFAREVVSVGKE+QPMPYLLYLQGGPGFECARP+EASGWIQKACEEFRVILMDQ    L T L+                              F  TRL
Subjt:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
        PKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
        PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

A0A5A7U143 Proline iminopeptidase4.8e-23787.9Show/hide
Query:  MAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVI
        MAG   P   SPPVHVAGTWYSVPELRLRDH+F VPLNYSLDQ SS +ISVFAREVVSVGKE+QPMPYLLYLQGGPGFECARP+EASGWIQKAC+EFRVI
Subjt:  MAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVI

Query:  LMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
        LMDQ    L T L+                              F  TRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
Subjt:  LMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS

Query:  VYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID
        VYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPSGGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAID
Subjt:  VYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAID

Query:  NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
        NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
Subjt:  NWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV

Query:  PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  PVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase1.9e-25486.35Show/hide
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFA RTAAPPL    LLH HSL  R L LIPLP   SAAHC RSVRLSA MAG   P   SPPVHVAGTWYSVPELRLRDH+F VPLNYSLDQ SS +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL
        VFAREVVSVGKE+QPMPYLLYLQGGPGFECARP+EASGWIQKACEEFRVILMDQ    L T L+                              F  TRL
Subjt:  VFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRL

Query:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT
        VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPSGGILT
Subjt:  VPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILT

Query:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
        PKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC
Subjt:  PKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGC

Query:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
        PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD
Subjt:  PVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLD

Query:  HLMGLLNGKKPLF
        HLMGLLNGKKPLF
Subjt:  HLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC1114406901.6e-23783Show/hide
Query:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYL
        HS      SLIPL +L SA HC  SVR  AVMA TNP NGASPP H AGTWYSVPELRLRDHYF VPLNYSLD  SSPKISV+AREVVSVGKEEQPMPYL
Subjt:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYL

Query:  LYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCA
        LYLQGGPGFEC RPTEASGWIQKACEEFRVILMDQ    L T LS                              F  TRLVPDAAPWTILGQSYGGFCA
Subjt:  LYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-LLTALSEG----------------------------FETTRLVPDAAPWTILGQSYGGFCA

Query:  VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFE
        VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALGSSTGFE
Subjt:  VTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFE

Query:  RLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHA
        R+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGEMIFPWMFDEIHA
Subjt:  RLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHA

Query:  LRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        L+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  LRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH3.5e-5131.26Show/hide
Query:  RLRDHYFYVPLNYSLDQVSSPKISVFAREVVSV-GKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEE-FRVILMDQAWL----LLTA---------
        R  +  F VPLN+S  +     + +FAR +  V G ++  +P++LYLQGGPG  C  P E + W+    E+ +RV+ +D+        +TA         
Subjt:  RLRDHYFYVPLNYSLDQVSSPKISVFAREVVSV-GKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEE-FRVILMDQAWL----LLTA---------

Query:  -----LSEGFETTRLV---------------PDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRY
             L + F    +V                D + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE YYK+Y
Subjt:  -----LSEGFETTRLV---------------PDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRY

Query:  PQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYC
        P+DV  V+ ++KYL EN   +   S G LTP+  Q LG+  LG   G + +H + +R  + +      K ++   L+ I+N   +  N +Y LL E +YC
Subjt:  PQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYC

Query:  QGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAME
        QG +  W A + + + + +F  N   +    ++FTGE IF  MF+    L+  K  A +LA   DW  LY+ A L  N+VPV  A   EDM+V++ L   
Subjt:  QGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAME

Query:  TASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        TAS++  ++  + N + H  +     +V+  L  L
Subjt:  TASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase1.4e-8439.95Show/hide
Query:  SPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL---
        S P+H     Y +  +    H+F VPL++         I++F R +    + +  +P+LLYLQGGPGF   RP+   GWI++A +EFRV+L+DQ      
Subjt:  SPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL---

Query:  ------LLTALS--------EGFETTRLVPDAA----------PWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQ
              LL  L+          F    +V DA           PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA +++V  +
Subjt:  ------LLTALS--------EGFETTRLVPDAA----------PWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQ

Query:  NEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYV
        N  ++ R+P    I   +  +L  +   V LP+G  LT + LQ  GL  LG+S  FE L+YL E  +         ++++  FL  +      ++NP++ 
Subjt:  NEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYV

Query:  LLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMF
        +LHE IYC+GA+S W+A+R++ E    F A  A  +G    FTGEMIFPWMF++   L P K+AAH+LA+K DW PLYD   L  NKVPVA AVY EDM+
Subjt:  LLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMF

Query:  VNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        V F  + ET   ++  R WITNE+ H+GLR  G Q+LD L+ L
Subjt:  VNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

Arabidopsis top hitse value%identityAlignment
AT3G61540.1 alpha/beta-Hydrolases superfamily protein4.8e-18971.05Show/hide
Query:  GASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-
        G S   HV G W+SVPELRLRDH F VPL+YS    SSPKI+VFARE+V+VGKEEQ MPYLLYLQGGPGFE  RP+EASGWIQ+ACEEFRV+L+DQ    
Subjt:  GASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVGKEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWL-

Query:  LLTALS----------------------------EGFETTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK
        L T L+                              F   RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD VY A FE+
Subjt:  LLTALS----------------------------EGFETTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK

Query:  VIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDS
        V  QNEKYYKR+PQD+EIVRE+V YLAE+ GGGV LPSGGILTPKGLQTLGLS LGSSTGFERLHY+ ERVWDPILV GAPK IS FFLNA ++W S D+
Subjt:  VIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDS

Query:  NPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVY
        NPLY LLHE+IYC+GASS WSA R++++ E KFDA KAVKE  PV FTGEMIFPWMFDEIHAL+PFK AA +LA KEDWPPLYD+  L+NNKVPVAAAVY
Subjt:  NPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVY

Query:  YEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        YEDM+VNFKL  ETAS I+GIRLW+TNEFMHSGLRDAG Q++DHL+G++NGKKPLF
Subjt:  YEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCAGCTCGCACGGCAGCGCCGCCACTTTTGATAAAGCCACTCCTTCACTCCCACTCTTTAACCTGCCGCTTCCTTTCGTTAATTCCACTCCCGAAACTTTTCTC
CGCCGCCCATTGCCTGAGATCGGTCCGTTTATCGGCAGTTATGGCCGGAACCAATCCCCCTAATGGAGCATCGCCGCCAGTGCACGTAGCTGGCACGTGGTACTCCGTGC
CGGAGCTCCGTCTTCGGGACCATTACTTCTACGTGCCTCTCAATTACTCTCTAGATCAGGTTTCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGTGGGG
AAAGAGGAGCAACCAATGCCATACCTTTTGTACTTACAAGGTGGACCCGGATTTGAGTGCGCCCGACCAACTGAAGCAAGTGGATGGATACAAAAAGCATGTGAAGAATT
TCGTGTTATATTGATGGATCAGGCATGGCTTTTGTTGACTGCTTTGAGCGAAGGATTTGAAACGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGA
GCTACGGCGGTTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACTGCAGAT
TCTGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAATGAAAAGTACTACAAGAGGTATCCACAAGATGTTGAAATTGTCCGCGAAGTTGTGAAATACTTGGCCGA
GAATGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTATCTTGACACCCAAAGGACTGCAAACTCTTGGTCTTTCTGCTTTGGGATCCAGTACAGGTTTTGAGCGCTTGCACT
ATCTGTTTGAGAGAGTGTGGGATCCTATACTAGTTCCCGGAGCACCGAAACGAATCAGTTTTTTCTTCCTCAATGCTATTGATAACTGGCTCTCACTCGATTCAAATCCT
CTTTACGTTCTCTTGCACGAATCGATATATTGCCAGGGCGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAATTCGATGCAAATAAAGCTGT
AAAAGAAGGATGTCCCGTGTATTTCACTGGAGAGATGATCTTCCCCTGGATGTTTGACGAGATTCATGCCTTGAGACCATTCAAAGACGCAGCGCATATATTGGCTGATA
AAGAGGATTGGCCTCCTCTATATGACATTGCTGCTCTTAAAAATAACAAGGTTCCAGTGGCAGCTGCGGTCTATTATGAAGATATGTTTGTGAACTTCAAGTTGGCCATG
GAGACAGCTTCCCAAATAGCAGGAATAAGGTTGTGGATAACTAATGAATTTATGCATTCTGGTCTGCGTGATGCGGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTT
AAATGGAAAGAAGCCTTTATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGCAGCTCGCACGGCAGCGCCGCCACTTTTGATAAAGCCACTCCTTCACTCCCACTCTTTAACCTGCCGCTTCCTTTCGTTAATTCCACTCCCGAAACTTTTCTC
CGCCGCCCATTGCCTGAGATCGGTCCGTTTATCGGCAGTTATGGCCGGAACCAATCCCCCTAATGGAGCATCGCCGCCAGTGCACGTAGCTGGCACGTGGTACTCCGTGC
CGGAGCTCCGTCTTCGGGACCATTACTTCTACGTGCCTCTCAATTACTCTCTAGATCAGGTTTCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGTGGGG
AAAGAGGAGCAACCAATGCCATACCTTTTGTACTTACAAGGTGGACCCGGATTTGAGTGCGCCCGACCAACTGAAGCAAGTGGATGGATACAAAAAGCATGTGAAGAATT
TCGTGTTATATTGATGGATCAGGCATGGCTTTTGTTGACTGCTTTGAGCGAAGGATTTGAAACGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGA
GCTACGGCGGTTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACTGCAGAT
TCTGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAATGAAAAGTACTACAAGAGGTATCCACAAGATGTTGAAATTGTCCGCGAAGTTGTGAAATACTTGGCCGA
GAATGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTATCTTGACACCCAAAGGACTGCAAACTCTTGGTCTTTCTGCTTTGGGATCCAGTACAGGTTTTGAGCGCTTGCACT
ATCTGTTTGAGAGAGTGTGGGATCCTATACTAGTTCCCGGAGCACCGAAACGAATCAGTTTTTTCTTCCTCAATGCTATTGATAACTGGCTCTCACTCGATTCAAATCCT
CTTTACGTTCTCTTGCACGAATCGATATATTGCCAGGGCGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAATTCGATGCAAATAAAGCTGT
AAAAGAAGGATGTCCCGTGTATTTCACTGGAGAGATGATCTTCCCCTGGATGTTTGACGAGATTCATGCCTTGAGACCATTCAAAGACGCAGCGCATATATTGGCTGATA
AAGAGGATTGGCCTCCTCTATATGACATTGCTGCTCTTAAAAATAACAAGGTTCCAGTGGCAGCTGCGGTCTATTATGAAGATATGTTTGTGAACTTCAAGTTGGCCATG
GAGACAGCTTCCCAAATAGCAGGAATAAGGTTGTGGATAACTAATGAATTTATGCATTCTGGTCTGCGTGATGCGGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTT
AAATGGAAAGAAGCCTTTATTCTGA
Protein sequenceShow/hide protein sequence
MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSVG
KEEQPMPYLLYLQGGPGFECARPTEASGWIQKACEEFRVILMDQAWLLLTALSEGFETTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTAD
SVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNP
LYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM
ETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF