CuGenDBv2

Lsi05G020720 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G020720
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: AB hydrolase-1 domain-containing protein
Genome location: chr05:27625431..27629896
RNA-Seq Expression: Lsi05G020720
Synteny: Lsi05G020720
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
    GO:0008233 - peptidase activity (molecular function)
InterPro domains: IPR000073 - Alpha/beta hydrolase fold-1
    IPR029058 - Alpha/Beta hydrolase fold
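
The genome location in this record is written as `chromosome:start..end`. The following is a minimal sketch of how such a string could be parsed to get the span of the gene. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("chr05:27625431..27629896")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span covers the whole gene region, introns included, so it is expected to be longer than the mRNA sequence listed further down.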


Homology
GenBank top hits | e-value | % identity
XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus] | 6.6e-214 | 75.68
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFAARTAAPPL    LLH HSL CR L LIPL    SAAHC RSVRLSA MAG   P  ASPPVHV+GTWYSVPELRLRDH+F VPLNYSL+Q S  +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF
        VFAREVVS      GK           +  P        +     +K            RGTGLSTPLTPSSMSQFQS++DLANYLKHFRADNIVNDAEF
Subjt:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF

Query:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS
        IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAENGGGVLLPS
Subjt:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS

Query:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA
        GGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKA
Subjt:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA

Query:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG
        VKEGC VYFTGE                                          VPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAG
Subjt:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG

Query:  PQVLDHLMGLLNGKKPLF
        PQVLDHLMGLLNGKKPLF
Subjt:  PQVLDHLMGLLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo] | 6.4e-217 | 77.22
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFA RTAAPPL    LLH HSL  R L LIPLP   SAAHC RSVRLSA MAG   P   SPPVHVAGTWYSVPELRLRDH+F VPLNYSLDQ SS +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KHRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF
        VFAREVVS      GK           +  P  + + P +      K  + ++      RGTGLSTPLTPSSMSQF+SAEDLANYLKHFRADNIVNDAEF
Subjt:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KHRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF

Query:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS
        IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPS
Subjt:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS

Query:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA
        GGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA
Subjt:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA

Query:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG
        VKEGCPVYFTGE                                          VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG
Subjt:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG

Query:  PQVLDHLMGLLNGKKPLF
        PQVLDHLMGLLNGKKPLF
Subjt:  PQVLDHLMGLLNGKKPLF

XP_022147514.1 uncharacterized protein LOC111016418 [Momordica charantia] | 4.4e-202 | 72.22
Query:  MFAARTAAPPLLIKPLL-HSHSLTCRFLSLIPLPKLF--SAAHCLRSVRLSAVMA-GTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSS
        MFAA  AAPP LIKPLL + HS  CR L L+PLP L    A H   SVR SAVMA  TNP N ASPP H  G WYSVPELRLRDH+F VPL+YSLDQ +S
Subjt:  MFAARTAAPPLLIKPLL-HSHSLTCRFLSLIPLPKLF--SAAHCLRSVRLSAVMA-GTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSS

Query:  PKISVFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVN
        PKISVFAREVV       GK           +  P   +P   +     +K            RGTGLSTPLT SSMSQFQSAEDLANYLKHFRADNIVN
Subjt:  PKISVFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVN

Query:  DAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGV
        DAEFIRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQDVEI+REV KYLAE+GGGV
Subjt:  DAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGV

Query:  LLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFD
        +LPSGGILTPKGLQ LGL ALGSSTGFERLHYLFERVWDP++VPGAPKRIS FFL A DNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNE++NKFD
Subjt:  LLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFD

Query:  ANKAVKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGL
        AN A+KEGCP++FTGE                                          VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNE+MHSGL
Subjt:  ANKAVKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGL

Query:  RDAGPQVLDHLMGLLNGKKPLF
        RDAGPQVLDHLMGLLNGKKPLF
Subjt:  RDAGPQVLDHLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata] | 2.2e-201 | 73.55
Query:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSN
        HS      SLIPL +L SA HC  SVR  AVMA TNP NGASPP H AGTWYSVPELRLRDHYF VPLNYSLD  SSPKISV+AREVVS      GK   
Subjt:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSN

Query:  QCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
                +  P    P   +     +K            RGTGLSTPL+PSSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  QCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGS

Query:  STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE-------
        STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGE       
Subjt:  STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE-------

Query:  -----------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
                                           VPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  -----------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo] | 9.9e-202 | 73.75
Query:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSN
        HS      SLIPL +L SA HC  SVR   VMA TNP NGASPP H AGTWYSVPELRLRDHYF VPLNYSLD  SSPKISV+AREVVS      GK  +
Subjt:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSN

Query:  QCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
                +  P    P   +     +K            RGTGLSTPL+PSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  QCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAP+GLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDVEIV EVVKYL ENGGGV LP GGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGS

Query:  STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE-------
        STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGE       
Subjt:  STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE-------

Query:  -----------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
                                           VPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  -----------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

TrEMBL top hits | e-value | % identity
A0A0A0L423 AB hydrolase-1 domain-containing protein | 3.2e-214 | 75.68
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFAARTAAPPL    LLH HSL CR L LIPL    SAAHC RSVRLSA MAG   P  ASPPVHV+GTWYSVPELRLRDH+F VPLNYSL+Q S  +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF
        VFAREVVS      GK           +  P        +     +K            RGTGLSTPLTPSSMSQFQS++DLANYLKHFRADNIVNDAEF
Subjt:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF

Query:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS
        IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAENGGGVLLPS
Subjt:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS

Query:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA
        GGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKA
Subjt:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA

Query:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG
        VKEGC VYFTGE                                          VPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAG
Subjt:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG

Query:  PQVLDHLMGLLNGKKPLF
        PQVLDHLMGLLNGKKPLF
Subjt:  PQVLDHLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase | 3.1e-217 | 77.22
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFA RTAAPPL    LLH HSL  R L LIPLP   SAAHC RSVRLSA MAG   P   SPPVHVAGTWYSVPELRLRDH+F VPLNYSLDQ SS +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KHRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF
        VFAREVVS      GK           +  P  + + P +      K  + ++      RGTGLSTPLTPSSMSQF+SAEDLANYLKHFRADNIVNDAEF
Subjt:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KHRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF

Query:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS
        IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPS
Subjt:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS

Query:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA
        GGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA
Subjt:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA

Query:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG
        VKEGCPVYFTGE                                          VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG
Subjt:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG

Query:  PQVLDHLMGLLNGKKPLF
        PQVLDHLMGLLNGKKPLF
Subjt:  PQVLDHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase | 3.1e-217 | 77.22
Query:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS
        MFA RTAAPPL    LLH HSL  R L LIPLP   SAAHC RSVRLSA MAG   P   SPPVHVAGTWYSVPELRLRDH+F VPLNYSLDQ SS +IS
Subjt:  MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKIS

Query:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KHRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF
        VFAREVVS      GK           +  P  + + P +      K  + ++      RGTGLSTPLTPSSMSQF+SAEDLANYLKHFRADNIVNDAEF
Subjt:  VFAREVVSEYALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KHRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEF

Query:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS
        IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGGVLLPS
Subjt:  IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPS

Query:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA
        GGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA
Subjt:  GGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKA

Query:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG
        VKEGCPVYFTGE                                          VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG
Subjt:  VKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAG

Query:  PQVLDHLMGLLNGKKPLF
        PQVLDHLMGLLNGKKPLF
Subjt:  PQVLDHLMGLLNGKKPLF

A0A6J1D184 uncharacterized protein LOC111016418 | 2.2e-202 | 72.22
Query:  MFAARTAAPPLLIKPLL-HSHSLTCRFLSLIPLPKLF--SAAHCLRSVRLSAVMA-GTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSS
        MFAA  AAPP LIKPLL + HS  CR L L+PLP L    A H   SVR SAVMA  TNP N ASPP H  G WYSVPELRLRDH+F VPL+YSLDQ +S
Subjt:  MFAARTAAPPLLIKPLL-HSHSLTCRFLSLIPLPKLF--SAAHCLRSVRLSAVMA-GTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSS

Query:  PKISVFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVN
        PKISVFAREVV       GK           +  P   +P   +     +K            RGTGLSTPLT SSMSQFQSAEDLANYLKHFRADNIVN
Subjt:  PKISVFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVN

Query:  DAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGV
        DAEFIRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQDVEI+REV KYLAE+GGGV
Subjt:  DAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGV

Query:  LLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFD
        +LPSGGILTPKGLQ LGL ALGSSTGFERLHYLFERVWDP++VPGAPKRIS FFL A DNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNE++NKFD
Subjt:  LLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFD

Query:  ANKAVKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGL
        AN A+KEGCP++FTGE                                          VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNE+MHSGL
Subjt:  ANKAVKEGCPVYFTGE------------------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGL

Query:  RDAGPQVLDHLMGLLNGKKPLF
        RDAGPQVLDHLMGLLNGKKPLF
Subjt:  RDAGPQVLDHLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC111440690 | 1.1e-201 | 73.55
Query:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSN
        HS      SLIPL +L SA HC  SVR  AVMA TNP NGASPP H AGTWYSVPELRLRDHYF VPLNYSLD  SSPKISV+AREVVS      GK   
Subjt:  HSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSN

Query:  QCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
                +  P    P   +     +K            RGTGLSTPL+PSSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY
Subjt:  QCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSY

Query:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGS
        GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALGS
Subjt:  GGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGS

Query:  STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE-------
        STGFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGE       
Subjt:  STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE-------

Query:  -----------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
                                           VPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  -----------------------------------VPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

SwissProt top hits | e-value | % identity
A0A1L9WUM2 Proline iminopeptidase aneH | 2.7e-40 | 29.25
Query:  RLRDHYFYVPLNYSLDQVSSPKISVFAREV-----VSEYALQW-----GKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYK----KHRGTGLSTPLTPSSM
        R  +  F VPLN+S  +     + +FAR +     V +  L W     G     C T   Y   P +          GY+      RGTG S+P+T  ++
Subjt:  RLRDHYFYVPLNYSLDQVSSPKISVFAREV-----VSEYALQW-----GKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYK----KHRGTGLSTPLTPSSM

Query:  SQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNE
        +Q    +  A+ LK FRADNIV D E +R  L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE
Subjt:  SQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNE

Query:  KYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLL
         YYK+YP+DV  V+ ++KYL EN   +   S G LTP+  Q LG+  LG   G + +H + +R  + +      K ++   L+ I+N   +  N +Y LL
Subjt:  KYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLL

Query:  HESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTG------------------------------------------EVPVAAAVYYEDMFVN
         E +YCQG +  W A + + + + +F  N   +    ++FTG                                          EVPV  A   EDM+V+
Subjt:  HESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTG------------------------------------------EVPVAAAVYYEDMFVN

Query:  FKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        + L   TAS++  ++  + N + H  +     +V+  L  L
Subjt:  FKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase | 2.4e-57 | 33.56
Query:  SPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HR
        S P+H     Y +  +    H+F VPL++         I++F R +  +  L      ++       +  P   AP         K+            R
Subjt:  SPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------HR

Query:  GTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRAC
        GTG STP+    ++     +  A+YL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA 
Subjt:  GTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRAC

Query:  FEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSL
        +++V  +N  ++ R+P    I   +  +L  +   V LP+G  LT + LQ  GL  LG+S  FE L+YL E  +         ++++  FL  +      
Subjt:  FEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSL

Query:  DSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE------------------------------------------VPVAAA
        ++NP++ +LHE IYC+GA+S W+A+R++ E    F A  A  +G    FTGE                                          VPVA A
Subjt:  DSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE------------------------------------------VPVAAA

Query:  VYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        VY EDM+V F  + ET   ++  R WITNE+ H+GLR  G Q+LD L+ L
Subjt:  VYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

Arabidopsis top hits | e-value | % identity
AT3G61540.1 alpha/beta-Hydrolases superfamily protein | 3.0e-156 | 62.34
Query:  GASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------
        G S   HV G W+SVPELRLRDH F VPL+YS    SSPKI+VFARE+V+      GK           +  P    P +  +  G+ +           
Subjt:  GASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEYALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------

Query:  -HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVY
          RGTGLSTPLT SSM QF+SA++LA+YL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD VY
Subjt:  -HRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVY

Query:  RACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDN
         A FE+V  QNEKYYKR+PQD+EIVRE+V YLAE+ GGGV LPSGGILTPKGLQTLGLS LGSSTGFERLHY+ ERVWDPILV GAPK IS FFLNA ++
Subjt:  RACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDN

Query:  WLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE------------------------------------------VP
        W S D+NPLY LLHE+IYC+GASS WSA R++++ E KFDA KAVKE  PV FTGE                                          VP
Subjt:  WLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE------------------------------------------VP

Query:  VAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        VAAAVYYEDM+VNFKL  ETAS I+GIRLW+TNEFMHSGLRDAG Q++DHL+G++NGKKPLF
Subjt:  VAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF


Sequences
CDS sequence
ATGTTCGCAGCTCGCACGGCAGCGCCGCCACTTTTGATAAAGCCACTCCTTCACTCCCACTCTTTAACCTGCCGCTTCCTTTCGTTAATTCCACTCCCGAAACTTTTCTC
CGCCGCCCATTGCCTGAGATCGGTCCGTTTATCGGCAGTTATGGCCGGAACCAATCCCCCTAATGGAGCATCGCCGCCAGTGCACGTAGCTGGCACGTGGTACTCCGTGC
CGGAGCTCCGTCTTCGGGACCATTACTTCTACGTGCCTCTCAATTACTCTCTAGATCAGGTTTCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGAATAT
GCATTGCAGTGGGGAAAGAGGAGCAACCAATGCCATACCTTTTGTACTTACAAGGTGGACCCGGATTTGAGTGCGCCCGACCAACTGAAGCAAGTGGATGGATACAAAAA
GCATCGAGGAACAGGATTATCAACTCCTTTGACTCCATCGTCCATGTCGCAATTTCAAAGTGCAGAGGACTTAGCTAACTACTTGAAACATTTTCGAGCTGATAACATAG
TTAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTACGGCGGTTTTTGTGCAGTTACGTATTTGAGTTTTGCA
CCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACTGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAA
TGAAAAGTACTACAAGAGGTATCCACAAGATGTTGAAATTGTCCGCGAAGTTGTGAAATACTTGGCCGAGAATGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTATCTTGA
CACCCAAAGGACTGCAAACTCTTGGTCTTTCTGCTTTGGGATCCAGTACAGGTTTTGAGCGCTTGCACTATCTGTTTGAGAGAGTGTGGGATCCTATACTAGTTCCCGGA
GCACCGAAACGAATCAGTTTTTTCTTCCTCAATGCTATTGATAACTGGCTCTCACTCGATTCAAATCCTCTTTACGTTCTCTTGCACGAATCGATATATTGCCAGGGCGC
CTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAATTCGATGCAAATAAAGCTGTAAAAGAAGGATGTCCCGTGTATTTCACTGGAGAGGTTCCAG
TGGCAGCTGCGGTCTATTATGAAGATATGTTTGTGAACTTCAAGTTGGCCATGGAGACAGCTTCCCAAATAGCAGGAATAAGGTTGTGGATAACTAATGAATTTATGCAT
TCTGGTCTGCGTGATGCGGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
mRNA sequence
CCATATTTCGCCCCCCCACCATAGTTAATCTAATCCACCCACCGTCCAAAAATGTTCGCAGCTCGCACGGCAGCGCCGCCACTTTTGATAAAGCCACTCCTTCACTCCCA
CTCTTTAACCTGCCGCTTCCTTTCGTTAATTCCACTCCCGAAACTTTTCTCCGCCGCCCATTGCCTGAGATCGGTCCGTTTATCGGCAGTTATGGCCGGAACCAATCCCC
CTAATGGAGCATCGCCGCCAGTGCACGTAGCTGGCACGTGGTACTCCGTGCCGGAGCTCCGTCTTCGGGACCATTACTTCTACGTGCCTCTCAATTACTCTCTAGATCAG
GTTTCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGAATATGCATTGCAGTGGGGAAAGAGGAGCAACCAATGCCATACCTTTTGTACTTACAAGGTGGA
CCCGGATTTGAGTGCGCCCGACCAACTGAAGCAAGTGGATGGATACAAAAAGCATCGAGGAACAGGATTATCAACTCCTTTGACTCCATCGTCCATGTCGCAATTTCAAA
GTGCAGAGGACTTAGCTAACTACTTGAAACATTTTCGAGCTGATAACATAGTTAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATT
TTGGGTCAGAGCTACGGCGGTTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATG
CACTGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAATGAAAAGTACTACAAGAGGTATCCACAAGATGTTGAAATTGTCCGCGAAGTTGTGAAAT
ACTTGGCCGAGAATGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTATCTTGACACCCAAAGGACTGCAAACTCTTGGTCTTTCTGCTTTGGGATCCAGTACAGGTTTTGAG
CGCTTGCACTATCTGTTTGAGAGAGTGTGGGATCCTATACTAGTTCCCGGAGCACCGAAACGAATCAGTTTTTTCTTCCTCAATGCTATTGATAACTGGCTCTCACTCGA
TTCAAATCCTCTTTACGTTCTCTTGCACGAATCGATATATTGCCAGGGCGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAATTCGATGCAA
ATAAAGCTGTAAAAGAAGGATGTCCCGTGTATTTCACTGGAGAGGTTCCAGTGGCAGCTGCGGTCTATTATGAAGATATGTTTGTGAACTTCAAGTTGGCCATGGAGACA
GCTTCCCAAATAGCAGGAATAAGGTTGTGGATAACTAATGAATTTATGCATTCTGGTCTGCGTGATGCGGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGG
AAAGAAGCCTTTATTCTGATGAGGTTTTTCTGATCTTTGTTGTTTTTCTTCATAATTATTGGATTAAGTTTATGGCATAAGCTTTTCATTTTCTTCTCAATAAGCTTGCT
AGTCGTTAGTTAGTGTATGAGATGAGATCAGGAGATTGTAGACTGGTTAATCCTTTCTCCTAGATAAGTTTTTTCCCCATTAATTATAACTATAAGTAAATT
Protein sequence
MFAARTAAPPLLIKPLLHSHSLTCRFLSLIPLPKLFSAAHCLRSVRLSAVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFYVPLNYSLDQVSSPKISVFAREVVSEY
ALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKKHRGTGLSTPLTPSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFA
PQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPG
APKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMH
SGLRDAGPQVLDHLMGLLNGKKPLF
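
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code, which this record does not explicitly name; `translate` and `CODON_TABLE` are hypothetical names used only for illustration, and the short inputs are the first and last codons of the CDS as listed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear table applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence pasted as
    wrapped lines can be passed directly. Codons containing characters
    other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        residues.append(aa)
    return "".join(residues)


# First six codons and last four codons of the CDS listed above.
print(translate("ATGTTCGCAGCTCGCACG"))
print(translate("CCTTTATTCTGA"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence gives a quick consistency check between the two listings.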