; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G080350 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G080350
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationCiama_Chr05:1297661..1301591
RNA-Seq ExpressionCaUC05G080350
SyntenyCaUC05G080350
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048938.1 proline iminopeptidase [Cucumis melo var. makuwa]4.7e-22780.43Show/hide
Query:  MAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVD
        MAG   P   SPPVHVAGTWYSVPELRLRDH+FSVPLNYSLDQ SS +ISVFAREVVS      GK           +  P  + + P +      K  D
Subjt:  MAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVD

Query:  GYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGI
         ++     +RGTGLSTPLTPSSM QF+SAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ      SYGGFCAVTYLSFAPQGLKQVLITGGI
Subjt:  GYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGI

Query:  PPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPK
        PPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGG+LLPSGGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPK
Subjt:  PPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPK

Query:  RISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQ
        RISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE                                
Subjt:  RISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQ

Query:  MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLN
        MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLN
Subjt:  MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLN

Query:  GKKPLF
        GKKPLF
Subjt:  GKKPLF

XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus]3.2e-22375.75Show/hide
Query:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ
        SLP   LPL P       +  RR +     MAG   P  ASPPVHV+GTWYSVPELRLRDH+FSVPLNYSL+Q S  +ISVFAREVVS      GK    
Subjt:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ

Query:  CHTFCTYKVDPDLSAPDQLKQVDGYKK-----------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ
               +  P        +     +K           +RGTGLSTPLTPSSM QFQS++DLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ   
Subjt:  CHTFCTYKVDPDLSAPDQLKQVDGYKK-----------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ

Query:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL
           SYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAENGGG+LLPSGGILTPKGLQTLGL
Subjt:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL

Query:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG
        SALG+STGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYFTGE  
Subjt:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG

Query:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG
                                      MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAG
Subjt:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG

Query:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IRLW+TNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo]6.2e-22777.43Show/hide
Query:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ
        SLP   LPL P       +  RR +     MAG   P   SPPVHVAGTWYSVPELRLRDH+FSVPLNYSLDQ SS +ISVFAREVVS      GK    
Subjt:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ

Query:  CHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ
               +  P  + + P +      K  + ++     +RGTGLSTPLTPSSM QF+SAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ   
Subjt:  CHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ

Query:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL
           SYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGG+LLPSGGILTPKGLQTLGL
Subjt:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL

Query:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG
        SALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE  
Subjt:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG

Query:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG
                                      MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG
Subjt:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG

Query:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]2.1e-21977.12Show/hide
Query:  VMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK--
        VMA TNP NGASPP H AGTWYSVPELRLRDHYFSVPLNYSLD  SSPKISV+AREVVS      GK           +  P    P   +     +K  
Subjt:  VMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK--

Query:  ---------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGG
                 +RGTGLSTPL+PSSM QFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ      SYGGFCAVTYLSFAPQGLKQVLITGG
Subjt:  ---------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGG

Query:  IPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAP
        IPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGGI LP GGILTPKGLQTLGLSALGSSTGFER+HYLFERVWDPI+VPGAP
Subjt:  IPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAP

Query:  KRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLM
        KRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGE                               
Subjt:  KRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLM

Query:  QMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLL
         MIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLL
Subjt:  QMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLL

Query:  NGKKPLF
        NGKKPLF
Subjt:  NGKKPLF

XP_023538651.1 uncharacterized protein LOC111799530 [Cucurbita pepo subsp. pepo]5.6e-22077.12Show/hide
Query:  VMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK--
        VMA TNP NGASPP H AGTWYSVPELRLRDHYFSVPLNYSLD  SSPKISV+AREVVS      GK  +        +  P    P   +     +K  
Subjt:  VMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK--

Query:  ---------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGG
                 +RGTGLSTPL+PSSM QFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ      SYGGFCAVTYLSFAP+GLKQVLITGG
Subjt:  ---------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGG

Query:  IPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAP
        IPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDVEIV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALGSSTGFER+HYLFERVWDPI+VPGAP
Subjt:  IPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAP

Query:  KRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLM
        KRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGE                               
Subjt:  KRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLM

Query:  QMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLL
         MIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLL
Subjt:  QMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLL

Query:  NGKKPLF
        NGKKPLF
Subjt:  NGKKPLF

TrEMBL top hitse value%identityAlignment
A0A0A0L423 AB hydrolase-1 domain-containing protein1.5e-22375.75Show/hide
Query:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ
        SLP   LPL P       +  RR +     MAG   P  ASPPVHV+GTWYSVPELRLRDH+FSVPLNYSL+Q S  +ISVFAREVVS      GK    
Subjt:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ

Query:  CHTFCTYKVDPDLSAPDQLKQVDGYKK-----------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ
               +  P        +     +K           +RGTGLSTPLTPSSM QFQS++DLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ   
Subjt:  CHTFCTYKVDPDLSAPDQLKQVDGYKK-----------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ

Query:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL
           SYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLAENGGG+LLPSGGILTPKGLQTLGL
Subjt:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL

Query:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG
        SALG+STGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYFTGE  
Subjt:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG

Query:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG
                                      MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAG
Subjt:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG

Query:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IRLW+TNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase3.0e-22777.43Show/hide
Query:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ
        SLP   LPL P       +  RR +     MAG   P   SPPVHVAGTWYSVPELRLRDH+FSVPLNYSLDQ SS +ISVFAREVVS      GK    
Subjt:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ

Query:  CHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ
               +  P  + + P +      K  + ++     +RGTGLSTPLTPSSM QF+SAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ   
Subjt:  CHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ

Query:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL
           SYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGG+LLPSGGILTPKGLQTLGL
Subjt:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL

Query:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG
        SALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE  
Subjt:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG

Query:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG
                                      MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG
Subjt:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG

Query:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A5A7U143 Proline iminopeptidase2.3e-22780.43Show/hide
Query:  MAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVD
        MAG   P   SPPVHVAGTWYSVPELRLRDH+FSVPLNYSLDQ SS +ISVFAREVVS      GK           +  P  + + P +      K  D
Subjt:  MAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDP--DLSAPDQL-----KQVD

Query:  GYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGI
         ++     +RGTGLSTPLTPSSM QF+SAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ      SYGGFCAVTYLSFAPQGLKQVLITGGI
Subjt:  GYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGI

Query:  PPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPK
        PPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGG+LLPSGGILTPKGLQTLGLSALG+STGFERLHYLFERVWDPILVPGAPK
Subjt:  PPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPK

Query:  RISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQ
        RISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE                                
Subjt:  RISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQ

Query:  MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLN
        MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLN
Subjt:  MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLN

Query:  GKKPLF
        GKKPLF
Subjt:  GKKPLF

A0A5D3D1Y5 Proline iminopeptidase3.0e-22777.43Show/hide
Query:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ
        SLP   LPL P       +  RR +     MAG   P   SPPVHVAGTWYSVPELRLRDH+FSVPLNYSLDQ SS +ISVFAREVVS      GK    
Subjt:  SLPLFTLPL-PFVKSTPKSFLRRPLP-EIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQ

Query:  CHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ
               +  P  + + P +      K  + ++     +RGTGLSTPLTPSSM QF+SAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ   
Subjt:  CHTFCTYKVDP--DLSAPDQL-----KQVDGYK----KRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQ

Query:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL
           SYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQD+EIVREVVKYLA+NGGG+LLPSGGILTPKGLQTLGL
Subjt:  SAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGL

Query:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG
        SALG+STGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGE  
Subjt:  SALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVG

Query:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG
                                      MIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG
Subjt:  ANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAG

Query:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  IRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC1114406901.0e-21977.12Show/hide
Query:  VMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK--
        VMA TNP NGASPP H AGTWYSVPELRLRDHYFSVPLNYSLD  SSPKISV+AREVVS      GK           +  P    P   +     +K  
Subjt:  VMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK--

Query:  ---------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGG
                 +RGTGLSTPL+PSSM QFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQ      SYGGFCAVTYLSFAPQGLKQVLITGG
Subjt:  ---------RRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGG

Query:  IPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAP
        IPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQDV+IV EVVKYL ENGGGI LP GGILTPKGLQTLGLSALGSSTGFER+HYLFERVWDPI+VPGAP
Subjt:  IPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAP

Query:  KRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLM
        KRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSRWSAQRI NE+ENKFDA KAVKEGCPVYFTGE                               
Subjt:  KRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLM

Query:  QMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLL
         MIFPWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GPQVLDHLMGLL
Subjt:  QMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLL

Query:  NGKKPLF
        NGKKPLF
Subjt:  NGKKPLF

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH1.4e-4830.48Show/hide
Query:  RLRDHYFSVPLNYSLDQVSSPKISVFAREV-----VSEFALQW-----GKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYK----KRRGTGLSTPLTPSSM
        R  +  F VPLN+S  +     + +FAR +     V +  L W     G     C T   Y   P +          GY+      RGTG S+P+T  ++
Subjt:  RLRDHYFSVPLNYSLDQVSSPKISVFAREV-----VSEFALQW-----GKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYK----KRRGTGLSTPLTPSSM

Query:  LQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK
         Q    +  A+ LK FRADNIV D E +R  L  DA    + W+++      A S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F  
Subjt:  LQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK

Query:  VIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSN
           +NE YYK+YP+DV  V+ ++KYL EN   +   S G LTP+  Q LG+  LG   G + +H + +R  + +      K ++   L+ I+N   +  N
Subjt:  VIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSN

Query:  PLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDA
         +Y LL E +YCQG +  W A + + + + +F  N   +    ++FTGE                                 IF  MF+    L+  K  
Subjt:  PLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEIHALRPFKDA

Query:  AHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
        A +LA   DW  LY+ A L  N+VPV  A   EDM+V++ L   TAS++  ++  + N + H  +     +V+  L  L
Subjt:  AHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase7.4e-7436.07Show/hide
Query:  SPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------RR
        S P+H     Y +  +    H+F+VPL++         I++F R +  +  L      ++       +  P   AP         K+           +R
Subjt:  SPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKK-----------RR

Query:  GTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTAD
        GTG STP+  + +L   +    A+YL HFRAD+IV DAE IR +L PD  PW++LGQ      S+GGFC++TYLS  P  L +V +TGG+ PIG   +AD
Subjt:  GTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTAD

Query:  SVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAI
         VYRA +++V  +N  ++ R+P    I   +  +L  +   + LP+G  LT + LQ  GL  LG+S  FE L+YL E  +         ++++  FL  +
Subjt:  SVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAI

Query:  DNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEI
              ++NP++ +LHE IYC+GA+S W+A+R++ E    F A  A  +G    FTGE                                MIFPWMF++ 
Subjt:  DNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWMFDEI

Query:  HALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL
          L P K+AAH+LA+K DW PLYD   L  NKVPVA AVY EDM+V F  + ET   ++  R WITNE+ H+GLR  G Q+LD L+ L
Subjt:  HALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGL

Arabidopsis top hitse value%identityAlignment
AT3G61540.1 alpha/beta-Hydrolases superfamily protein5.1e-17964.4Show/hide
Query:  GASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKKR----------
        G S   HV G W+SVPELRLRDH F VPL+YS    SSPKI+VFARE+V+      GK           +  P    P +  +  G+ +R          
Subjt:  GASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFAREVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKKR----------

Query:  --RGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGC
          RGTGLSTPLT SSMLQF+SA++LA+YL HFRADNIV DAEFIR RLVP A PWTILGQ      S+GGFCA+TYLSFAP+GLKQVLITGGIPPIG  C
Subjt:  --RGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGC

Query:  TADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFF
        TAD VY A FE+V  QNEKYYKR+PQD+EIVRE+V YLAE+ GGG+ LPSGGILTPKGLQTLGLS LGSSTGFERLHY+ ERVWDPILV GAPK IS FF
Subjt:  TADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAEN-GGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHYLFERVWDPILVPGAPKRISFFF

Query:  LNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWM
        LNA ++W S D+NPLY LLHE+IYC+GASS WSA R++++ E KFDA KAVKE  PV FTGE                                MIFPWM
Subjt:  LNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYALFVMLMQMIFPWM

Query:  FDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        FDEIHAL+PFK AA +LA KEDWPPLYD+  L+NNKVPVAAAVYYEDM+VNFKL  ETAS I+GIRLW+TNEFMHSGLRDAG Q++DHL+G++NGKKPLF
Subjt:  FDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCGCCCCCCACCATAGTTAATCTAATCCACCCACCGTCCAAAAATGTTCGCAGCTCGCACGCCGCCAATTTTGATAAAGCCACTCCTTCACTTCCACTCTTTACCCTGCC
GCTTCCTTTCGTTAAATCCACTCCCAAATCTTTTCTCCGCCGCCCATTGCCGGAGATCGTTATGGCCGGAACCAATCCCCCTAATGGAGCATCGCCGCCAGTGCACGTAG
CTGGCACGTGGTACTCCGTGCCGGAGCTCCGTCTTCGGGACCATTACTTCTCTGTGCCTCTCAATTACTCTCTAGATCAGGTTTCTTCTCCTAAGATCTCCGTTTTTGCG
CGGGAAGTTGTTTCAGAATTTGCGTTGCAGTGGGGAAAGAGGAGCAACCAATGCCATACCTTTTGTACTTACAAGGTGGACCCGGATTTGAGTGCGCCCGACCAACTGAA
GCAAGTGGATGGATACAAAAAGCGTCGAGGAACAGGATTATCAACTCCTTTGACTCCATCATCCATGTTGCAATTTCAAAGTGCAGAGGACTTAGCCAACTACTTGAAAC
ATTTTCGAGCTGATAACATAGTTAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGATTTTGCAATCAGCATATAGC
TATGGTGGTTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACTGCAGATTC
TGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAATGAAAAGTACTACAAGAGGTATCCGCAAGATGTTGAAATTGTCCGCGAAGTTGTGAAATACTTGGCCGAGA
ATGGAGGCGGGATTCTTCTTCCCTCTGGTGGTATCTTGACACCCAAAGGACTGCAAACTCTTGGTCTTTCTGCTTTGGGATCCAGTACAGGTTTTGAGCGTTTGCATTAT
CTGTTTGAGAGAGTGTGGGATCCTATACTAGTTCCCGGAGCACCGAAACGAATCAGTTTTTTCTTCCTCAATGCTATTGATAACTGGCTCTCTCTTGATTCAAATCCTCT
TTATGTTCTCTTGCACGAATCAATATATTGCCAGGGCGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAATTTGATGCAAATAAAGCTGTAA
AAGAAGGATGTCCCGTGTATTTCACTGGAGAGGTAGGTGCAAACTGTGCTCTTTTAAATTCTCACTCCCACGGGCTCTTGAACTCCAAAAGAATGAAGTCTTATGCTCTG
TTTGTAATGCTTATGCAGATGATCTTCCCATGGATGTTTGACGAGATTCATGCCTTGAGACCATTCAAAGACGCAGCGCATATATTGGCCGATAAGGAGGATTGGCCTCC
TCTATATGACATCGCTGCTCTTAAAAATAACAAGGTTCCTGTGGCAGCTGCGGTCTATTACGAAGATATGTTTGTGAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAA
TAGCAGGAATAAGGTTGTGGATAACTAATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCCCAAGTTTTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCT
TTATTCTGA
mRNA sequenceShow/hide mRNA sequence
TCGCCCCCCACCATAGTTAATCTAATCCACCCACCGTCCAAAAATGTTCGCAGCTCGCACGCCGCCAATTTTGATAAAGCCACTCCTTCACTTCCACTCTTTACCCTGCC
GCTTCCTTTCGTTAAATCCACTCCCAAATCTTTTCTCCGCCGCCCATTGCCGGAGATCGTTATGGCCGGAACCAATCCCCCTAATGGAGCATCGCCGCCAGTGCACGTAG
CTGGCACGTGGTACTCCGTGCCGGAGCTCCGTCTTCGGGACCATTACTTCTCTGTGCCTCTCAATTACTCTCTAGATCAGGTTTCTTCTCCTAAGATCTCCGTTTTTGCG
CGGGAAGTTGTTTCAGAATTTGCGTTGCAGTGGGGAAAGAGGAGCAACCAATGCCATACCTTTTGTACTTACAAGGTGGACCCGGATTTGAGTGCGCCCGACCAACTGAA
GCAAGTGGATGGATACAAAAAGCGTCGAGGAACAGGATTATCAACTCCTTTGACTCCATCATCCATGTTGCAATTTCAAAGTGCAGAGGACTTAGCCAACTACTTGAAAC
ATTTTCGAGCTGATAACATAGTTAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGATTTTGCAATCAGCATATAGC
TATGGTGGTTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACTGCAGATTC
TGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAATGAAAAGTACTACAAGAGGTATCCGCAAGATGTTGAAATTGTCCGCGAAGTTGTGAAATACTTGGCCGAGA
ATGGAGGCGGGATTCTTCTTCCCTCTGGTGGTATCTTGACACCCAAAGGACTGCAAACTCTTGGTCTTTCTGCTTTGGGATCCAGTACAGGTTTTGAGCGTTTGCATTAT
CTGTTTGAGAGAGTGTGGGATCCTATACTAGTTCCCGGAGCACCGAAACGAATCAGTTTTTTCTTCCTCAATGCTATTGATAACTGGCTCTCTCTTGATTCAAATCCTCT
TTATGTTCTCTTGCACGAATCAATATATTGCCAGGGCGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAATTTGATGCAAATAAAGCTGTAA
AAGAAGGATGTCCCGTGTATTTCACTGGAGAGGTAGGTGCAAACTGTGCTCTTTTAAATTCTCACTCCCACGGGCTCTTGAACTCCAAAAGAATGAAGTCTTATGCTCTG
TTTGTAATGCTTATGCAGATGATCTTCCCATGGATGTTTGACGAGATTCATGCCTTGAGACCATTCAAAGACGCAGCGCATATATTGGCCGATAAGGAGGATTGGCCTCC
TCTATATGACATCGCTGCTCTTAAAAATAACAAGGTTCCTGTGGCAGCTGCGGTCTATTACGAAGATATGTTTGTGAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAA
TAGCAGGAATAAGGTTGTGGATAACTAATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCCCAAGTTTTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCT
TTATTCTGA
Protein sequenceShow/hide protein sequence
SPPTIVNLIHPPSKNVRSSHAANFDKATPSLPLFTLPLPFVKSTPKSFLRRPLPEIVMAGTNPPNGASPPVHVAGTWYSVPELRLRDHYFSVPLNYSLDQVSSPKISVFA
REVVSEFALQWGKRSNQCHTFCTYKVDPDLSAPDQLKQVDGYKKRRGTGLSTPLTPSSMLQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQILQSAYS
YGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDVEIVREVVKYLAENGGGILLPSGGILTPKGLQTLGLSALGSSTGFERLHY
LFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEVGANCALLNSHSHGLLNSKRMKSYAL
FVMLMQMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKP
LF