; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005840 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005840
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationscaffold254:1954767..1958525
RNA-Seq ExpressionMS005840
SyntenyMS005840
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597118.1 Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]7.2e-24683.53Show/hide
Query:  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQS
        FHSFP  A  L+PL  LL   A H RSSVRS AVMA  TNP N ASPP+HA G WYSVPELRLRDH+F+VPL+YSLD  +SPKISV+AREVV VGKEEQ 
Subjt:  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQS

Query:  MPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFG
        MPYL+YLQGGPGFE PRPTEASGW+QKACEEFR      RGTGLSTPL+ SSMSQFQ+AEDLANYLKHFRADNIVNDAEFIRTRLVPDA PWTILGQS+G
Subjt:  MPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFG

Query:  GFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS
        GFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEK+IIQNEKYYKRYPQDV+I+ EV KYL E+GGGV LP GGILTPKGLQ LGL ALGSS
Subjt:  GFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS

Query:  TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFD
        TGFER+HYLFERVWDP+IVPGAPKRIS FFL A   WLSLDSNPLY L+HESIYCQGASSRWSAQRI+NEL+NKFD   A+KEGCP++FTGEM+FPWMFD
Subjt:  TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFD

Query:  EIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF
        EIHAL+P KDAA ILAEKEDWPPLYD+AAL+NNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNE+MHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  EIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus]7.4e-25182.4Show/hide
Query:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
        MFAA  AAP     PLL +FHS PCR LPL+PL N  F  A H R SVR SA MA   +P  AASPP H +G WYSVPELRLRDHHF+VPL+YSL+Q + 
Subjt:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS

Query:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
         +ISVFAREVV VGKE+Q MPYLL+LQGGPGFE  RPTEASGW+QKACEEFR      RGTGLSTPLT SSMSQFQS++DLANYLKHFRADNIVNDAEFI
Subjt:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI

Query:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
        RTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLAE+GGGV+LPSG
Subjt:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG

Query:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
        GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++V G+PKRIS FFL A DNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNE++NKFDAN A+
Subjt:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL

Query:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
        KEGC ++FTGEM+FPWMFDEIHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNE+MHSGLRDAGP
Subjt:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP

Query:  QVLDHLMGLLNGKKPLF
        QVLDHLMGLLNGKKPLF
Subjt:  QVLDHLMGLLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo]1.2e-25383.56Show/hide
Query:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
        MFA   AAP     PLL +FHS P R LPL+PLPN  F  A H R SVR SA MA   +P  A SPP H  G WYSVPELRLRDHHF+VPL+YSLDQ +S
Subjt:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS

Query:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
         +ISVFAREVV VGKE+Q MPYLLYLQGGPGFE  RP+EASGW+QKACEEFR      RGTGLSTPLT SSMSQF+SAEDLANYLKHFRADNIVNDAEFI
Subjt:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI

Query:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
        RTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLA++GGGV+LPSG
Subjt:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG

Query:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
        GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++VPGAPKRIS FFL A DNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNE++NKFDAN A+
Subjt:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL

Query:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
        KEGCP++FTGEM+FPWMFDEIHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNE+MHSGLRDAGP
Subjt:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP

Query:  QVLDHLMGLLNGKKPLF
        QVLDHLMGLLNGKKPLF
Subjt:  QVLDHLMGLLNGKKPLF

XP_022147514.1 uncharacterized protein LOC111016418 [Momordica charantia]8.4e-30398.84Show/hide
Query:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
        MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
Subjt:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS

Query:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
        PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR      RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
Subjt:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI

Query:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
        RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
Subjt:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG

Query:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
        GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
Subjt:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL

Query:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
        KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
Subjt:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP

Query:  QVLDHLMGLLNGKKPLF
        QVLDHLMGLLNGKKPLF
Subjt:  QVLDHLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata]2.5e-24683.73Show/hide
Query:  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQS
        FHSFP  A  L+PL  LL   A H RSSVRS AVMA  TNP N ASPP+HA G WYSVPELRLRDH+F+VPL+YSLD  +SPKISV+AREVV VGKEEQ 
Subjt:  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQS

Query:  MPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFG
        MPYLLYLQGGPGFE PRPTEASGW+QKACEEFR      RGTGLSTPL+ SSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDA PWTILGQS+G
Subjt:  MPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFG

Query:  GFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS
        GFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEK+IIQNEKYYKRYPQDV+I+ EV KYL E+GGG+ LP GGILTPKGLQ LGL ALGSS
Subjt:  GFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS

Query:  TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFD
        TGFER+HYLFERVWDP+IVPGAPKRIS FFL A   WLSLDSNPLY L+HESIYCQGASSRWSAQRI NEL+NKFDA  A+KEGCP++FTGEM+FPWMFD
Subjt:  TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFD

Query:  EIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF
        EIHAL+P KDAA ILAEKEDWPPLYD+AAL+NNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNE+MHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  EIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF

TrEMBL top hitse value%identityAlignment
A0A0A0L423 AB hydrolase-1 domain-containing protein3.6e-25182.4Show/hide
Query:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
        MFAA  AAP     PLL +FHS PCR LPL+PL N  F  A H R SVR SA MA   +P  AASPP H +G WYSVPELRLRDHHF+VPL+YSL+Q + 
Subjt:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS

Query:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
         +ISVFAREVV VGKE+Q MPYLL+LQGGPGFE  RPTEASGW+QKACEEFR      RGTGLSTPLT SSMSQFQS++DLANYLKHFRADNIVNDAEFI
Subjt:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI

Query:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
        RTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLAE+GGGV+LPSG
Subjt:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG

Query:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
        GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++V G+PKRIS FFL A DNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNE++NKFDAN A+
Subjt:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL

Query:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
        KEGC ++FTGEM+FPWMFDEIHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNE+MHSGLRDAGP
Subjt:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP

Query:  QVLDHLMGLLNGKKPLF
        QVLDHLMGLLNGKKPLF
Subjt:  QVLDHLMGLLNGKKPLF

A0A1S3AUX5 proline iminopeptidase5.9e-25483.56Show/hide
Query:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
        MFA   AAP     PLL +FHS P R LPL+PLPN  F  A H R SVR SA MA   +P  A SPP H  G WYSVPELRLRDHHF+VPL+YSLDQ +S
Subjt:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS

Query:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
         +ISVFAREVV VGKE+Q MPYLLYLQGGPGFE  RP+EASGW+QKACEEFR      RGTGLSTPLT SSMSQF+SAEDLANYLKHFRADNIVNDAEFI
Subjt:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI

Query:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
        RTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLA++GGGV+LPSG
Subjt:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG

Query:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
        GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++VPGAPKRIS FFL A DNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNE++NKFDAN A+
Subjt:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL

Query:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
        KEGCP++FTGEM+FPWMFDEIHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNE+MHSGLRDAGP
Subjt:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP

Query:  QVLDHLMGLLNGKKPLF
        QVLDHLMGLLNGKKPLF
Subjt:  QVLDHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase5.9e-25483.56Show/hide
Query:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
        MFA   AAP     PLL +FHS P R LPL+PLPN  F  A H R SVR SA MA   +P  A SPP H  G WYSVPELRLRDHHF+VPL+YSLDQ +S
Subjt:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS

Query:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
         +ISVFAREVV VGKE+Q MPYLLYLQGGPGFE  RP+EASGW+QKACEEFR      RGTGLSTPLT SSMSQF+SAEDLANYLKHFRADNIVNDAEFI
Subjt:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI

Query:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
        RTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLA++GGGV+LPSG
Subjt:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG

Query:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
        GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++VPGAPKRIS FFL A DNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNE++NKFDAN A+
Subjt:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL

Query:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
        KEGCP++FTGEM+FPWMFDEIHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNE+MHSGLRDAGP
Subjt:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP

Query:  QVLDHLMGLLNGKKPLF
        QVLDHLMGLLNGKKPLF
Subjt:  QVLDHLMGLLNGKKPLF

A0A6J1D184 uncharacterized protein LOC1110164184.1e-30398.84Show/hide
Query:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
        MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS
Subjt:  MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDAS

Query:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
        PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR      RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI
Subjt:  PKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFI

Query:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
        RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
Subjt:  RTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG

Query:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
        GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL
Subjt:  GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNAL

Query:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
        KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP
Subjt:  KEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGP

Query:  QVLDHLMGLLNGKKPLF
        QVLDHLMGLLNGKKPLF
Subjt:  QVLDHLMGLLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC1114406901.2e-24683.73Show/hide
Query:  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQS
        FHSFP  A  L+PL  LL   A H RSSVRS AVMA  TNP N ASPP+HA G WYSVPELRLRDH+F+VPL+YSLD  +SPKISV+AREVV VGKEEQ 
Subjt:  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQS

Query:  MPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFG
        MPYLLYLQGGPGFE PRPTEASGW+QKACEEFR      RGTGLSTPL+ SSMSQFQSAEDLA+YLKHFRADNIVNDAEFIRTRLVPDA PWTILGQS+G
Subjt:  MPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFG

Query:  GFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS
        GFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEK+IIQNEKYYKRYPQDV+I+ EV KYL E+GGG+ LP GGILTPKGLQ LGL ALGSS
Subjt:  GFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS

Query:  TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFD
        TGFER+HYLFERVWDP+IVPGAPKRIS FFL A   WLSLDSNPLY L+HESIYCQGASSRWSAQRI NEL+NKFDA  A+KEGCP++FTGEM+FPWMFD
Subjt:  TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFD

Query:  EIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF
        EIHAL+P KDAA ILAEKEDWPPLYD+AAL+NNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNE+MHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  EIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF

SwissProt top hitse value%identityAlignment
A0A1L9WUM2 Proline iminopeptidase aneH3.2e-6334.17Show/hide
Query:  RLRDHHFTVPLDYSLDQDASPKISVFAREV-VLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEE-------FRRGTGLSTPLTSSSMSQFQSA
        R  +  F VPL++S   + +  + +FAR +  ++G ++  +P++LYLQGGPG     P E + W+    E+         RGTG S+P+T+ +++Q    
Subjt:  RLRDHHFTVPLDYSLDQDASPKISVFAREV-VLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEE-------FRRGTGLSTPLTSSSMSQFQSA

Query:  EDLANYLKHFRADNIVNDAEFIRTRLVPDA----GPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRY
        +  A+ LK FRADNIV D E +R  L  DA      W+++  SFGGFCA++Y+S  P  L +V I GG  P+ N+     V    F     +NE YYK+Y
Subjt:  EDLANYLKHFRADNIVNDAEFIRTRLVPDA----GPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRY

Query:  PQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYC
        P+DV  ++ + KYL E+    V  S G LTP+  Q LG+  LG   G + +H + +R  + +      K +++  L   +N   +  N +Y LL E +YC
Subjt:  PQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYC

Query:  QGASSRWSAQRIKNELKNKFDANNALKE-GCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAM
        QG +  W A + +     K D   +L E    ++FTGE +F  MF+    L+ LK  A +LA   DW  LY+ A L  N+VPV  A   EDM+V++ L  
Subjt:  QGASSRWSAQRIKNELKNKFDANNALKE-GCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAM

Query:  ETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGL
         TAS++  ++  + N + H  +     +V+  L  L
Subjt:  ETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase4.3e-10044.83Show/hide
Query:  YSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQ
        Y +  +    H FTVPLD+    D    I++F R +    + +  +P+LLYLQGGPGF +PRP+   GW+++A +EFR      RGTG STP+ +  ++ 
Subjt:  YSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQ

Query:  FQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRY
            +  A+YL HFRAD+IV DAE IR +L PD  PW++LGQSFGGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA +++V  +N  ++ R+
Subjt:  FQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRY

Query:  PQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYC
        P    I   +A +L  H   V LP+G  LT + LQ  GL  LG+S  FE L+YL E  +         ++++  FL         ++NP++ +LHE IYC
Subjt:  PQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYC

Query:  QGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAME
        +GA+S W+A+R++ E    F A  A  +G    FTGEM+FPWMF++   L PLK+AA +LAEK DW PLYD   L  NKVPVA AVY EDM+V F  + E
Subjt:  QGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAME

Query:  TASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGL
        T   ++  R WITNEY H+GLR  G Q+LD L+ L
Subjt:  TASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGL

Arabidopsis top hitse value%identityAlignment
AT3G61540.1 alpha/beta-Hydrolases superfamily protein2.4e-20771.05Show/hide
Query:  LPNLLFAYAYHRRSSVRSSAVMATETN-PINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPG
        +P L+  Y   R   V +S   A      +   S  +H TGKW+SVPELRLRDH F VPLDYS    +SPKI+VFARE+V VGKEEQ+MPYLLYLQGGPG
Subjt:  LPNLLFAYAYHRRSSVRSSAVMATETN-PINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPG

Query:  FESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAP
        FE PRP+EASGW+Q+ACEEFR      RGTGLSTPLT SSM QF+SA++LA+YL HFRADNIV DAEFIR RLVP A PWTILGQSFGGFCA+TYLSFAP
Subjt:  FESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAP

Query:  QGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAE-HGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFE
        +GLKQVLITGGIPPIG  CTAD VY A FE+V  QNEKYYKR+PQD+EI+RE+  YLAE  GGGV LPSGGILTPKGLQ LGL  LGSSTGFERLHY+ E
Subjt:  QGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAE-HGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFE

Query:  RVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDA
        RVWDP++V GAPK IS FFL A ++W S D+NPLY LLHE+IYC+GASS WSA R++++ + KFDA  A+KE  P+ FTGEM+FPWMFDEIHAL+P K A
Subjt:  RVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDA

Query:  ARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF
        A +LA+KEDWPPLYDV  L+NNKVPVAAAVYYEDM+VNFKL  ETAS I+GIRLW+TNE+MHSGLRDAG Q++DHL+G++NGKKPLF
Subjt:  ARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGCTGCTTCCGCGGCAGCGCCGCCCCGTTTGATAAAGCCACTCCTTTTTTACTTCCACTCCTTCCCATGCCGCGCGCTCCCGTTAGTTCCACTCCCAAACCTTCT
CTTCGCCTACGCCTACCATCGCCGGAGCTCGGTTCGTTCATCGGCAGTAATGGCCACCGAGACCAATCCCATTAATGCAGCATCGCCGCCGGATCACGCGACCGGGAAAT
GGTACTCCGTGCCGGAGCTCCGGCTCCGCGACCACCACTTCACTGTGCCTCTCGATTACTCTCTGGATCAGGATGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTT
GTTTTGGTGGGGAAAGAAGAGCAGTCGATGCCATATCTTTTATACTTACAAGGTGGACCGGGATTTGAGAGTCCCCGACCGACCGAAGCAAGTGGATGGATGCAAAAAGC
GTGTGAAGAATTTCGTCGAGGAACAGGATTATCAACTCCTCTGACTTCATCGTCTATGTCGCAATTTCAAAGTGCAGAGGACTTAGCCAACTACTTGAAACATTTTCGAG
CTGACAACATCGTGAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGGACCTTGGACTATTTTGGGTCAGAGCTTTGGTGGTTTTTGTGCAGTTACTTAT
TTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATAAATGCACTGCGGATTCTGTATATAGAGCATGCTTTGAAAAGGT
TATAATTCAAAACGAAAAGTACTACAAGCGGTATCCTCAGGATGTTGAAATCATCCGTGAAGTTGCGAAATACTTGGCTGAGCATGGTGGCGGGGTTGTTCTTCCCTCTG
GTGGTATCCTAACCCCCAAGGGGCTGCAAGTTCTTGGTCTTTTTGCTTTAGGATCTAGTACAGGTTTTGAGCGCTTGCATTATCTCTTTGAGAGAGTTTGGGATCCTGTA
ATAGTTCCTGGAGCGCCAAAACGAATCAGTTCTTTCTTCCTCCGCGCTTGTGATAACTGGCTCTCACTTGATTCAAATCCTCTATATGTTCTTCTACATGAATCAATATA
TTGCCAGGGTGCCTCATCTCGGTGGTCTGCTCAAAGAATAAAGAATGAACTTAAGAATAAGTTCGATGCAAATAACGCCCTGAAAGAAGGATGTCCCATGTTTTTCACAG
GCGAGATGGTCTTCCCATGGATGTTTGACGAGATTCATGCCTTGAGACCATTGAAAGATGCTGCTCGTATATTGGCCGAGAAAGAGGATTGGCCTCCACTATATGACGTT
GCCGCTCTTAGAAACAACAAGGTTCCAGTCGCAGCTGCTGTTTATTACGAAGATATGTTTGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAGATAGCAGGAATAAG
GCTGTGGATAACCAATGAATATATGCATTCTGGTCTGCGAGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTC
mRNA sequenceShow/hide mRNA sequence
ATGTTCGCTGCTTCCGCGGCAGCGCCGCCCCGTTTGATAAAGCCACTCCTTTTTTACTTCCACTCCTTCCCATGCCGCGCGCTCCCGTTAGTTCCACTCCCAAACCTTCT
CTTCGCCTACGCCTACCATCGCCGGAGCTCGGTTCGTTCATCGGCAGTAATGGCCACCGAGACCAATCCCATTAATGCAGCATCGCCGCCGGATCACGCGACCGGGAAAT
GGTACTCCGTGCCGGAGCTCCGGCTCCGCGACCACCACTTCACTGTGCCTCTCGATTACTCTCTGGATCAGGATGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTT
GTTTTGGTGGGGAAAGAAGAGCAGTCGATGCCATATCTTTTATACTTACAAGGTGGACCGGGATTTGAGAGTCCCCGACCGACCGAAGCAAGTGGATGGATGCAAAAAGC
GTGTGAAGAATTTCGTCGAGGAACAGGATTATCAACTCCTCTGACTTCATCGTCTATGTCGCAATTTCAAAGTGCAGAGGACTTAGCCAACTACTTGAAACATTTTCGAG
CTGACAACATCGTGAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGGACCTTGGACTATTTTGGGTCAGAGCTTTGGTGGTTTTTGTGCAGTTACTTAT
TTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATAAATGCACTGCGGATTCTGTATATAGAGCATGCTTTGAAAAGGT
TATAATTCAAAACGAAAAGTACTACAAGCGGTATCCTCAGGATGTTGAAATCATCCGTGAAGTTGCGAAATACTTGGCTGAGCATGGTGGCGGGGTTGTTCTTCCCTCTG
GTGGTATCCTAACCCCCAAGGGGCTGCAAGTTCTTGGTCTTTTTGCTTTAGGATCTAGTACAGGTTTTGAGCGCTTGCATTATCTCTTTGAGAGAGTTTGGGATCCTGTA
ATAGTTCCTGGAGCGCCAAAACGAATCAGTTCTTTCTTCCTCCGCGCTTGTGATAACTGGCTCTCACTTGATTCAAATCCTCTATATGTTCTTCTACATGAATCAATATA
TTGCCAGGGTGCCTCATCTCGGTGGTCTGCTCAAAGAATAAAGAATGAACTTAAGAATAAGTTCGATGCAAATAACGCCCTGAAAGAAGGATGTCCCATGTTTTTCACAG
GCGAGATGGTCTTCCCATGGATGTTTGACGAGATTCATGCCTTGAGACCATTGAAAGATGCTGCTCGTATATTGGCCGAGAAAGAGGATTGGCCTCCACTATATGACGTT
GCCGCTCTTAGAAACAACAAGGTTCCAGTCGCAGCTGCTGTTTATTACGAAGATATGTTTGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAGATAGCAGGAATAAG
GCTGTGGATAACCAATGAATATATGCATTCTGGTCTGCGAGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTC
Protein sequenceShow/hide protein sequence
MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREV
VLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFRRGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTY
LSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPV
IVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDV
AALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF