CuGenDBv2

Cucsat.G1048 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G1048
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: AB hydrolase-1 domain-containing protein
Genome location: ctg1:7118705..7122364
RNA-Seq Expression: Cucsat.G1048
Synteny: Cucsat.G1048
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0004177 - aminopeptidase activity (molecular function)
  GO:0008233 - peptidase activity (molecular function)
InterPro domains:
  IPR000073 - Alpha/beta hydrolase fold-1
  IPR002410 - Peptidase S33
  IPR029058 - Alpha/Beta hydrolase fold
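The genome location in this record is written as contig:start..end. The following is a minimal Python sketch of splitting such a string into its parts. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and the function name parse_location is hypothetical rather than part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split 'contig:start..end' into (contig, start, end, span_length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
    return contig, start, end, end - start + 1


print(parse_location("ctg1:7118705..7122364"))
```

If the coordinates turn out to be 0-based or half-open, only the span calculation on the last line of the function would need to change.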


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048938.1 proline iminopeptidase [Cucumis melo var. makuwa] (e value: 0.0; % identity: 96.11)
Query:  MAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVI
        MAGILSPRA SPPVHV+GTWYSVPELRLRDHHFSVPLNYSL+Q S TRISVFAREVVSVGKEDQPMPYLL+LQGGPGFECARP+EASGWIQKAC+EFRVI
Subjt:  MAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVI

Query:  LMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
        LMDQRGTGLSTPLTPSSMSQF+S++DLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
Subjt:  LMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS

Query:  VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAID
        VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLA+NGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILV G+PKRISFFFLNAID
Subjt:  VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAID

Query:  NWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
        NWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
Subjt:  NWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV

Query:  PVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        PVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  PVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

XP_004133842.3 uncharacterized protein LOC101216845 [Cucumis sativus] (e value: 0.0; % identity: 100)
Query:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR
        MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR
Subjt:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR

Query:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

XP_008437982.1 PREDICTED: proline iminopeptidase [Cucumis melo] (e value: 0.0; % identity: 95.87)
Query:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR
        MFA RTAAPPLLLHFHSLP R+LPLIPL NFLSAAHCRRSVRLSAAMAGILSPRA SPPVHV+GTWYSVPELRLRDHHFSVPLNYSL+Q S TRISVFAR
Subjt:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR

Query:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLL+LQGGPGFECARP+EASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQF+S++DLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLA+NGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

XP_022147514.1 uncharacterized protein LOC111016418 [Momordica charantia] (e value: 0.0; % identity: 83.37)
Query:  MFAARTAAPP-----LLLHFHSLPCRVLPLIPLRN--FLSAAHCRRSVRLSAAMAGILSP-RAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASC
        MFAA  AAPP     LL +FHS PCR LPL+PL N  F  A H R SVR SA MA   +P  AASPP H +G WYSVPELRLRDHHF+VPL+YSL+Q + 
Subjt:  MFAARTAAPP-----LLLHFHSLPCRVLPLIPLRN--FLSAAHCRRSVRLSAAMAGILSP-RAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASC

Query:  TRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFI
         +ISVFAREVV VGKE+Q MPYLL+LQGGPGFE  RPTEASGW+QKACEEFRV+LMDQRGTGLSTPLT SSMSQFQS++DLANYLKHFRADNIVNDAEFI
Subjt:  TRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFI

Query:  RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSG
        RTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLAE+GGGV+LPSG
Subjt:  RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSG

Query:  GILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAV
        GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++V G+PKRIS FFL A DNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNE++NKFDAN A+
Subjt:  GILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAV

Query:  KEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGP
        KEGC ++FTGEM+FPWMFDEIHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNE+MHSGLRDAGP
Subjt:  KEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGP

Query:  QVLDHLMGLLNGKKPLF
        QVLDHLMGLLNGKKPLF
Subjt:  QVLDHLMGLLNGKKPLF

XP_022933365.1 uncharacterized protein LOC111440690 [Cucurbita moschata] (e value: 0.0; % identity: 86.26)
Query:  FHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPY
        FHS P     LIPL   LSA HCR SVR  A MA       ASPP H +GTWYSVPELRLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVGKE+QPMPY
Subjt:  FHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPY

Query:  LLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
        LL+LQGGPGFEC RPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQS++DLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
Subjt:  LLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC

Query:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGF
        AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALG+STGF
Subjt:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGF

Query:  ERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIH
        ER+HYLFERVWDPI+V G+PKRIS+FFLNAI  WLSLDSNPLY L+HE+IYCQGASSRWSAQRI NE+ENKFDA KAVKEGC VYFTGEMIFPWMFDEIH
Subjt:  ERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIH

Query:  ALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        AL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+TASQIAGIRLWVTNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  ALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L423 AB hydrolase-1 domain-containing protein (e value: 0.0; % identity: 100)
Query:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR
        MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR
Subjt:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR

Query:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

A0A1S3AUX5 proline iminopeptidase (e value: 0.0; % identity: 95.87)
Query:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR
        MFA RTAAPPLLLHFHSLP R+LPLIPL NFLSAAHCRRSVRLSAAMAGILSPRA SPPVHV+GTWYSVPELRLRDHHFSVPLNYSL+Q S TRISVFAR
Subjt:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR

Query:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLL+LQGGPGFECARP+EASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQF+S++DLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLA+NGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

A0A5A7U143 Proline iminopeptidase (e value: 0.0; % identity: 96.11)
Query:  MAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVI
        MAGILSPRA SPPVHV+GTWYSVPELRLRDHHFSVPLNYSL+Q S TRISVFAREVVSVGKEDQPMPYLL+LQGGPGFECARP+EASGWIQKAC+EFRVI
Subjt:  MAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVI

Query:  LMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
        LMDQRGTGLSTPLTPSSMSQF+S++DLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS
Subjt:  LMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADS

Query:  VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAID
        VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLA+NGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILV G+PKRISFFFLNAID
Subjt:  VYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAID

Query:  NWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
        NWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV
Subjt:  NWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKV

Query:  PVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        PVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
Subjt:  PVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

A0A5D3D1Y5 Proline iminopeptidase (e value: 0.0; % identity: 95.87)
Query:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR
        MFA RTAAPPLLLHFHSLP R+LPLIPL NFLSAAHCRRSVRLSAAMAGILSPRA SPPVHV+GTWYSVPELRLRDHHFSVPLNYSL+Q S TRISVFAR
Subjt:  MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAR

Query:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA
        EVVSVGKEDQPMPYLL+LQGGPGFECARP+EASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQF+S++DLANYLKHFRADNIVNDAEFIRTRLVPDA
Subjt:  EVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDA

Query:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL
        APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLA+NGGGVLLPSGGILTPKGL
Subjt:  APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGL

Query:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF
        QTLGLSALGTSTGFERLHYLFERVWDPILV G+PKRISFFFLNAIDNWLSLDSNPLYVLLHE+IYCQGASSRWSAQRIKNEVENKFDANKAVKEGC VYF
Subjt:  QTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYF

Query:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG
        TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAM+TASQIAGIRLW+TNEFMHSGLRDAGPQVLDHLMG
Subjt:  TGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMG

Query:  LLNGKKPLF
        LLNGKKPLF
Subjt:  LLNGKKPLF

A0A6J1F4P5 uncharacterized protein LOC111440690 (e value: 0.0; % identity: 86.26)
Query:  FHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPY
        FHS P     LIPL   LSA HCR SVR  A MA       ASPP H +GTWYSVPELRLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVGKE+QPMPY
Subjt:  FHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPY

Query:  LLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
        LL+LQGGPGFEC RPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQS++DLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC
Subjt:  LLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFC

Query:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGF
        AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALG+STGF
Subjt:  AVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGF

Query:  ERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIH
        ER+HYLFERVWDPI+V G+PKRIS+FFLNAI  WLSLDSNPLY L+HE+IYCQGASSRWSAQRI NE+ENKFDA KAVKEGC VYFTGEMIFPWMFDEIH
Subjt:  ERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIH

Query:  ALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        AL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+TASQIAGIRLWVTNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Subjt:  ALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF

SwissProt top hits (e value, % identity, alignment)
A0A1L9WUM2 Proline iminopeptidase aneH (e value: 3.9e-69; % identity: 35.63)
Query:  RLRDHHFSVPLNYSLNQASCTRISVFAREVVSV-GKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEE-FRVILMDQRGTGLSTPLTPSSMSQFQSS
        R  +  F VPLN+S       R+  FAR +  V G +D  +P++L+LQGGPG  C  P E + W+    E+ +RV+ +D+RGTG S+P+T  +++Q    
Subjt:  RLRDHHFSVPLNYSLNQASCTRISVFAREVVSV-GKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEE-FRVILMDQRGTGLSTPLTPSSMSQFQSS

Query:  DDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRY
           A+ LK FRADNIV D E +R  L  DA    + W+++  S+GGFCA++Y+S  P  L +V I GG  P+ N      V    F     +NE YYK+Y
Subjt:  DDLANYLKHFRADNIVNDAEFIRTRLVPDA----APWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRY

Query:  PQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYC
        P+D+  V+ ++KYL EN   +   S G LTP+  Q LG+  LG   G + +H + +R  + +      K ++   L+ I+N   +  N +Y LL E +YC
Subjt:  PQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYC

Query:  QGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMD
        QG +  W A + + + + +F  N   +    ++FTGE IF  MF+    L+  K  A +LA   DW  LY+ A L  N+VPV  A   EDM+V++ L   
Subjt:  QGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMD

Query:  TASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGL
        TAS++  ++  V N + H  +     +V+  L  L
Subjt:  TASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGL

P46547 Proline iminopeptidase (e value: 4.3e-100; % identity: 43.6)
Query:  SPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLS
        S P+H     Y +  +    H F+VPL++       T I++F R +    + D  +P+LL+LQGGPGF   RP+   GWI++A +EFRV+L+DQRGTG S
Subjt:  SPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLS

Query:  TPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVI
        TP+    ++        A+YL HFRAD+IV DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD VYRA +++V 
Subjt:  TPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVI

Query:  IQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPL
         +N  ++ R+P    I   +  +L  +   V LP+G  LT + LQ  GL  LG S  FE L+YL E  +         ++++  FL  +      ++NP+
Subjt:  IQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPL

Query:  YVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYED
        + +LHE IYC+GA+S W+A+R++ E    F A  A  +G    FTGEMIFPWMF++   L P K+AAH+LA+K DW PLYD   L  NKVPVA AVY ED
Subjt:  YVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYED

Query:  MFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGL
        M+V F  + +T   ++  R W+TNE+ H+GLR  G Q+LD L+ L
Subjt:  MFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGL

Arabidopsis top hits (e value, % identity, alignment)
AT2G14260.1 proline iminopeptidase (e value: 9.1e-05; % identity: 29.17)
Query:  LLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        ++FL GGPG   A P+    +     E +R++L DQRG G STP                  L+     ++VND E +R  L +P+   W + G S+G  
Subjt:  LLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT2G14260.2 proline iminopeptidase (e value: 9.1e-05; % identity: 29.17)
Query:  LLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF
        ++FL GGPG   A P+    +     E +R++L DQRG G STP                  L+     ++VND E +R  L +P+   W + G S+G  
Subjt:  LLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRL-VPDAAPWTILGQSYGGF

Query:  CAVTYLSFAPQGLKQVLITG
         A+ Y    P  +  +++ G
Subjt:  CAVTYLSFAPQGLKQVLITG

AT3G61540.1 alpha/beta-Hydrolases superfamily protein (e value: 1.2e-209; % identity: 73.63)
Query:  CRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWI
        CR    ++ A +  +     S   HV+G W+SVPELRLRDH F VPL+YS    S  +I+VFARE+V+VGKE+Q MPYLL+LQGGPGFE  RP+EASGWI
Subjt:  CRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEASGWI

Query:  QKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIP
        Q+ACEEFRV+L+DQRGTGLSTPLT SSM QF+S+ +LA+YL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGIP
Subjt:  QKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIP

Query:  PIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPK
        PIG  CTAD VY A FE+V  QNEKYYKR+PQDIEIVRE+V YLAE+ GGGV LPSGGILTPKGLQTLGLS LG+STGFERLHY+ ERVWDPILV G+PK
Subjt:  PIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAEN-GGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPK

Query:  RISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPL
         IS FFLNA ++W S D+NPLY LLHE IYC+GASS WSA R++++ E KFDA KAVKE   V FTGEMIFPWMFDEIHAL+PFK AA +LA KEDWPPL
Subjt:  RISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPL

Query:  YDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
        YD+  L+NNKVPVAAAVYYEDM+VNFKL  +TAS I+GIRLWVTNEFMHSGLRDAG Q++DHL+G++NGKKPLF
Subjt:  YDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
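Each alignment block above carries a middle line between Query and Subjt. In BLASTP-style output a letter on that line marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below shows how a percent identity could be tallied from such a midline. The function name midline_identity is hypothetical, the example midline is a toy and not taken from this record, and the %identity values listed for the hits may have been computed differently (for example over several aligned segments).

```python
def midline_identity(midline, alignment_length=None):
    """Return (identities, positives, percent_identity) for one midline.

    alignment_length can be passed explicitly when trailing spaces of the
    midline may have been stripped during text extraction.
    """
    length = len(midline) if alignment_length is None else alignment_length
    if length == 0:
        raise ValueError("empty alignment")
    # Letters are identical residues; '+' adds conservative substitutions.
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, round(100.0 * identities / length, 2)


# Hypothetical toy alignment (not from the record):
#   query    MKT-AYIAK
#   midline  MK+ AY+AK
#   subject  MKSWAYLAK
print(midline_identity("MK+ AY+AK"))
```

For a hit split over several blocks, the identity counts and lengths of all blocks would be summed before taking the ratio.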


Sequences
CDS sequence
ATGTTCGCAGCTCGCACGGCAGCGCCGCCACTCCTTCTTCACTTCCACTCTTTACCTTGCCGCGTCCTTCCATTAATTCCACTCCGCAACTTTCTCTCAGCCGCTCATTG
CCGGAGATCGGTCCGTTTATCGGCAGCTATGGCCGGAATTTTATCCCCTCGTGCAGCATCGCCACCAGTGCACGTGTCTGGCACGTGGTACTCCGTTCCGGAGCTCCGTC
TCCGGGACCATCACTTCTCTGTGCCTCTCAATTACTCTCTAAATCAGGCTTCTTGTACTAGGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGTGGGAAAAGAGGATCAA
CCAATGCCATACCTTTTGTTCTTACAAGGTGGACCCGGATTTGAGTGTGCACGACCAACGGAAGCTAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTGTTATACT
GATGGATCAGCGAGGAACAGGATTATCAACTCCTTTAACTCCATCATCCATGTCACAATTTCAAAGTTCAGATGACTTAGCCAACTACTTGAAACATTTTCGAGCTGATA
ACATAGTTAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTACGGCGGTTTTTGTGCAGTTACGTATTTGAGT
TTTGCACCACAAGGATTGAAGCAAGTTCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAAT
TCAAAATGAAAAGTACTACAAGAGGTATCCACAGGATATTGAAATTGTCCGTGAAGTTGTGAAATACTTGGCCGAGAATGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTA
TCTTGACACCCAAAGGACTGCAAACTCTTGGTCTTTCTGCTCTGGGAACTAGTACAGGTTTTGAGCGCTTGCACTATCTGTTTGAGAGAGTTTGGGATCCTATATTAGTT
CGTGGATCACCAAAACGAATTAGTTTTTTCTTCCTCAATGCAATTGATAACTGGCTCTCACTTGATTCAAATCCTCTATACGTTCTCTTGCACGAAACAATATATTGCCA
GGGAGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAATTCGATGCAAATAAAGCTGTAAAAGAAGGATGTGCCGTGTATTTCACTGGAGAGA
TGATCTTCCCATGGATGTTTGACGAAATACATGCCTTGAGACCATTCAAAGATGCAGCGCATATATTAGCTGATAAGGAGGATTGGCCTCCTCTATATGACATCGCTGCT
CTTAAAAATAACAAGGTTCCGGTGGCAGCTGCAGTGTATTACGAAGATATGTTTGTGAACTTCAAACTGGCCATGGACACAGCTTCCCAAATAGCAGGAATAAGGTTATG
GGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCCCAAGTTCTTGATCACTTGATGGGATTATTAAATGGAAAAAAGCCTTTATTTTGA
mRNA sequence
ATGTTCGCAGCTCGCACGGCAGCGCCGCCACTCCTTCTTCACTTCCACTCTTTACCTTGCCGCGTCCTTCCATTAATTCCACTCCGCAACTTTCTCTCAGCCGCTCATTG
CCGGAGATCGGTCCGTTTATCGGCAGCTATGGCCGGAATTTTATCCCCTCGTGCAGCATCGCCACCAGTGCACGTGTCTGGCACGTGGTACTCCGTTCCGGAGCTCCGTC
TCCGGGACCATCACTTCTCTGTGCCTCTCAATTACTCTCTAAATCAGGCTTCTTGTACTAGGATCTCCGTTTTTGCGCGGGAAGTTGTTTCAGTGGGAAAAGAGGATCAA
CCAATGCCATACCTTTTGTTCTTACAAGGTGGACCCGGATTTGAGTGTGCACGACCAACGGAAGCTAGTGGATGGATACAAAAAGCATGTGAAGAATTTCGTGTTATACT
GATGGATCAGCGAGGAACAGGATTATCAACTCCTTTAACTCCATCATCCATGTCACAATTTCAAAGTTCAGATGACTTAGCCAACTACTTGAAACATTTTCGAGCTGATA
ACATAGTTAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTACGGCGGTTTTTGTGCAGTTACGTATTTGAGT
TTTGCACCACAAGGATTGAAGCAAGTTCTCATAACTGGAGGAATCCCTCCAATTGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAAT
TCAAAATGAAAAGTACTACAAGAGGTATCCACAGGATATTGAAATTGTCCGTGAAGTTGTGAAATACTTGGCCGAGAATGGAGGCGGGGTTCTTCTTCCCTCTGGTGGTA
TCTTGACACCCAAAGGACTGCAAACTCTTGGTCTTTCTGCTCTGGGAACTAGTACAGGTTTTGAGCGCTTGCACTATCTGTTTGAGAGAGTTTGGGATCCTATATTAGTT
CGTGGATCACCAAAACGAATTAGTTTTTTCTTCCTCAATGCAATTGATAACTGGCTCTCACTTGATTCAAATCCTCTATACGTTCTCTTGCACGAAACAATATATTGCCA
GGGAGCCTCATCTCGTTGGTCTGCTCAAAGAATAAAGAATGAAGTGGAAAACAAATTCGATGCAAATAAAGCTGTAAAAGAAGGATGTGCCGTGTATTTCACTGGAGAGA
TGATCTTCCCATGGATGTTTGACGAAATACATGCCTTGAGACCATTCAAAGATGCAGCGCATATATTAGCTGATAAGGAGGATTGGCCTCCTCTATATGACATCGCTGCT
CTTAAAAATAACAAGGTTCCGGTGGCAGCTGCAGTGTATTACGAAGATATGTTTGTGAACTTCAAACTGGCCATGGACACAGCTTCCCAAATAGCAGGAATAAGGTTATG
GGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGCAGGGCCCCAAGTTCTTGATCACTTGATGGGATTATTAAATGGAAAAAAGCCTTTATTTTGA
Protein sequence
MFAARTAAPPLLLHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQ
PMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLS
FAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILV
RGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDWPPLYDIAA
LKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF
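
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal Python sketch, assuming the standard genetic code and a CDS that starts in frame. The names CODON_TABLE and translate are illustrative and not part of any CuGenDB tool, and the two inputs are short excerpts from the start and end of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 24 nucleotides of the CDS listed above.
print(translate("ATGTTCGCAGCTCGCACGGCA"))
print(translate("AATGGAAAAAAGCCTTTATTTTGA"))
```

The two excerpts translate to the first seven and last seven residues of the protein sequence. Passing the full CDS, with its line breaks, to translate would allow the whole protein to be compared in the same way.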