; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022399 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022399
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionlysosomal Pro-X carboxypeptidase
Genome locationtig00154131:97135..107550
RNA-Seq ExpressionSgr022399
SyntenySgr022399
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004185 - serine-type carboxypeptidase activity (molecular function)
GO:0008239 - dipeptidyl-peptidase activity (molecular function)
InterPro domainsIPR008758 - Peptidase S28
IPR010471 - Protein of unknown function DUF1068
IPR029058 - Alpha/Beta hydrolase fold
IPR042269 - Serine carboxypeptidase S28, SKS domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAE6006103.1 unnamed protein product [Arabidopsis arenosa]1.4e-24366Show/hide
Query:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS
        K+  + + ++P++TRY+PQ LDHF+FTP+S   F+QKYLIN + WR G PIFVYTGNEGDI+WFA+NTGF+LDIAPKF ALL    HRFYGES PFGK S
Subjt:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS

Query:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN
        + SAETLGYL SQQALADYA+LIRSLKQNLSSEASPVVVFGG        MLAAWFRLKYPHITIGALASSAPILHFDNIVP +SFYDA+SQDFKDAS N
Subjt:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN

Query:  CYEVIKGSWAELQQAFS-EEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYY
        C++VIK SW EL+   + + GL ELS+ FRTCK LHS  S RDWL  AFVYT MVNYPT ANFM PLP YPV++MCKIIDGF   +  LD+ FAAASLYY
Subjt:  CYEVIKGSWAELQQAFS-EEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYY

Query:  NYSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPW
        NYS  +KCF +E     HGL GW +QACTEMVMPM+CSN+SM PP +  YE F + CM  YGV PRPHWITTEFGGKRIE VLKRFGSNIIFSNGM+DPW
Subjt:  NYSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPW

Query:  SRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDKISLPHFLSNAETERKTGENKKQRGEKEKGRSGFRLA
        SRGGVL NIS+SIVA+VT+KGAHH D R+ATKDDP+WL +QR+QEV II +WI                                               
Subjt:  SRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDKISLPHFLSNAETERKTGENKKQRGEKEKGRSGFRLA

Query:  VCGPALYWRFKKAL-QLGDYKTSCAPC-ICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNVTLSEAKRA
                RF++ +  L    + C PC ICDCPPPLSLL+IAPGLANLS+T CGS+DP+LK+EMEKQFVDLLTEELKLQEAV+ EH+ HMNVTL+EAKR 
Subjt:  VCGPALYWRFKKAL-QLGDYKTSCAPC-ICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNVTLSEAKRA

Query:  ASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG
        ASQYQ+EAEKC AATE CE  RERA+AL++KERK+T LWERRA Q+GWEG
Subjt:  ASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG

CAE6166892.1 unnamed protein product [Arabidopsis arenosa]1.4e-24866.17Show/hide
Query:  GASFPSLSCFPNKQGVRLRPKIP--YKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----
        G S+  LS   N +  + + ++P  ++TRY PQ LDHF+F P+S + FYQKYLI+   WR G PIFVYTGNEGDIEWFA+NTGF+LDIAPKF ALL    
Subjt:  GASFPSLSCFPNKQGVRLRPKIP--YKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----

Query:  HRFYGESKPFGKDSYSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSF
        HRFYGESKP     ++ A+T+GYL SQQALADYA+LIRSLKQNLSSEASPVVVFGG        MLAAWFRLKYPHITIGALASSAPIL FD IVP SSF
Subjt:  HRFYGESKPFGKDSYSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSF

Query:  YDAVSQDFKDASFNCYEVIKGSWAELQQAFS--EEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPE
        YD VSQDFKDAS NC+EVIK SW EL + FS  ++GL ELS+ F TCK+LH+V     WL +A+  T MVNYPT ANFM PLPAYPV+EMCKIID F  E
Subjt:  YDAVSQDFKDASFNCYEVIKGSWAELQQAFS--EEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPE

Query:  TGKLDKVFAAASLYYNYSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKR
           LD+ FAAASLYYNYS  E CF++EN    HGL+GW WQACTEMVMP++CSN+SMF P E   + + +DC+K YGV PRPHWITTEFGG RIE VLKR
Subjt:  TGKLDKVFAAASLYYNYSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKR

Query:  FGSNIIFSNGMKDPWSRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK----ISLPHF---LSNAETER
        FGSNIIFSNGM+DPWSR GVL NIS+SI+A+VT+KGAHH D R+ATKDDP+WL +QR+QEV  I +WI EYY+DL+Q++    +   H     S++++  
Subjt:  FGSNIIFSNGMKDPWSRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK----ISLPHF---LSNAETER

Query:  KTGENKKQR-GEKEKGRSGF----RLAVCGPALYWRFKKALQLGDYKTS--CAPCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLT
         + E   +R G+  +    F     L VCGPALYW+F K   +G  +T+  C PC+CDCPPPLSLL+IAPGLANLS+TDCGS+DP+LKQEMEKQFVDLLT
Subjt:  KTGENKKQR-GEKEKGRSGF----RLAVCGPALYWRFKKALQLGDYKTS--CAPCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLT

Query:  EELKLQEAVSGEHTRHMNVTLSEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG
        EELKLQEAV+ EH+RHMNVTL+EAKR ASQYQ+EAEKC AATE CE ARERAEAL+IKERK+TSLWE+RARQ GWEG
Subjt:  EELKLQEAVSGEHTRHMNVTLSEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG

XP_008457347.1 PREDICTED: lysosomal Pro-X carboxypeptidase [Cucumis melo]1.5e-23986.83Show/hide
Query:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS
        KQ   L+PKI ++TR+YPQLLDHFTFTPKSSK FYQKYLIN++ WRNGAPIFVYTGNEGDIEWF ANTGFL DIAPKFHALL    HRFYGES PFG DS
Subjt:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS

Query:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN
        Y+SAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVV FGG        MLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDAS N
Subjt:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN

Query:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN
        C++VIK SW EL+Q FSEEGLAELS+TFRTCKNLHSVSSVRDWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIID FAPET KLDKVFAAASLYYN
Subjt:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN

Query:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS
        YSHGEKCFN+ENGP +HGLSGW+WQACTEMVMPMTCSN+SMFPPSEF YEEFA DC K YGVSPRPHWITTEFGG+RIE+VLKRFGSN+IFSNGM+DPWS
Subjt:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS

Query:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK
        RGGVL NISTSI+AIVTEKGAHHVDFRSATKDDPDWLV+QRKQEVEII QWI+EYYAD+KQDK
Subjt:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK

XP_022142979.1 lysosomal Pro-X carboxypeptidase [Momordica charantia]2.8e-24990.69Show/hide
Query:  QGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSY
        Q  RL+PKIPY+TRYYPQLLDHFTFTP+SSK FYQKYLIN Q WRNGAPIFVYTGNEGDI+WFAANTGFLLDIAPKFHALL    HRFYGESKPFG DSY
Subjt:  QGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSY

Query:  SSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNC
        SSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGG        MLAAWFRLKYPHITIGALASSAPILHFDNI+PRSSFYDAVSQDFKDAS NC
Subjt:  SSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNC

Query:  YEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNY
        YEVIKGSWAEL+QAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMV+YPTEANFMRPLPAYPVQEMCKIID FAPET KLDKVFAAASLYYNY
Subjt:  YEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNY

Query:  SHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSR
        SHGEKCFNLENGP +HGLSGWNWQACTEMVMPM CSNESMFPPSEFHY+EFA DC K YGVSPRPHWITTEFGG+RIEQVLKRFGSNIIFSNGMKDPWSR
Subjt:  SHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSR

Query:  GGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK
        GGVLTNIST+IV IVTEKGAHHVDFRSATKDDPDWLV+QR+QEVEII QWI+EYYAD+KQDK
Subjt:  GGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK

XP_038889517.1 lysosomal Pro-X carboxypeptidase isoform X1 [Benincasa hispida]9.0e-24889.85Show/hide
Query:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS
        KQ   L+PKIP++TR+YPQLLDHFTFTPKSSKRFYQKYLIN+Q WRNGAPIFVYTGNEGDIEWFAANTGFL DIAPKFHALL    HRFYGESKPFG DS
Subjt:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS

Query:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN
        Y+SAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGG        MLAAWFR+KYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDAS N
Subjt:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN

Query:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN
        CYEVIKGSWAELQQAF+EEGLAELS+TFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYP+QEMCKIID FAPET KLDKVFAAASLYYN
Subjt:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN

Query:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS
        YSHGEKCFNLENGP +HGLSGWNWQACTEMVMPMTCSNESMFPPSEF YEEFA DC K YGVSPRPHWITTEFGG+RIE+VLKRFGSNIIFSNGM+DPWS
Subjt:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS

Query:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK
        RGGVL NISTSIVAIVTEKGAHHVDFRSATKDDPDWLV+QR+QEVEII QWI+ YYAD+KQDK
Subjt:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK

TrEMBL top hitse value%identityAlignment
A0A0A0LJ25 Uncharacterized protein7.5e-24086.61Show/hide
Query:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS
        KQ   L+PKI ++TR+YPQLLDHFTFTPKSSK FYQKYLIN++ WRNGAPIFVYTGNEGDIEWFAANTGFL DIAP+FHALL    HRFYGES PFG DS
Subjt:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS

Query:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN
        Y+SAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGG        MLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDAS N
Subjt:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN

Query:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN
        C+EVIKGSW ELQQ FSEEGLAELS+TFRTCKNLHSVSSV+DWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIID FAPET KL+K FAAASLYYN
Subjt:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN

Query:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS
        YSHGEKCFN+ENGP +HGLSGWNWQACTEMVMPMTCSN+SMFPPS+F YEEFA DC K YGVSPRPHWITTE+GG+RIE+VLKRFGSNIIFSNGM+DPWS
Subjt:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS

Query:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK
        RGGVL NISTSIVA+VTEKGAHHVDFRSATKDDPDWLV+QR+QEVEII QWI+E+YAD+KQDK
Subjt:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK

A0A1S3C5Z3 lysosomal Pro-X carboxypeptidase7.5e-24086.83Show/hide
Query:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS
        KQ   L+PKI ++TR+YPQLLDHFTFTPKSSK FYQKYLIN++ WRNGAPIFVYTGNEGDIEWF ANTGFL DIAPKFHALL    HRFYGES PFG DS
Subjt:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS

Query:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN
        Y+SAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVV FGG        MLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDAS N
Subjt:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN

Query:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN
        C++VIK SW EL+Q FSEEGLAELS+TFRTCKNLHSVSSVRDWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIID FAPET KLDKVFAAASLYYN
Subjt:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN

Query:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS
        YSHGEKCFN+ENGP +HGLSGW+WQACTEMVMPMTCSN+SMFPPSEF YEEFA DC K YGVSPRPHWITTEFGG+RIE+VLKRFGSN+IFSNGM+DPWS
Subjt:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS

Query:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK
        RGGVL NISTSI+AIVTEKGAHHVDFRSATKDDPDWLV+QRKQEVEII QWI+EYYAD+KQDK
Subjt:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK

A0A3Q7H2H9 Uncharacterized protein2.6e-23264.66Show/hide
Query:  RPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWR--NGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSYSSA
        + KIP+KT Y+PQ+LDHFTF PKS K FYQKYLINDQ W    G PIFVYTGNEG+I+WFAANTGF++DI P F+ALL    HRFYG+S PFG +SY SA
Subjt:  RPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWR--NGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSYSSA

Query:  ETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEV
        +TLGYL SQQALADYAVLIRSLKQNLSS++SPVVVFGG        MLAAWFRLKYPHI IGA+ASSAPIL F+ I P SSFYDAVSQDFKDAS NCY+V
Subjt:  ETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEV

Query:  IKGSWAELQQ-AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNYSH
        IKGSWAEL   +  ++GL ++S+ FRTCK L SV S RDWLW AFVYT MVNYPTEANFM PLPAYPV+EMCKIIDG      KL K FAAASLYYNY+ 
Subjt:  IKGSWAELQQ-AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNYSH

Query:  GEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSRGG
         EKCFNLE G   HGL GW+WQACTEMVMPMTCSNESMFPPS F Y+EF++DC K +GV PRPHWITTEFGG RIEQVLKRFGSN+IFSNGM+DPWSRGG
Subjt:  GEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSRGG

Query:  VLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDKISLPHFLSNAETERKTGENKKQRGEKEK-----GRSGFRL
        VL NIS+SIVA+VT+K              P+ L KQ+                                      G   +Q G   +           L
Subjt:  VLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDKISLPHFLSNAETERKTGENKKQRGEKEK-----GRSGFRL

Query:  AVCGPALYWRFKKALQLGDYKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNVTLSEAKRAA
         V GPA+YW+FKK         SC PC CDC PPLSLL++APGLANL++TDCG +DPDLK+EMEKQFVDLL+EELKLQEAV  EH  HMN+T  EA+R A
Subjt:  AVCGPALYWRFKKALQLGDYKTSCAPCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNVTLSEAKRAA

Query:  SQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE
        ++YQ+EAEKCIA TETCE  RERA  L  KE KLT+LWERRARQ GW+
Subjt:  SQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE

A0A5D3BEY7 Lysosomal Pro-X carboxypeptidase7.5e-24086.83Show/hide
Query:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS
        KQ   L+PKI ++TR+YPQLLDHFTFTPKSSK FYQKYLIN++ WRNGAPIFVYTGNEGDIEWF ANTGFL DIAPKFHALL    HRFYGES PFG DS
Subjt:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS

Query:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN
        Y+SAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVV FGG        MLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDAS N
Subjt:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN

Query:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN
        C++VIK SW EL+Q FSEEGLAELS+TFRTCKNLHSVSSVRDWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIID FAPET KLDKVFAAASLYYN
Subjt:  CYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN

Query:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS
        YSHGEKCFN+ENGP +HGLSGW+WQACTEMVMPMTCSN+SMFPPSEF YEEFA DC K YGVSPRPHWITTEFGG+RIE+VLKRFGSN+IFSNGM+DPWS
Subjt:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWS

Query:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK
        RGGVL NISTSI+AIVTEKGAHHVDFRSATKDDPDWLV+QRKQEVEII QWI+EYYAD+KQDK
Subjt:  RGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK

A0A6J1CN04 lysosomal Pro-X carboxypeptidase1.4e-24990.69Show/hide
Query:  QGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSY
        Q  RL+PKIPY+TRYYPQLLDHFTFTP+SSK FYQKYLIN Q WRNGAPIFVYTGNEGDI+WFAANTGFLLDIAPKFHALL    HRFYGESKPFG DSY
Subjt:  QGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSY

Query:  SSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNC
        SSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGG        MLAAWFRLKYPHITIGALASSAPILHFDNI+PRSSFYDAVSQDFKDAS NC
Subjt:  SSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNC

Query:  YEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNY
        YEVIKGSWAEL+QAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMV+YPTEANFMRPLPAYPVQEMCKIID FAPET KLDKVFAAASLYYNY
Subjt:  YEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNY

Query:  SHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSR
        SHGEKCFNLENGP +HGLSGWNWQACTEMVMPM CSNESMFPPSEFHY+EFA DC K YGVSPRPHWITTEFGG+RIEQVLKRFGSNIIFSNGMKDPWSR
Subjt:  SHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSR

Query:  GGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK
        GGVLTNIST+IV IVTEKGAHHVDFRSATKDDPDWLV+QR+QEVEII QWI+EYYAD+KQDK
Subjt:  GGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK

SwissProt top hitse value%identityAlignment
P42785 Lysosomal Pro-X carboxypeptidase2.8e-9842.07Show/hide
Query:  YKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW-RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSYSSAETLGYL
        Y   Y+ Q +DHF F   + K F Q+YL+ D+ W +NG  I  YTGNEGDI WF  NTGF+ D+A +  A+L    HR+YGES PFG +S+  +  L +L
Subjt:  YKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW-RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSYSSAETLGYL

Query:  TSQQALADYAVLIRSLKQNL-SSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKGSW
        TS+QALAD+A LI+ LK+ +  +E  PV+  GG        MLAAWFR+KYPH+ +GALA+SAPI  F+++VP   F   V+ DF+ +  +C E I  SW
Subjt:  TSQQALADYAVLIRSLKQNL-SSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKGSW

Query:  AELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DGFAPETGKLDKVFAAASLYYNYSHGE
          + + + +  GL  L+     C  L S  +  ++DW+   +V   MV+YP  +NF++PLPA+P++ +C+ + +    ++  L  +F A ++YYNYS   
Subjt:  AELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DGFAPETGKLDKVFAAASLYYNYSHGE

Query:  KCFNL-ENGPSIHGLSGWNWQACTEMVMPMTCSN--ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSRG
        KC N+ E   S  G  GW++QACTE+VMP  C+N  + MF P  ++ +E + DC + +GV PRP WITT +GGK I        +NI+FSNG  DPWS G
Subjt:  KCFNL-ENGPSIHGLSGWNWQACTEMVMPMTCSN--ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSRG

Query:  GVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYY
        GV  +I+ ++VA+   +GAHH+D R+    DP  ++  R  EV  ++ WI ++Y
Subjt:  GVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYY

Q2TA14 Lysosomal Pro-X carboxypeptidase6.8e-10543.83Show/hide
Query:  VGASFPSLSCFPNKQGVRLRPKI--PYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWR-NGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL--
        V  S  + S  P     R RP I   Y  RY  Q +DHF F     + F Q+YLI D  W+ +G  I  YTGNEGDI WF  NTGF+ DIA +  A+L  
Subjt:  VGASFPSLSCFPNKQGVRLRPKI--PYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWR-NGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL--

Query:  --HRFYGESKPFGKDSYSSAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPR
          HR+YGES PFG DS+S +  L +LT++QALAD+A LIR LK+ +  +    V+  GG        MLAAWFR+KYPH+ +GALASSAPI  F+++VP 
Subjt:  --HRFYGESKPFGKDSYSSAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPR

Query:  SSFYDAVSQDFKDASFNCYEVIKGSWAELQQ-AFSEEGLAELSRTFRTCKNL---HSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIID
          F   V+ DF  +  NC E I+ SW  + + A    GL  LS     C  L     V  ++DW+   +V   MV+YP E+NF++PLPA+PV+ +C+   
Subjt:  SSFYDAVSQDFKDASFNCYEVIKGSWAELQQ-AFSEEGLAELSRTFRTCKNL---HSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIID

Query:  -GFAPETGKLDKVFAAASLYYNYSHGEKCFNL-ENGPSIHGLSGWNWQACTEMVMPMTCSN--ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGG
            P+T  +  +F A ++YYNYS   KC N+ E   S  G+ GW++QACTEMVMP TCS+  + MF P  ++ +E++ DC K +GV PRP WI T +GG
Subjt:  -GFAPETGKLDKVFAAASLYYNYSHGEKCFNL-ENGPSIHGLSGWNWQACTEMVMPMTCSN--ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGG

Query:  KRIEQVLKRFGSNIIFSNGMKDPWSRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQ
        K I        +NIIFSNG  DPWS GGV  +I+ +++AIV   GAHH+D R++   DP  +   R  EV+ ++QWI ++Y  L++
Subjt:  KRIEQVLKRFGSNIIFSNGMKDPWSRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQ

Q5RBU7 Lysosomal Pro-X carboxypeptidase1.6e-9841.85Show/hide
Query:  YKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW-RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSYSSAETLGYL
        Y   Y+ Q +DHF F   + K F Q+YL+ D+ W +NG  I  YTGNEGDI WF  NTGF+ D+A +  A+L    HR+YGES PFG +++  +  L +L
Subjt:  YKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW-RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSYSSAETLGYL

Query:  TSQQALADYAVLIRSLKQNL-SSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKGSW
        TS+QALAD+A LI+ LK+ +  +E  PV+  GG        MLAAWFR+KYPH+ +GALA+SAPI  F+++VP   F   V+ DF+ +  +C E I+ SW
Subjt:  TSQQALADYAVLIRSLKQNL-SSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKGSW

Query:  AELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DGFAPETGKLDKVFAAASLYYNYSHGE
          + + + +  GL  L+     C  L S  +  ++DW+   +V   MV+YP  +NF++PLPA+P++ +C+ + +    ++  L  +F A ++YYNYS   
Subjt:  AELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DGFAPETGKLDKVFAAASLYYNYSHGE

Query:  KCFNL-ENGPSIHGLSGWNWQACTEMVMPMTCSN--ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSRG
        KC N+ E   S  G  GW++QACTE+VMP  C+N  + MF P  ++ +E + DC + +GV PRP WITT +GGK I        +NI+FSNG  DPWS G
Subjt:  KCFNL-ENGPSIHGLSGWNWQACTEMVMPMTCSN--ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSRG

Query:  GVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYY
        GV  +I+ ++VA+   +GAHH+D R+    DP  ++  R  EV  ++ WI ++Y
Subjt:  GVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYY

Q7TMR0 Lysosomal Pro-X carboxypeptidase8.3e-10342.56Show/hide
Query:  PSLSCFPNKQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW-RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYG
        P LS  P       R    Y   Y+ Q +DHF F     + F Q+YL+ D++W RNG  I  YTGNEGDI WF  NTGF+ D+A +  A+L    HR+YG
Subjt:  PSLSCFPNKQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW-RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYG

Query:  ESKPFGKDSYSSAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAV
        ES PFG+DS+  ++ L +LTS+QALAD+A LIR L++ +  ++  PV+  GG        MLAAWFR+KYPHI +GALA+SAPI   D +VP   F   V
Subjt:  ESKPFGKDSYSSAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAV

Query:  SQDFKDASFNCYEVIKGSWAELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DGFAPETG
        + DF+ +   C E I+ SW  + + + S  GL  L+     C  L S  + +++ W+   +V   MVNYP   NF++PLPA+P++E+C+ + +    +T 
Subjt:  SQDFKDASFNCYEVIKGSWAELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DGFAPETG

Query:  KLDKVFAAASLYYNYSHGEKCFNL-ENGPSIHGLSGWNWQACTEMVMPMTCSN--ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLK
         L  +F A S+YYNYS    C N+ +   S  G  GW++QACTEMVMP  C+N  + MF P  +  E+++ DC   +GV PRPHW+TT +GGK I     
Subjt:  KLDKVFAAASLYYNYSHGEKCFNL-ENGPSIHGLSGWNWQACTEMVMPMTCSN--ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLK

Query:  RFGSNIIFSNGMKDPWSRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLK
           SNIIFSNG  DPWS GGV  +I+ ++VAI    GAHH+D R+    DP  ++  R  EV+ +++WI ++Y++++
Subjt:  RFGSNIIFSNGMKDPWSRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLK

Q9EPB1 Dipeptidyl peptidase 21.2e-8037.5Show/hide
Query:  YKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNG-APIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSYSSAETLGYL
        ++  Y+ Q +DHF F   S+K F Q++L++D+ W+ G  PIF YTGNEGDI   A N+GF++++A +  ALL    HR+YG+S PFG  S     T   L
Subjt:  YKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNG-APIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDSYSSAETLGYL

Query:  TSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKGSWA
        T +QALAD+AVL+++L+ NL  + +P + FGG        ML+A+ R+KYPH+  GALA+SAP++    +     F+  V+ DF   S  C + ++ ++ 
Subjt:  TSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKGSWA

Query:  ELQQAFSEEGLAELSRTFRTCKNLHS---VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNYSHGEKC
        +++  F +     +S+ F TC++L S   ++ +  +  +AF    M++YP   NF+ PLPA PV+  C   +    E  ++  + A A L YN S  E C
Subjt:  ELQQAFSEEGLAELSRTFRTCKNLHS---VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNYSHGEKC

Query:  FNLEN------GPSIHGLS----GWNWQACTEMVMPMTCSN-ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMK
        F++         P+  G       W++QACTE+ +    +N   MFP   F  E   Q C+  +GV PRP W+ T F G  +     +  SNIIFSNG  
Subjt:  FNLEN------GPSIHGLS----GWNWQACTEMVMPMTCSN-ESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMK

Query:  DPWSRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWI
        DPW+ GG+  N+STSI+A+  + GAHH+D R++  +DP  +V+ RK E  +IR+W+
Subjt:  DPWSRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWI

Arabidopsis top hitse value%identityAlignment
AT2G24280.1 alpha/beta-Hydrolases superfamily protein1.6e-19771.34Show/hide
Query:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS
        K+  + + ++P++TRY+PQ LDHF+FTP S K F+QKYLIN++ WR G PIFVYTGNEGDI+WFA+NTGF+LDIAPKF ALL    HRFYGES PFGK S
Subjt:  KQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFGKDS

Query:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN
        + SAETLGYL SQQALADYA+LIRSLKQNLSSEASPVVVFGG        MLAAWFRLKYPHITIGALASSAPILHFDNIVP +SFYDA+SQDFKDAS N
Subjt:  YSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFN

Query:  CYEVIKGSWAELQQAFS-EEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYY
        C++VIK SW EL+   + + GL ELS+ FRTCK LHS  S RDWL  AFVYT MVNYPT ANFM PLP YPV++MCKIIDGF   +  LD+ FAAASLYY
Subjt:  CYEVIKGSWAELQQAFS-EEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYY

Query:  NYSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPW
        NYS  EKCF +E     HGL GW +QACTEMVMPM+CSN+SM PP E   E F + CM  YGV PRPHWITTEFGG RIE VLKRFGSNIIFSNGM+DPW
Subjt:  NYSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPW

Query:  SRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK
        SRGGVL NIS+SIVA+VT+KGAHH D R+ATKDDP+WL +QR+QEV II +WI EYY DL++++
Subjt:  SRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDK

AT4G30996.1 Protein of unknown function (DUF1068)2.7e-6478.43Show/hide
Query:  LAVCGPALYWRFKKALQLGDYKTS--CAPCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNVTLSEAK
        L VCGPALYW+F K   +G  + +  C PC+CDCPPPLSLL+IAPGLANLS+TDCGS+DP+LKQEMEKQFVDLLTEELKLQEAV+ EH+RHMNVTL+EAK
Subjt:  LAVCGPALYWRFKKALQLGDYKTS--CAPCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNVTLSEAK

Query:  RAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        R ASQYQ+EAEKC AATE CE ARERAEAL+IKERK+TSLWE+RARQ GWEGE
Subjt:  RAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

AT5G22860.1 Serine carboxypeptidase S28 family protein5.3e-9741.39Show/hide
Query:  KTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW---RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFG--KDSYSSAETL
        K  Y+ Q LDHFTFTP+S   F Q+Y I+  +W   +  API  + G E  ++   A  GFL D  P+ +ALL    HR+YGE+ PFG  +++  +A TL
Subjt:  KTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW---RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFG--KDSYSSAETL

Query:  GYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKG
        GYL + QALADYA ++  +K+  S+  SP++V GG        MLAAWFRLKYPHI +GALASSAP+L+F++  P+  +Y  V++ FK+AS  CY  I+ 
Subjt:  GYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKG

Query:  SWAELQQ-AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPET--GKLDKVFA--AASLYYNYS
        SW E+ + A    GL+ LS+ F+TC  L+    ++D+L +  +Y   V Y    NF        V ++C  I+   P      LD++FA   A +     
Subjt:  SWAELQQ-AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPET--GKLDKVFA--AASLYYNYS

Query:  HGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTC-SNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSR
        +  K F      +I     W WQ+C+E+VMP+     ++MFP + F+   +   C   +GV+PRPHWITT FG + ++ +L++FGSNIIFSNG+ DP+S 
Subjt:  HGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTC-SNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSR

Query:  GGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLK
        GGVL +IS ++VAI T+ G+H +D    +K+DP+WLV QR++E+++I  WI  Y  DL+
Subjt:  GGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLK

AT5G22860.2 Serine carboxypeptidase S28 family protein4.5e-8040.8Show/hide
Query:  KTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW---RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFG--KDSYSSAETL
        K  Y+ Q LDHFTFTP+S   F Q+Y I+  +W   +  API  + G E  ++   A  GFL D  P+ +ALL    HR+YGE+ PFG  +++  +A TL
Subjt:  KTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNW---RNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFG--KDSYSSAETL

Query:  GYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKG
        GYL + QALADYA ++  +K+  S+  SP++V GG        MLAAWFRLKYPHI +GALASSAP+L+F++  P+  +Y  V++ FK+AS  CY  I+ 
Subjt:  GYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNCYEVIKG

Query:  SWAELQQ-AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPET--GKLDKVFA--AASLYYNYS
        SW E+ + A    GL+ LS+ F+TC  L+    ++D+L +  +Y   V Y    NF        V ++C  I+   P      LD++FA   A +     
Subjt:  SWAELQQ-AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPET--GKLDKVFA--AASLYYNYS

Query:  HGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTC-SNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSR
        +  K F      +I     W WQ+C+E+VMP+     ++MFP + F+   +   C   +GV+PRPHWITT FG + ++ +L++FGSNIIFSNG+ DP+S 
Subjt:  HGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTC-SNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSR

Query:  GG
        GG
Subjt:  GG

AT5G65760.1 Serine carboxypeptidase S28 family protein9.3e-14252.68Show/hide
Query:  RPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGA---PIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFG--KDSY
        R +  Y+T+++ Q LDHF+F      +F Q+YLIN  +W   +   PIF+Y GNEGDIEWFA N+GF+ DIAPKF ALL    HR+YGES P+G  +++Y
Subjt:  RPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGA---PIFVYTGNEGDIEWFAANTGFLLDIAPKFHALL----HRFYGESKPFG--KDSY

Query:  SSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNC
         +A TL YLT++QALAD+AV +  LK+NLS+EA PVV+FGG        MLAAW RLKYPHI IGALASSAPIL F+++VP  +FYD  S DFK  S +C
Subjt:  SSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASFNC

Query:  YEVIKGSW-AELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN
        +  IK SW A + +   E GL +L++TF  C+ L+S   + DWL SA+ Y  MV+YP  A+FM PLP +P++E+C+ IDG       LD+++A  S+YYN
Subjt:  YEVIKGSW-AELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYN

Query:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNE-SMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPW
        Y+    CF L++ P  HGL GWNWQACTEMVMPM+ + E SMFP   F+Y  + ++C   + V+PRP W+TTEFGG  I   LK FGSNIIFSNG+ DPW
Subjt:  YSHGEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNE-SMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPW

Query:  SRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDKISL
        S G VL N+S +IVA+VT++GAHH+D R +T +DP WLV QR+ E+ +I+ WI+ Y  + K+ K+SL
Subjt:  SRGGVLTNISTSIVAIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDKISL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAACCCAGTAATCTCCAACCACAGGAATGATGGGCATAAAAGGAGGCACATAGAATTTGACTTTGAGCTTGGCTTCATCACTAGCAGGGTTGGCCTTAACAGCAG
TGCCTTCAATGAATCCTCTCTTGCCGTCGACCCAGGCGATTTCGTACCACCGGCCCATGTACCTCTTCAGTCGACGCCCTTCACGACTTCCATCTCTTTCTTCCCCATTG
ATGAGTTCCTGAGAGAGAGATCAGAGGCACTTTTGGGGCTCGGCGGTGATCTGGGGTGTGGATGGAGATTGTGGATTTGGGAGGAGGAGCGACGTGGAGAGAAATTATAT
TTGTGGGAGAGGAGTTTTTTGTGGTTTGGAAATCACAGGACCCGGAGTTGTGTTGTGGGGGCATCATTTCCATCTCTTTCTTGCTTTCCAAACAAGCAAGGTGTAAGATT
GAGGCCGAAGATCCCGTACAAGACCCGTTACTATCCTCAGCTGCTAGACCACTTCACCTTCACGCCAAAGAGTTCCAAAAGATTTTACCAGAAGTACCTGATTAATGACC
AGAACTGGCGCAATGGAGCTCCCATCTTCGTTTACACTGGCAATGAGGGAGACATCGAATGGTTTGCTGCCAATACTGGTTTCTTGCTCGACATTGCTCCCAAGTTCCAT
GCCCTTCTGCATAGATTTTATGGAGAATCGAAGCCATTTGGAAAGGACTCCTATAGCTCAGCAGAAACATTAGGTTACTTGACTTCACAACAAGCCTTGGCTGACTATGC
AGTTTTGATAAGAAGTTTGAAGCAGAACCTCTCTTCTGAGGCTTCCCCCGTAGTTGTCTTTGGTGGTCTTATGGAGGAAGTAAGTCAAAAAATGCTGGCAGCCTGGTTTA
GACTGAAATACCCACATATTACTATTGGAGCTTTGGCATCTTCAGCACCCATTTTACACTTTGATAACATCGTACCAAGGTCGAGCTTCTACGATGCTGTTTCCCAGGAT
TTCAAGGATGCTAGCTTCAATTGCTATGAAGTGATCAAAGGGAGTTGGGCAGAGCTACAGCAAGCATTTTCTGAGGAGGGGTTGGCTGAACTAAGCAGAACATTCAGAAC
TTGCAAGAACCTTCATTCAGTATCCTCGGTTCGAGACTGGTTGTGGTCAGCATTTGTCTACACTACGATGGTAAATTACCCGACTGAAGCCAATTTTATGAGGCCATTGC
CTGCCTATCCTGTACAAGAGATGTGTAAGATCATCGACGGATTTGCCCCAGAAACTGGCAAGCTTGACAAGGTTTTTGCTGCTGCCAGCTTGTATTACAATTACTCACAT
GGAGAGAAATGCTTTAACCTGGAAAACGGACCCAGTATTCACGGTCTTAGTGGTTGGAACTGGCAGGCTTGTACAGAGATGGTGATGCCGATGACTTGTTCCAACGAGAG
CATGTTCCCTCCAAGTGAGTTCCATTACGAAGAATTTGCACAAGATTGCATGAAGATATATGGAGTTTCACCTCGTCCGCATTGGATCACTACTGAATTTGGTGGCAAAA
GAATTGAGCAAGTGTTGAAAAGATTTGGCAGCAATATCATATTTTCTAATGGAATGAAAGATCCATGGAGCAGAGGAGGTGTGCTGACAAATATTTCGACCAGTATCGTT
GCCATCGTCACGGAGAAAGGAGCTCACCACGTCGATTTTCGCTCAGCAACGAAAGATGACCCCGACTGGCTAGTCAAGCAGAGGAAGCAAGAAGTGGAGATCATTCGTCA
ATGGATCGATGAGTATTATGCTGATCTGAAACAAGATAAAATCTCACTTCCTCACTTTCTCAGTAACGCGGAAACAGAGAGGAAAACAGGAGAGAACAAGAAGCAAAGGG
GGGAAAAAGAAAAGGGAAGATCAGGTTTCCGCTTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAAGCTTTGCAATTGGGAGATTACAAAACCTCGTGTGCT
CCTTGCATCTGCGATTGCCCGCCCCCATTGTCCCTTTTGAAGATTGCTCCTGGTCTGGCCAATCTCTCCGTCACAGACTGTGGGAGTAATGACCCAGATCTCAAGCAGGA
GATGGAAAAACAGTTTGTGGACCTTCTGACGGAGGAATTGAAGCTTCAAGAAGCAGTTTCTGGTGAACATACTCGGCATATGAATGTCACGTTATCCGAGGCAAAAAGGG
CAGCTTCTCAGTATCAGAGGGAGGCTGAGAAATGCATTGCTGCTACAGAAACTTGTGAAGAGGCCCGAGAACGTGCCGAGGCATTGATGATCAAGGAGAGAAAGCTAACA
TCATTGTGGGAGCGGCGAGCCCGCCAAATGGGCTGGGAAGGGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAAACCCAGTAATCTCCAACCACAGGAATGATGGGCATAAAAGGAGGCACATAGAATTTGACTTTGAGCTTGGCTTCATCACTAGCAGGGTTGGCCTTAACAGCAG
TGCCTTCAATGAATCCTCTCTTGCCGTCGACCCAGGCGATTTCGTACCACCGGCCCATGTACCTCTTCAGTCGACGCCCTTCACGACTTCCATCTCTTTCTTCCCCATTG
ATGAGTTCCTGAGAGAGAGATCAGAGGCACTTTTGGGGCTCGGCGGTGATCTGGGGTGTGGATGGAGATTGTGGATTTGGGAGGAGGAGCGACGTGGAGAGAAATTATAT
TTGTGGGAGAGGAGTTTTTTGTGGTTTGGAAATCACAGGACCCGGAGTTGTGTTGTGGGGGCATCATTTCCATCTCTTTCTTGCTTTCCAAACAAGCAAGGTGTAAGATT
GAGGCCGAAGATCCCGTACAAGACCCGTTACTATCCTCAGCTGCTAGACCACTTCACCTTCACGCCAAAGAGTTCCAAAAGATTTTACCAGAAGTACCTGATTAATGACC
AGAACTGGCGCAATGGAGCTCCCATCTTCGTTTACACTGGCAATGAGGGAGACATCGAATGGTTTGCTGCCAATACTGGTTTCTTGCTCGACATTGCTCCCAAGTTCCAT
GCCCTTCTGCATAGATTTTATGGAGAATCGAAGCCATTTGGAAAGGACTCCTATAGCTCAGCAGAAACATTAGGTTACTTGACTTCACAACAAGCCTTGGCTGACTATGC
AGTTTTGATAAGAAGTTTGAAGCAGAACCTCTCTTCTGAGGCTTCCCCCGTAGTTGTCTTTGGTGGTCTTATGGAGGAAGTAAGTCAAAAAATGCTGGCAGCCTGGTTTA
GACTGAAATACCCACATATTACTATTGGAGCTTTGGCATCTTCAGCACCCATTTTACACTTTGATAACATCGTACCAAGGTCGAGCTTCTACGATGCTGTTTCCCAGGAT
TTCAAGGATGCTAGCTTCAATTGCTATGAAGTGATCAAAGGGAGTTGGGCAGAGCTACAGCAAGCATTTTCTGAGGAGGGGTTGGCTGAACTAAGCAGAACATTCAGAAC
TTGCAAGAACCTTCATTCAGTATCCTCGGTTCGAGACTGGTTGTGGTCAGCATTTGTCTACACTACGATGGTAAATTACCCGACTGAAGCCAATTTTATGAGGCCATTGC
CTGCCTATCCTGTACAAGAGATGTGTAAGATCATCGACGGATTTGCCCCAGAAACTGGCAAGCTTGACAAGGTTTTTGCTGCTGCCAGCTTGTATTACAATTACTCACAT
GGAGAGAAATGCTTTAACCTGGAAAACGGACCCAGTATTCACGGTCTTAGTGGTTGGAACTGGCAGGCTTGTACAGAGATGGTGATGCCGATGACTTGTTCCAACGAGAG
CATGTTCCCTCCAAGTGAGTTCCATTACGAAGAATTTGCACAAGATTGCATGAAGATATATGGAGTTTCACCTCGTCCGCATTGGATCACTACTGAATTTGGTGGCAAAA
GAATTGAGCAAGTGTTGAAAAGATTTGGCAGCAATATCATATTTTCTAATGGAATGAAAGATCCATGGAGCAGAGGAGGTGTGCTGACAAATATTTCGACCAGTATCGTT
GCCATCGTCACGGAGAAAGGAGCTCACCACGTCGATTTTCGCTCAGCAACGAAAGATGACCCCGACTGGCTAGTCAAGCAGAGGAAGCAAGAAGTGGAGATCATTCGTCA
ATGGATCGATGAGTATTATGCTGATCTGAAACAAGATAAAATCTCACTTCCTCACTTTCTCAGTAACGCGGAAACAGAGAGGAAAACAGGAGAGAACAAGAAGCAAAGGG
GGGAAAAAGAAAAGGGAAGATCAGGTTTCCGCTTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAAGCTTTGCAATTGGGAGATTACAAAACCTCGTGTGCT
CCTTGCATCTGCGATTGCCCGCCCCCATTGTCCCTTTTGAAGATTGCTCCTGGTCTGGCCAATCTCTCCGTCACAGACTGTGGGAGTAATGACCCAGATCTCAAGCAGGA
GATGGAAAAACAGTTTGTGGACCTTCTGACGGAGGAATTGAAGCTTCAAGAAGCAGTTTCTGGTGAACATACTCGGCATATGAATGTCACGTTATCCGAGGCAAAAAGGG
CAGCTTCTCAGTATCAGAGGGAGGCTGAGAAATGCATTGCTGCTACAGAAACTTGTGAAGAGGCCCGAGAACGTGCCGAGGCATTGATGATCAAGGAGAGAAAGCTAACA
TCATTGTGGGAGCGGCGAGCCCGCCAAATGGGCTGGGAAGGGGAATAA
Protein sequenceShow/hide protein sequence
MQNPVISNHRNDGHKRRHIEFDFELGFITSRVGLNSSAFNESSLAVDPGDFVPPAHVPLQSTPFTTSISFFPIDEFLRERSEALLGLGGDLGCGWRLWIWEEERRGEKLY
LWERSFLWFGNHRTRSCVVGASFPSLSCFPNKQGVRLRPKIPYKTRYYPQLLDHFTFTPKSSKRFYQKYLINDQNWRNGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFH
ALLHRFYGESKPFGKDSYSSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGLMEEVSQKMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQD
FKDASFNCYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDGFAPETGKLDKVFAAASLYYNYSH
GEKCFNLENGPSIHGLSGWNWQACTEMVMPMTCSNESMFPPSEFHYEEFAQDCMKIYGVSPRPHWITTEFGGKRIEQVLKRFGSNIIFSNGMKDPWSRGGVLTNISTSIV
AIVTEKGAHHVDFRSATKDDPDWLVKQRKQEVEIIRQWIDEYYADLKQDKISLPHFLSNAETERKTGENKKQRGEKEKGRSGFRLAVCGPALYWRFKKALQLGDYKTSCA
PCICDCPPPLSLLKIAPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRHMNVTLSEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLT
SLWERRARQMGWEGE