CuGenDBv2

CcUC11G211800 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC11G211800
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: lysosomal Pro-X carboxypeptidase
Genome location: CicolChr11:3796264..3809184
RNA-Seq Expression: CcUC11G211800
Synteny: CcUC11G211800
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0035098 - ESC/E(Z) complex (cellular component)
GO:0035102 - PRC1 complex (cellular component)
GO:0003682 - chromatin binding (molecular function)
GO:0004185 - serine-type carboxypeptidase activity (molecular function)
GO:0008239 - dipeptidyl-peptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR008758 - Peptidase S28
IPR027806 - Harbinger transposase-derived nuclease domain
IPR029058 - Alpha/Beta hydrolase fold
IPR042269 - Serine carboxypeptidase S28, SKS domain


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004139280.1 lysosomal Pro-X carboxypeptidase [Cucumis sativus] (e value: 6.6e-256; % identity: 91.77)
Query:  NLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGD
        +L    + KPKI FETRFYPQLLDHFTFTPKSSK FYQKYLINE+YWRNGAPIF+YTGNEGDIEWFAANTGFLPDIAP+FHALL    HRFYGES PFG+
Subjt:  NLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGD

Query:  DSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCY
        DSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDASLNC+
Subjt:  DSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCY

Query:  EVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYS
        EVIKGSW ELQQ FSEEGLAELS+TFRTCKNLHSVSSV+DWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKL+K FAAASLYYNYS
Subjt:  EVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYS

Query:  HGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRG
        HGEKCFN+ENGPD+HGLSGWNWQACTEMVMPMTCSN+SMFPPS+FDYEEFATDCKKKYGVSPR HWITTE+GGERIE+VLKRFGSNIIFSNGMQDPWSRG
Subjt:  HGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRG

Query:  GVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK
        GVLRNISTSIVA+VTEKGAHHVDFRSAT DDPDWLV+QRRQEVEIIHQWINE+YADMKQDKK
Subjt:  GVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK

XP_004139403.1 protein ALP1-like [Cucumis sativus] (e value: 4.3e-231; % identity: 91.69)
Query:  MATRGLGGEKRTTRSSAMNAPAAT-TRSKTKKFDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP
        MATRGL G+KRTTRSSAMNA AA  TRSK KK D+ENHLNHQL+TLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LPPP 
Subjt:  MATRGLGGEKRTTRSSAMNAPAAT-TRSKTKKFDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP

Query:  PPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIPSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEK
        PPPR CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPI SS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAINEK
Subjt:  PPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIPSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIE S+ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNL

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL
        D+EKPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL

Query:  DEEQDQEEGASCSGEEQKFPLYDGEIGDNRGKDIRDTLASHLSSL
        DEEQDQEEGASCS EEQKFPL+DGEIGD RGKDIRD LA HLSSL
Subjt:  DEEQDQEEGASCSGEEQKFPLYDGEIGDNRGKDIRDTLASHLSSL

XP_008457347.1 PREDICTED: lysosomal Pro-X carboxypeptidase [Cucumis melo] (e value: 2.5e-255; % identity: 92.95)
Query:  KPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAET
        KPKI FETRFYPQLLDHFTFTPKSSK FYQKYLINE+YWRNGAPIF+YTGNEGDIEWF ANTGFLPDIAPKFHALL    HRFYGES PFG+DSYNSAET
Subjt:  KPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAET

Query:  LGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWA
        LGYLTSQQALADYAVLIRSLKQNLSSEASPVV FGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDASLNC++VIK SW 
Subjt:  LGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWA

Query:  ELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNL
        EL+Q FSEEGLAELS+TFRTCKNLHSVSSVRDWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFN+
Subjt:  ELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNL

Query:  ENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNIST
        ENGPD+HGLSGW+WQACTEMVMPMTCSN+SMFPPSEFDYEEFATDCKKKYGVSPR HWITTEFGGERIE+VLKRFGSN+IFSNGMQDPWSRGGVLRNIST
Subjt:  ENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNIST

Query:  SIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK
        SI+AIVTEKGAHHVDFRSAT DDPDWLV+QR+QEVEIIHQWINEYYADMKQDKK
Subjt:  SIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK

XP_022142979.1 lysosomal Pro-X carboxypeptidase [Momordica charantia] (e value: 8.7e-256; % identity: 89.85)
Query:  FTRYKPNWGNLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRF
        F +    + +L      KPKIP+ETR+YPQLLDHFTFTP+SSK FYQKYLIN QYWRNGAPIF+YTGNEGDI+WFAANTGFL DIAPKFHALL    HRF
Subjt:  FTRYKPNWGNLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRF

Query:  YGESKPFGDDSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQD
        YGESKPFG+DSY+SAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNI+PRSSFYDAVSQD
Subjt:  YGESKPFGDDSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQD

Query:  FKDASLNCYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFA
        FKDASLNCYEVIKGSWAEL+QAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMV+YPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFA
Subjt:  FKDASLNCYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFA

Query:  AASLYYNYSHGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSN
        AASLYYNYSHGEKCFNLENGPD+HGLSGWNWQACTEMVMPM CSN+SMFPPSEF Y+EFA DC+KKYGVSPR HWITTEFGGERIEQVLKRFGSNIIFSN
Subjt:  AASLYYNYSHGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSN

Query:  GMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI
        GM+DPWSRGGVL NIST+IV IVTEKGAHHVDFRSAT DDPDWLV+QRRQEVEIIHQWINEYYAD+KQDKK I
Subjt:  GMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI

XP_038889517.1 lysosomal Pro-X carboxypeptidase isoform X1 [Benincasa hispida] (e value: 8.7e-264; % identity: 92.83)
Query:  IFTRYKPNWGNLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HR
        +F R      +L    + KPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIF+YTGNEGDIEWFAANTGFLPDIAPKFHALL    HR
Subjt:  IFTRYKPNWGNLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HR

Query:  FYGESKPFGDDSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQ
        FYGESKPFG DSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFR+KYPHITIGALASSAPILHFDNIVPRSSFYDAVSQ
Subjt:  FYGESKPFGDDSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQ

Query:  DFKDASLNCYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVF
        DFKDASLNCYEVIKGSWAELQQAF+EEGLAELS+TFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYP+QEMCKIIDAFAPETSKLDKVF
Subjt:  DFKDASLNCYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVF

Query:  AAASLYYNYSHGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFS
        AAASLYYNYSHGEKCFNLENGPD+HGLSGWNWQACTEMVMPMTCSN+SMFPPSEFDYEEFATDC+KKYGVSPR HWITTEFGGERIE+VLKRFGSNIIFS
Subjt:  AAASLYYNYSHGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFS

Query:  NGMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI
        NGMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSAT DDPDWLV+QRRQEVEIIHQWIN YYADMKQDKK I
Subjt:  NGMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LFB5 DDE Tnp4 domain-containing protein (e value: 2.1e-231; % identity: 91.69)
Query:  MATRGLGGEKRTTRSSAMNAPAAT-TRSKTKKFDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP
        MATRGL G+KRTTRSSAMNA AA  TRSK KK D+ENHLNHQL+TLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSL+ALSPRLPK+ LPPP 
Subjt:  MATRGLGGEKRTTRSSAMNAPAAT-TRSKTKKFDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPP

Query:  PPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIPSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEK
        PPPR CWFQRFLSATS+VDCDPRWNLSFRMSKSSFSLLLRLLSPI SS SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADAC SFYAVCKAINEK
Subjt:  PPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIPSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACHSFYAVCKAINEK

Query:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNL
        LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIE S+ELLKGPVYNL
Subjt:  LGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNL

Query:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL
        D+EKPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALVNTAFC LRARWKLLSKPWKEGCRD+FPFI+LTGCLL NFLIKCSEKL
Subjt:  DDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKL

Query:  DEEQDQEEGASCSGEEQKFPLYDGEIGDNRGKDIRDTLASHLSSL
        DEEQDQEEGASCS EEQKFPL+DGEIGD RGKDIRD LA HLSSL
Subjt:  DEEQDQEEGASCSGEEQKFPLYDGEIGDNRGKDIRDTLASHLSSL

A0A0A0LJ25 Uncharacterized protein (e value: 3.2e-256; % identity: 91.77)
Query:  NLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGD
        +L    + KPKI FETRFYPQLLDHFTFTPKSSK FYQKYLINE+YWRNGAPIF+YTGNEGDIEWFAANTGFLPDIAP+FHALL    HRFYGES PFG+
Subjt:  NLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGD

Query:  DSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCY
        DSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDASLNC+
Subjt:  DSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCY

Query:  EVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYS
        EVIKGSW ELQQ FSEEGLAELS+TFRTCKNLHSVSSV+DWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKL+K FAAASLYYNYS
Subjt:  EVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYS

Query:  HGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRG
        HGEKCFN+ENGPD+HGLSGWNWQACTEMVMPMTCSN+SMFPPS+FDYEEFATDCKKKYGVSPR HWITTE+GGERIE+VLKRFGSNIIFSNGMQDPWSRG
Subjt:  HGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRG

Query:  GVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK
        GVLRNISTSIVA+VTEKGAHHVDFRSAT DDPDWLV+QRRQEVEIIHQWINE+YADMKQDKK
Subjt:  GVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK

A0A1S3C5Z3 lysosomal Pro-X carboxypeptidase (e value: 1.2e-255; % identity: 92.95)
Query:  KPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAET
        KPKI FETRFYPQLLDHFTFTPKSSK FYQKYLINE+YWRNGAPIF+YTGNEGDIEWF ANTGFLPDIAPKFHALL    HRFYGES PFG+DSYNSAET
Subjt:  KPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAET

Query:  LGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWA
        LGYLTSQQALADYAVLIRSLKQNLSSEASPVV FGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDASLNC++VIK SW 
Subjt:  LGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWA

Query:  ELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNL
        EL+Q FSEEGLAELS+TFRTCKNLHSVSSVRDWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFN+
Subjt:  ELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNL

Query:  ENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNIST
        ENGPD+HGLSGW+WQACTEMVMPMTCSN+SMFPPSEFDYEEFATDCKKKYGVSPR HWITTEFGGERIE+VLKRFGSN+IFSNGMQDPWSRGGVLRNIST
Subjt:  ENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNIST

Query:  SIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK
        SI+AIVTEKGAHHVDFRSAT DDPDWLV+QR+QEVEIIHQWINEYYADMKQDKK
Subjt:  SIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK

A0A5D3BEY7 Lysosomal Pro-X carboxypeptidase (e value: 1.2e-255; % identity: 92.95)
Query:  KPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAET
        KPKI FETRFYPQLLDHFTFTPKSSK FYQKYLINE+YWRNGAPIF+YTGNEGDIEWF ANTGFLPDIAPKFHALL    HRFYGES PFG+DSYNSAET
Subjt:  KPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAET

Query:  LGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWA
        LGYLTSQQALADYAVLIRSLKQNLSSEASPVV FGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVP SSFYDAVSQDFKDASLNC++VIK SW 
Subjt:  LGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWA

Query:  ELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNL
        EL+Q FSEEGLAELS+TFRTCKNLHSVSSVRDWLWSAFVYT+MVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFN+
Subjt:  ELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNL

Query:  ENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNIST
        ENGPD+HGLSGW+WQACTEMVMPMTCSN+SMFPPSEFDYEEFATDCKKKYGVSPR HWITTEFGGERIE+VLKRFGSN+IFSNGMQDPWSRGGVLRNIST
Subjt:  ENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNIST

Query:  SIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK
        SI+AIVTEKGAHHVDFRSAT DDPDWLV+QR+QEVEIIHQWINEYYADMKQDKK
Subjt:  SIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKK

A0A6J1CN04 lysosomal Pro-X carboxypeptidase (e value: 4.2e-256; % identity: 89.85)
Query:  FTRYKPNWGNLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRF
        F +    + +L      KPKIP+ETR+YPQLLDHFTFTP+SSK FYQKYLIN QYWRNGAPIF+YTGNEGDI+WFAANTGFL DIAPKFHALL    HRF
Subjt:  FTRYKPNWGNLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRF

Query:  YGESKPFGDDSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQD
        YGESKPFG+DSY+SAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNI+PRSSFYDAVSQD
Subjt:  YGESKPFGDDSYNSAETLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQD

Query:  FKDASLNCYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFA
        FKDASLNCYEVIKGSWAEL+QAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMV+YPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFA
Subjt:  FKDASLNCYEVIKGSWAELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFA

Query:  AASLYYNYSHGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSN
        AASLYYNYSHGEKCFNLENGPD+HGLSGWNWQACTEMVMPM CSN+SMFPPSEF Y+EFA DC+KKYGVSPR HWITTEFGGERIEQVLKRFGSNIIFSN
Subjt:  AASLYYNYSHGEKCFNLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSN

Query:  GMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI
        GM+DPWSRGGVL NIST+IV IVTEKGAHHVDFRSAT DDPDWLV+QRRQEVEIIHQWINEYYAD+KQDKK I
Subjt:  GMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI

SwissProt top hits (columns: e value, % identity, alignment)
P42785 Lysosomal Pro-X carboxypeptidase (e value: 1.4e-102; % identity: 42.33)
Query:  TNPLNSKPKIP--FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYW-RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG
        TNP  S P +   +   ++ Q +DHF F   + K F Q+YL+ ++YW +NG  I  YTGNEGDI WF  NTGF+ D+A +  A+L    HR+YGES PFG
Subjt:  TNPLNSKPKIP--FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYW-RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG

Query:  DDSYNSAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLN
        D+S+  +  L +LTS+QALAD+A LI+ LK+ +  +E  PV+  GGSYGGMLAAWFR+KYPH+ +GALA+SAPI  F+++VP   F   V+ DF+ +  +
Subjt:  DDSYNSAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLN

Query:  CYEVIKGSWAELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DAFAPETSKLDKVFAAAS
        C E I  SW  + + + +  GL  L+     C  L S  +  ++DW+   +V   MV+YP  +NF++PLPA+P++ +C+ + +    ++  L  +F A +
Subjt:  CYEVIKGSWAELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DAFAPETSKLDKVFAAAS

Query:  LYYNYSHGEKCFNL-ENGPDVHGLSGWNWQACTEMVMPMTCSN--KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSN
        +YYNYS   KC N+ E      G  GW++QACTE+VMP  C+N    MF P  ++ +E + DC +++GV PR  WITT +GG+ I        +NI+FSN
Subjt:  LYYNYSHGEKCFNL-ENGPDVHGLSGWNWQACTEMVMPMTCSN--KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSN

Query:  GMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYY
        G  DPWS GGV ++I+ ++VA+   +GAHH+D R+    DP  ++  R  EV  +  WI ++Y
Subjt:  GMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYY

Q2TA14 Lysosomal Pro-X carboxypeptidase (e value: 1.0e-105; % identity: 44.18)
Query:  SKPKI--PFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWR-NGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYN
        S+P I   +  R+  Q +DHF F     + F Q+YLI + YW+ +G  I  YTGNEGDI WF  NTGF+ DIA +  A+L    HR+YGES PFG DS++
Subjt:  SKPKI--PFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWR-NGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYN

Query:  SAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVI
         +  L +LT++QALAD+A LIR LK+ +  +    V+  GGSYGGMLAAWFR+KYPH+ +GALASSAPI  F+++VP   F   V+ DF  +  NC E I
Subjt:  SAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVI

Query:  KGSWAELQQ-AFSEEGLAELSRTFRTCKNL---HSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIID-AFAPETSKLDKVFAAASLYYN
        + SW  + + A    GL  LS     C  L     V  ++DW+   +V   MV+YP E+NF++PLPA+PV+ +C+    +  P+T  +  +F A ++YYN
Subjt:  KGSWAELQQ-AFSEEGLAELSRTFRTCKNL---HSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIID-AFAPETSKLDKVFAAASLYYN

Query:  YSHGEKCFNL-ENGPDVHGLSGWNWQACTEMVMPMTCSN--KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQD
        YS   KC N+ E      G+ GW++QACTEMVMP TCS+    MF P  ++ +E++ DC K++GV PR  WI T +GG+ I        +NIIFSNG  D
Subjt:  YSHGEKCFNL-ENGPDVHGLSGWNWQACTEMVMPMTCSN--KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQD

Query:  PWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQ
        PWS GGV ++I+ +++AIV   GAHH+D R++   DP  +   R  EV+ + QWI+++Y  +++
Subjt:  PWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQ

Q5RBU7 Lysosomal Pro-X carboxypeptidase (e value: 8.0e-103; % identity: 42.12)
Query:  TNPLNSKPKIP--FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYW-RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG
        TNP  S P +   +   ++ Q +DHF F   + K F Q+YL+ ++YW +NG  I  YTGNEGDI WF  NTGF+ D+A +  A+L    HR+YGES PFG
Subjt:  TNPLNSKPKIP--FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYW-RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG

Query:  DDSYNSAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLN
        D+++  +  L +LTS+QALAD+A LI+ LK+ +  +E  PV+  GGSYGGMLAAWFR+KYPH+ +GALA+SAPI  F+++VP   F   V+ DF+ +  +
Subjt:  DDSYNSAETLGYLTSQQALADYAVLIRSLKQNL-SSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLN

Query:  CYEVIKGSWAELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DAFAPETSKLDKVFAAAS
        C E I+ SW  + + + +  GL  L+     C  L S  +  ++DW+   +V   MV+YP  +NF++PLPA+P++ +C+ + +    ++  L  +F A +
Subjt:  CYEVIKGSWAELQQ-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DAFAPETSKLDKVFAAAS

Query:  LYYNYSHGEKCFNL-ENGPDVHGLSGWNWQACTEMVMPMTCSN--KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSN
        +YYNYS   KC N+ E      G  GW++QACTE+VMP  C+N    MF P  ++ +E + DC +++GV PR  WITT +GG+ I        +NI+FSN
Subjt:  LYYNYSHGEKCFNL-ENGPDVHGLSGWNWQACTEMVMPMTCSN--KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSN

Query:  GMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYY
        G  DPWS GGV ++I+ ++VA+   +GAHH+D R+    DP  ++  R  EV  +  WI ++Y
Subjt:  GMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYY

Q7TMR0 Lysosomal Pro-X carboxypeptidase (e value: 1.7e-105; % identity: 43.61)
Query:  FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYW-RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAETLGYL
        +   ++ Q +DHF F     + F Q+YL+ +++W RNG  I  YTGNEGDI WF  NTGF+ D+A +  A+L    HR+YGES PFG DS+  ++ L +L
Subjt:  FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYW-RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAETLGYL

Query:  TSQQALADYAVLIRSLKQNL-SSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQ
        TS+QALAD+A LIR L++ +  ++  PV+  GGSYGGMLAAWFR+KYPHI +GALA+SAPI   D +VP   F   V+ DF+ +   C E I+ SW  + 
Subjt:  TSQQALADYAVLIRSLKQNL-SSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQ

Query:  Q-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DAFAPETSKLDKVFAAASLYYNYSHGEKCFN
        + + S  GL  L+     C  L S  + +++ W+   +V   MVNYP   NF++PLPA+P++E+C+ + +    +T  L  +F A S+YYNYS    C N
Subjt:  Q-AFSEEGLAELSRTFRTCKNLHS--VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKII-DAFAPETSKLDKVFAAASLYYNYSHGEKCFN

Query:  L-ENGPDVHGLSGWNWQACTEMVMPMTCSN--KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLR
        + +      G  GW++QACTEMVMP  C+N    MF P  +D E+++ DC  ++GV PR HW+TT +GG+ I        SNIIFSNG  DPWS GGV R
Subjt:  L-ENGPDVHGLSGWNWQACTEMVMPMTCSN--KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLR

Query:  NISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMK
        +I+ ++VAI    GAHH+D R+    DP  ++  R  EV+ + +WI ++Y++++
Subjt:  NISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMK

Q9EPB1 Dipeptidyl peptidase 2 (e value: 2.8e-84; % identity: 37.83)
Query:  FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNG-APIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAETLGYL
        F   ++ Q +DHF F   S+K F Q++L+++++W+ G  PIF YTGNEGDI   A N+GF+ ++A +  ALL    HR+YG+S PFG  S     T   L
Subjt:  FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNG-APIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAETLGYL

Query:  TSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQQ
        T +QALAD+AVL+++L+ NL  + +P + FGGSYGGML+A+ R+KYPH+  GALA+SAP++    +     F+  V+ DF   S  C + ++ ++ +++ 
Subjt:  TSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQQ

Query:  AFSEEGLAELSRTFRTCKNLHS---VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNL-
         F +     +S+ F TC++L S   ++ +  +  +AF    M++YP   NF+ PLPA PV+  C   +    E  ++  + A A L YN S  E CF++ 
Subjt:  AFSEEGLAELSRTFRTCKNLHS---VSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNL-

Query:  ---ENGPDVHGLS------GWNWQACTEMVMPMTCSN-KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWS
           ++  D  G         W++QACTE+ +    +N   MFP   F  E     C   +GV PR  W+ T F G  +     +  SNIIFSNG  DPW+
Subjt:  ---ENGPDVHGLS------GWNWQACTEMVMPMTCSN-KSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWS

Query:  RGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWI
         GG+ RN+STSI+A+  + GAHH+D R++ ++DP  +V+ R+ E  +I +W+
Subjt:  RGGVLRNISTSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWI

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714) (e value: 1.2e-77; % identity: 41.05)
Query:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPP-------PRPCWFQRFLSATSEVDCDPRWNLSFRMS
        L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S                LP  P P           WF RFL++ +E + DPRW L FRMS
Subjt:  LNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLYALSPRLPKIYLPPPPPP-------PRPCWFQRFLSATSEVDCDPRWNLSFRMS

Query:  KSSFSLLLRLLSPIPSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL
        KS+F  L  +L      S SS+P   + AA +FRLAHGASY+ +  RFG DS + A  SF+ VCK INEKL         +D     F    LPNC GV+
Subjt:  KSSFSLLLRLLSPIPSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACHSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVL

Query:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN
        G  RF  +G+LLG  GS+LVQALVD+ GRF+D+SAGWPS+MKPE I RQ+KL++  E   E+L G    L +   +P+Y++GDSC PLLPWL+TPY   +
Subjt:  GLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELLKGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLN

Query:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDQEEG--ASCSGE-------EQKFPL
        +E+S  F E  FN+  +  +  V  AF  +RARW++L K WK    ++ PF++ TGCLLHNFL+   +  D  ++   G  A  +GE       E++   
Subjt:  EEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSEKLDEEQDQEEG--ASCSGE-------EQKFPL

Query:  YDGEIGDNRGKDIRDTLASHLSSLKSGSLKVKYWELKTPAASSEIIQQIQYLEDYTNS
        ++GE      K IRD +A +LS  K   +     +L+     +E  Q+ +Y +D  NS
Subjt:  YDGEIGDNRGKDIRDTLASHLSSLKSGSLKVKYWELKTPAASSEIIQQIQYLEDYTNS

AT2G24280.1 alpha/beta-Hydrolases superfamily protein (e value: 4.1e-203; % identity: 73.41)
Query:  SKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAE
        SK ++PFETR++PQ LDHF+FTP S K F+QKYLIN ++WR G PIF+YTGNEGDI+WFA+NTGF+ DIAPKF ALL    HRFYGES PFG  S+ SAE
Subjt:  SKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFGDDSYNSAE

Query:  TLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSW
        TLGYL SQQALADYA+LIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVP +SFYDA+SQDFKDAS+NC++VIK SW
Subjt:  TLGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSW

Query:  AELQQAFS-EEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCF
         EL+   + + GL ELS+ FRTCK LHS  S RDWL  AFVYT MVNYPT ANFM PLP YPV++MCKIID F   +S LD+ FAAASLYYNYS  EKCF
Subjt:  AELQQAFS-EEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCF

Query:  NLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNI
         +E   D HGL GW +QACTEMVMPM+CSN+SM PP E D E F   C  +YGV PR HWITTEFGG RIE VLKRFGSNIIFSNGMQDPWSRGGVL+NI
Subjt:  NLENGPDVHGLSGWNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNI

Query:  STSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDK
        S+SIVA+VT+KGAHH D R+AT DDP+WL +QRRQEV II +WI+EYY D+++++
Subjt:  STSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDK

AT5G22860.1 Serine carboxypeptidase S28 family protein (e value: 3.4e-101; % identity: 42.04)
Query:  FYPQLLDHFTFTPKSSKRFYQKYLINEQYW---RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG--DDSYNSAETLGYL
        ++ Q LDHFTFTP+S   F Q+Y I+  +W   +  API  + G E  ++   A  GFL D  P+ +ALL    HR+YGE+ PFG  +++  +A TLGYL
Subjt:  FYPQLLDHFTFTPKSSKRFYQKYLINEQYW---RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG--DDSYNSAETLGYL

Query:  TSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQQ
         + QALADYA ++  +K+  S+  SP++V GGSYGGMLAAWFRLKYPHI +GALASSAP+L+F++  P+  +Y  V++ FK+AS  CY  I+ SW E+ +
Subjt:  TSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQQ

Query:  -AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPET--SKLDKVFA--AASLYYNYSHGEKCFN
         A    GL+ LS+ F+TC  L+    ++D+L +  +Y   V Y    NF        V ++C  I+A  P    + LD++FA   A +     +  K F 
Subjt:  -AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPET--SKLDKVFA--AASLYYNYSHGEKCFN

Query:  LENGPDVHGLSGWNWQACTEMVMPMTCSNK-SMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNI
             ++     W WQ+C+E+VMP+    + +MFP + F+   +   CK  +GV+PR HWITT FG + ++ +L++FGSNIIFSNG+ DP+S GGVL +I
Subjt:  LENGPDVHGLSGWNWQACTEMVMPMTCSNK-SMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNI

Query:  STSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMK
        S ++VAI T+ G+H +D    + +DP+WLV QR +E+++I  WI+ Y  D++
Subjt:  STSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMK

AT5G22860.2 Serine carboxypeptidase S28 family protein (e value: 1.7e-84; % identity: 42.03)
Query:  FYPQLLDHFTFTPKSSKRFYQKYLINEQYW---RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG--DDSYNSAETLGYL
        ++ Q LDHFTFTP+S   F Q+Y I+  +W   +  API  + G E  ++   A  GFL D  P+ +ALL    HR+YGE+ PFG  +++  +A TLGYL
Subjt:  FYPQLLDHFTFTPKSSKRFYQKYLINEQYW---RNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG--DDSYNSAETLGYL

Query:  TSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQQ
         + QALADYA ++  +K+  S+  SP++V GGSYGGMLAAWFRLKYPHI +GALASSAP+L+F++  P+  +Y  V++ FK+AS  CY  I+ SW E+ +
Subjt:  TSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQQ

Query:  -AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPET--SKLDKVFA--AASLYYNYSHGEKCFN
         A    GL+ LS+ F+TC  L+    ++D+L +  +Y   V Y    NF        V ++C  I+A  P    + LD++FA   A +     +  K F 
Subjt:  -AFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPET--SKLDKVFA--AASLYYNYSHGEKCFN

Query:  LENGPDVHGLSGWNWQACTEMVMPMTCSNK-SMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGG
             ++     W WQ+C+E+VMP+    + +MFP + F+   +   CK  +GV+PR HWITT FG + ++ +L++FGSNIIFSNG+ DP+S GG
Subjt:  LENGPDVHGLSGWNWQACTEMVMPMTCSNK-SMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGG

AT5G65760.1 Serine carboxypeptidase S28 family protein (e value: 3.3e-144; % identity: 53.93)
Query:  FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGA---PIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG--DDSYNSAET
        +ET+F+ Q LDHF+F      +F Q+YLIN  +W   +   PIF+Y GNEGDIEWFA N+GF+ DIAPKF ALL    HR+YGES P+G  +++Y +A T
Subjt:  FETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGA---PIFIYTGNEGDIEWFAANTGFLPDIAPKFHALL----HRFYGESKPFG--DDSYNSAET

Query:  LGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSW-
        L YLT++QALAD+AV +  LK+NLS+EA PVV+FGGSYGGMLAAW RLKYPHI IGALASSAPIL F+++VP  +FYD  S DFK  S +C+  IK SW 
Subjt:  LGYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSW-

Query:  AELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFN
        A + +   E GL +L++TF  C+ L+S   + DWL SA+ Y  MV+YP  A+FM PLP +P++E+C+ ID      S LD+++A  S+YYNY+    CF 
Subjt:  AELQQAFSEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFN

Query:  LENGPDVHGLSGWNWQACTEMVMPMTCSNK-SMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNI
        L++ P  HGL GWNWQACTEMVMPM+ + + SMFP   F+Y  +  +C   + V+PR  W+TTEFGG  I   LK FGSNIIFSNG+ DPWS G VL+N+
Subjt:  LENGPDVHGLSGWNWQACTEMVMPMTCSNK-SMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNI

Query:  STSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI
        S +IVA+VT++GAHH+D R +T +DP WLV QR  E+ +I  WI  Y  + K+ K S+
Subjt:  STSIVAIVTEKGAHHVDFRSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI


Sequences
CDS sequence
ATGAAGAAAAAAAACTTGTTTTCAAAGTGTTCATTGTGTGTGCTCCCCCGTAGGCCTCCCCTGTTCGCCATGGCCACCAGAGGACTCGGCGGCGAGAAGAGAACA
ACCAGAAGCTCCGCCATGAACGCCCCCGCCGCCACTACCAGAAGCAAGACCAAGAAATTTGACAGAGAGAACCATCTCAACCATCAACTGGTAACCCTCATCGAA
ACCACCATTTCTTCCGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTCCTTCCCTCACAAACCCTCGCCCTTGAATCCCTCCTCTGTTCCACTTCATCCTCT
CTTTACGCTCTCTCTCCTCGTCTCCCAAAAATTTACCTACCGCCGCCGCCGCCTCCGCCACGACCATGCTGGTTCCAACGCTTCCTCTCTGCGACATCCGAGGTC
GATTGCGATCCGAGATGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTTCGTCTCCTTTCCCCGATTCCGAGCTCCTCATCCTCTTCAGTT
CCTCCGGATTGTGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGTGCGAGCTACAAGGCGGTTGGGAGGCGGTTCGGGATCGATTCCGCTGATGCTTGCCAC
TCGTTTTATGCTGTTTGTAAAGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTT
CCGAATTGCTGTGGGGTTTTAGGTCTTAGAAGATTTGGGTTTGAGGGTGAGCTGCTAGGCAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGG
AGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATCGAGAACTCCACTGAATTACTC
AAAGGCCCTGTTTACAATCTCGATGATGAAAAGCCCATTCCTCAATACTTGATTGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTAACACCATACATGAAA
CTGAACGAGGAAGATAGCTCTGGCTTTTCTGAAAGAGCATTCAATTCCACACATAACCGTGCAATGGCGTTGGTTAACACAGCATTTTGCGGACTCCGAGCTCGG
TGGAAGCTTCTGTCAAAACCATGGAAGGAAGGATGTAGAGATTATTTCCCATTTATTGTATTGACTGGGTGTCTGCTGCACAATTTCCTCATTAAATGCAGTGAG
AAACTAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGTTCAGGTGAGGAGCAGAAGTTTCCTCTTTATGACGGTGAGATAGGAGATAATAGAGGAAAGGAT
ATCAGAGATACGCTTGCCTCGCACTTGAGTAGCCTGAAATCTGGCAGTCTAAAAGTAAAATATTGGGAGCTCAAAACCCCAGCAGCTTCATCAGAAATTATTCAG
CAAATACAATATCTGGAAGACTATACAAACAGCACAAAGGATTCGTTTCAACCATTCGACAAAACCAAGCACAAAATCTTCACTAGATACAAACCAAACTGGGGA
AATTTAACAAATCCGTTGAACTCAAAGCCAAAGATCCCTTTTGAGACCCGTTTCTATCCTCAGCTGCTAGATCATTTCACCTTCACGCCAAAGAGTTCCAAAAGA
TTTTACCAGAAGTATCTGATTAATGAGCAATACTGGCGAAATGGAGCTCCAATCTTCATTTACACTGGCAATGAGGGCGACATTGAATGGTTTGCTGCCAATACC
GGTTTCTTGCCAGATATTGCTCCAAAATTCCATGCCCTTCTGCATAGATTTTATGGGGAATCGAAGCCGTTTGGAGATGACTCATATAACTCGGCAGAAACATTA
GGCTACTTGACTTCACAACAAGCTTTGGCTGACTATGCAGTTTTGATAAGAAGTTTGAAGCAAAACCTCTCTTCTGAGGCTTCCCCTGTTGTTGTGTTTGGTGGG
TCTTATGGAGGAATGCTGGCAGCCTGGTTTAGACTGAAATACCCTCATATTACTATTGGAGCTTTGGCATCTTCAGCACCCATTTTACACTTTGATAACATCGTA
CCGAGGTCGAGCTTCTATGATGCTGTTTCTCAGGATTTCAAGGATGCTAGTTTGAATTGCTATGAAGTGATAAAAGGGAGTTGGGCAGAGCTTCAGCAAGCATTT
TCTGAAGAGGGGTTGGCCGAACTGAGCAGAACATTCAGAACTTGCAAGAACCTTCATTCAGTATCCTCGGTTCGAGACTGGTTATGGTCAGCATTTGTCTACACT
ACAATGGTAAATTACCCGACTGAAGCAAATTTTATGAGGCCATTGCCTGCCTATCCTGTACAAGAGATGTGTAAGATTATCGACGCATTTGCACCAGAAACTAGC
AAGCTTGACAAGGTTTTTGCTGCTGCGAGCTTGTATTACAATTACTCACATGGAGAGAAATGCTTTAACCTGGAAAACGGACCTGATGTTCACGGTCTTAGTGGT
TGGAACTGGCAGGCTTGTACAGAGATGGTGATGCCAATGACTTGTTCCAACAAGAGCATGTTTCCACCGAGCGAGTTCGATTATGAAGAATTTGCAACAGATTGC
AAGAAGAAATATGGAGTTTCACCTCGTCTGCATTGGATCACTACTGAGTTCGGTGGCGAAAGAATTGAGCAAGTGTTGAAAAGATTTGGCAGCAATATCATATTT
TCTAATGGAATGCAAGATCCATGGAGCAGAGGAGGGGTGCTGAGAAATATTTCAACCAGTATCGTTGCCATCGTCACGGAGAAAGGAGCCCACCATGTCGATTTT
CGATCAGCGACGAACGATGACCCCGACTGGCTTGTCAAGCAGAGGAGACAAGAAGTGGAGATCATTCATCAATGGATTAATGAGTATTATGCAGATATGAAACAA
GACAAGAAATCCATCTAA
mRNA sequence
ATGAAGAAAAAAAACTTGTTTTCAAAGTGTTCATTGTGTGTGCTCCCCCGTAGGCCTCCCCTGTTCGCCATGGCCACCAGAGGACTCGGCGGCGAGAAGAGAACA
ACCAGAAGCTCCGCCATGAACGCCCCCGCCGCCACTACCAGAAGCAAGACCAAGAAATTTGACAGAGAGAACCATCTCAACCATCAACTGGTAACCCTCATCGAA
ACCACCATTTCTTCCGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTCCTTCCCTCACAAACCCTCGCCCTTGAATCCCTCCTCTGTTCCACTTCATCCTCT
CTTTACGCTCTCTCTCCTCGTCTCCCAAAAATTTACCTACCGCCGCCGCCGCCTCCGCCACGACCATGCTGGTTCCAACGCTTCCTCTCTGCGACATCCGAGGTC
GATTGCGATCCGAGATGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTTCGTCTCCTTTCCCCGATTCCGAGCTCCTCATCCTCTTCAGTT
CCTCCGGATTGTGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGTGCGAGCTACAAGGCGGTTGGGAGGCGGTTCGGGATCGATTCCGCTGATGCTTGCCAC
TCGTTTTATGCTGTTTGTAAAGCTATCAATGAGAAATTGGGGCATTTGCTTGAGCTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTT
CCGAATTGCTGTGGGGTTTTAGGTCTTAGAAGATTTGGGTTTGAGGGTGAGCTGCTAGGCAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGG
AGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATCGAGAACTCCACTGAATTACTC
AAAGGCCCTGTTTACAATCTCGATGATGAAAAGCCCATTCCTCAATACTTGATTGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTAACACCATACATGAAA
CTGAACGAGGAAGATAGCTCTGGCTTTTCTGAAAGAGCATTCAATTCCACACATAACCGTGCAATGGCGTTGGTTAACACAGCATTTTGCGGACTCCGAGCTCGG
TGGAAGCTTCTGTCAAAACCATGGAAGGAAGGATGTAGAGATTATTTCCCATTTATTGTATTGACTGGGTGTCTGCTGCACAATTTCCTCATTAAATGCAGTGAG
AAACTAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGTTCAGGTGAGGAGCAGAAGTTTCCTCTTTATGACGGTGAGATAGGAGATAATAGAGGAAAGGAT
ATCAGAGATACGCTTGCCTCGCACTTGAGTAGCCTGAAATCTGGCAGTCTAAAAGTAAAATATTGGGAGCTCAAAACCCCAGCAGCTTCATCAGAAATTATTCAG
CAAATACAATATCTGGAAGACTATACAAACAGCACAAAGGATTCGTTTCAACCATTCGACAAAACCAAGCACAAAATCTTCACTAGATACAAACCAAACTGGGGA
AATTTAACAAATCCGTTGAACTCAAAGCCAAAGATCCCTTTTGAGACCCGTTTCTATCCTCAGCTGCTAGATCATTTCACCTTCACGCCAAAGAGTTCCAAAAGA
TTTTACCAGAAGTATCTGATTAATGAGCAATACTGGCGAAATGGAGCTCCAATCTTCATTTACACTGGCAATGAGGGCGACATTGAATGGTTTGCTGCCAATACC
GGTTTCTTGCCAGATATTGCTCCAAAATTCCATGCCCTTCTGCATAGATTTTATGGGGAATCGAAGCCGTTTGGAGATGACTCATATAACTCGGCAGAAACATTA
GGCTACTTGACTTCACAACAAGCTTTGGCTGACTATGCAGTTTTGATAAGAAGTTTGAAGCAAAACCTCTCTTCTGAGGCTTCCCCTGTTGTTGTGTTTGGTGGG
TCTTATGGAGGAATGCTGGCAGCCTGGTTTAGACTGAAATACCCTCATATTACTATTGGAGCTTTGGCATCTTCAGCACCCATTTTACACTTTGATAACATCGTA
CCGAGGTCGAGCTTCTATGATGCTGTTTCTCAGGATTTCAAGGATGCTAGTTTGAATTGCTATGAAGTGATAAAAGGGAGTTGGGCAGAGCTTCAGCAAGCATTT
TCTGAAGAGGGGTTGGCCGAACTGAGCAGAACATTCAGAACTTGCAAGAACCTTCATTCAGTATCCTCGGTTCGAGACTGGTTATGGTCAGCATTTGTCTACACT
ACAATGGTAAATTACCCGACTGAAGCAAATTTTATGAGGCCATTGCCTGCCTATCCTGTACAAGAGATGTGTAAGATTATCGACGCATTTGCACCAGAAACTAGC
AAGCTTGACAAGGTTTTTGCTGCTGCGAGCTTGTATTACAATTACTCACATGGAGAGAAATGCTTTAACCTGGAAAACGGACCTGATGTTCACGGTCTTAGTGGT
TGGAACTGGCAGGCTTGTACAGAGATGGTGATGCCAATGACTTGTTCCAACAAGAGCATGTTTCCACCGAGCGAGTTCGATTATGAAGAATTTGCAACAGATTGC
AAGAAGAAATATGGAGTTTCACCTCGTCTGCATTGGATCACTACTGAGTTCGGTGGCGAAAGAATTGAGCAAGTGTTGAAAAGATTTGGCAGCAATATCATATTT
TCTAATGGAATGCAAGATCCATGGAGCAGAGGAGGGGTGCTGAGAAATATTTCAACCAGTATCGTTGCCATCGTCACGGAGAAAGGAGCCCACCATGTCGATTTT
CGATCAGCGACGAACGATGACCCCGACTGGCTTGTCAAGCAGAGGAGACAAGAAGTGGAGATCATTCATCAATGGATTAATGAGTATTATGCAGATATGAAACAA
GACAAGAAATCCATCTAAATGAATCTTGTCAATTCAAGTTCAAGTAGGGAGTGATTCTGATTTTTGTTAAGTTCTTTTCTAGTGGAAAGATGGAATCATACATTG
TAAAATGTAGGATGATCTATCTCTATCTTTCTAAGAATTTGTATGGCGCAATATCAAATGGGTT
Protein sequence
MKKKNLFSKCSLCVLPRRPPLFAMATRGLGGEKRTTRSSAMNAPAATTRSKTKKFDRENHLNHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSS
LYALSPRLPKIYLPPPPPPPRPCWFQRFLSATSEVDCDPRWNLSFRMSKSSFSLLLRLLSPIPSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACH
SFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIENSTELL
KGPVYNLDDEKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFSERAFNSTHNRAMALVNTAFCGLRARWKLLSKPWKEGCRDYFPFIVLTGCLLHNFLIKCSE
KLDEEQDQEEGASCSGEEQKFPLYDGEIGDNRGKDIRDTLASHLSSLKSGSLKVKYWELKTPAASSEIIQQIQYLEDYTNSTKDSFQPFDKTKHKIFTRYKPNWG
NLTNPLNSKPKIPFETRFYPQLLDHFTFTPKSSKRFYQKYLINEQYWRNGAPIFIYTGNEGDIEWFAANTGFLPDIAPKFHALLHRFYGESKPFGDDSYNSAETL
GYLTSQQALADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPRSSFYDAVSQDFKDASLNCYEVIKGSWAELQQAF
SEEGLAELSRTFRTCKNLHSVSSVRDWLWSAFVYTTMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLDKVFAAASLYYNYSHGEKCFNLENGPDVHGLSG
WNWQACTEMVMPMTCSNKSMFPPSEFDYEEFATDCKKKYGVSPRLHWITTEFGGERIEQVLKRFGSNIIFSNGMQDPWSRGGVLRNISTSIVAIVTEKGAHHVDF
RSATNDDPDWLVKQRRQEVEIIHQWINEYYADMKQDKKSI