; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G22960 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G22960
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLysosomal Pro-X carboxypeptidase-like
Genome locationClcChr02:34977741..34995461
RNA-Seq ExpressionClc02G22960
SyntenyClc02G22960
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004180 - carboxypeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
GO:0008239 - dipeptidyl-peptidase activity (molecular function)
InterPro domainsIPR007849 - ATPase assembly factor ATP10
IPR008758 - Peptidase S28
IPR029058 - Alpha/Beta hydrolase fold
IPR042269 - Serine carboxypeptidase S28, SKS domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CBI17109.3 unnamed protein product, partial [Vitis vinifera]0.0e+0060.98Show/hide
Query:  LHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSV
        L  S   + L SDDF+T++YNQTLDHFNYRPESY TF QRY++NFKYW GAN+SAPIFAYLGAEA +D DL  +GF  DNA+QF ALLVYIEHRYYG+S+
Subjt:  LHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSV

Query:  PFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRG
        PF SR+EAL+NAST GYFNSAQAIADYA +L ++KK+L A  SPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQ+GYY++VTKDFR 
Subjt:  PFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRG

Query:  VSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAIDGTSSVNGTLTKIAAGVFAYRGNVSC
         SE+CY TI++SWSEI+ VAS+PNGLSIL ++F+TC  L    EL+DYL +MYA AAQYNHPPRYPVT +C  IDG    +  L++I AGV AYRGN SC
Subjt:  VSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAIDGTSSVNGTLTKIAAGVFAYRGNVSC

Query:  YINEPRNETETDVGWRWQSCSEMVMPI--SSDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLH
        Y N   N TET  GWRWQ+CSEMVMPI    +D MFPP PF+L +FI  C  LY VPPRPHW+TTYYGGHD +L+L RF SNIIFSNGL+DPYS AGVL 
Subjt:  YINEPRNETETDVGWRWQSCSEMVMPI--SSDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLH

Query:  NISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE----------------------------------------------FRMPRLSRIGEKFLHH
        NIS +++A++T  GSHCLDIL A  TDPEWL+ QRKTE                                              F +PRL  +    L +
Subjt:  NISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE----------------------------------------------FRMPRLSRIGEKFLHH

Query:  SKALELPPS--DDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHR-------
         +   +  S   D KTF+Y QT+DHFNYRPESY TF QRY++NFK+WGGA + API AYLGAEAP+D  +  IGF+ DNA +FNALL+ IEHR       
Subjt:  SKALELPPS--DDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHR-------

Query:  ----------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREV
                  AS  G  + +           HVKK  +A+ SPVIVIGGSYGGMLA+WF LKYPH+ LG+LASSAPILYFD+I P+ GYY++VTKDFRE 
Subjt:  ----------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREV

Query:  SQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCY
        S++CY TIR SWSE +++AS+PNGLSIL K FKTC+ L SS +L++YL  +YA AAQYN PP YPVT +C  I+       TL +I  G+ A  G  SCY
Subjt:  SQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCY

Query:  INEPRN-ATETDVGWQWQRCSEMVMPIS-TDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLH
          +  N  TET +GW+WQ+CSEMV+PI    NDTMF P  F+L  FI  CN LY V PRPHWVTTYYGG DI LIL RFASNIIFSNGL+DPYS GGVL 
Subjt:  INEPRN-ATETDVGWQWQRCSEMVMPIS-TDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLH

Query:  NISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTE
        NISD+L AVY  +GSHCLDIL + K DP+WL  QRK E
Subjt:  NISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTE

KAA0033355.1 lysosomal Pro-X carboxypeptidase-like [Cucumis melo var. makuwa]0.0e+0079.5Show/hide
Query:  EFRMPRLSRIGEKFLHHSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNA
        +FR+PRLS IGEKFL+HSKALELPPSDDFKTFY+NQT+DHFNYRPESYTTF QRYIINFKYWGGANSSAPILAYLG EAPIDSAMN IGFMTDNA+KFNA
Subjt:  EFRMPRLSRIGEKFLHHSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNA

Query:  LLVDIEHR-----------------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITP
        LLV IEHR                 AS  G  + +           HVK EFNAKYSPVIVIGGSYGGMLATWF LKYPHV LG+LASSAPILYF+DITP
Subjt:  LLVDIEHR-----------------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITP

Query:  QNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSK
        QNGYY  VTKDFREVSQTCYETIRESWSE E VASQPNGLS+LDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHP  YPVTRIC AIDRT+S NGTL K
Subjt:  QNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSK

Query:  IAAGVFAYRGNLSCYINEPRNATETDVGWQWQRCSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSN
        IAAGVFAYRGNLSCYINEP N TET VGWQWQRCSEMVMPIST NDTMFPPRTFD ESF IYCNQLYGV PRPHWVTTYYGG D++LIL RFASNIIFSN
Subjt:  IAAGVFAYRGNLSCYINEPRNATETDVGWQWQRCSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSN

Query:  GLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTECLYLALEACYADESSTTFLPAIKSSQPFHWLQSGAYDDFKTFYYNQTLD
        GLKDPYSIGGVLHNISDSLPAVY  NGSHCLDILS+++MDPEWL TQRKTE         +  +     L  ++S+Q  ++    A DDFKTFYYNQ+LD
Subjt:  GLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTECLYLALEACYADESSTTFLPAIKSSQPFHWLQSGAYDDFKTFYYNQTLD

Query:  HFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNTIGFMTDNAVQFNALLVYIEHRYYGKSIPFGSRKEALKNASTLGYFNSAQAIA
        HFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLN IGFMTDNAV+F+ALLVYIEHRYYGKS+PFGSR+EALKNASTLGYF+SAQAIA
Subjt:  HFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNTIGFMTDNAVQFNALLVYIEHRYYGKSIPFGSRKEALKNASTLGYFNSAQAIA

Query:  DYAAVLIHLKKKFHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPVLYFDNITPHNGYYSIATKDFREVSETCYRTIRDSWSEIERIASKPNG
        DYAAVL+HLK+K+HAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAP+LYF++ITPHNGYYSIATKDFREVSETCY TIRDSWS+IE IASKPNG
Subjt:  DYAAVLIHLKKKFHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPVLYFDNITPHNGYYSIATKDFREVSETCYRTIRDSWSEIERIASKPNG

Query:  LSILSKEFKACSPLNSSSELEDYLWSMYAGAAQYNHPPRYPVTRICGGIDGAYSGSGTISKIASGVFAYKGNLSCYNLEPRNDTETDVGWRWQRCSEMVM
        LSILSKEFK CSPLNSSS+LEDYLWSMYAGAAQYNHPPRYPVTRICGGIDGA  GSG ISK+A+GVFAYKGNL CYN+ PRNDTETDVGWRWQRCSEMVM
Subjt:  LSILSKEFKACSPLNSSSELEDYLWSMYAGAAQYNHPPRYPVTRICGGIDGAYSGSGTISKIASGVFAYKGNLSCYNLEPRNDTETDVGWRWQRCSEMVM

Query:  PISTGNDTMFPPDTFDLRSFIDYCYQSYGVSPRPHWVTTYYGGN------------------------------------------VWSHCLDILRANET
        P+ST NDTMFPP TFDLRSFIDYCYQ YGVSPRPHWVTTYYGGN                                            SHCLDILRANET
Subjt:  PISTGNDTMFPPDTFDLRSFIDYCYQSYGVSPRPHWVTTYYGGN------------------------------------------VWSHCLDILRANET

Query:  DPQWLVKQRETE
        DPQWLV+QRE E
Subjt:  DPQWLVKQRETE

KAG5408898.1 hypothetical protein IGI04_005217 [Brassica rapa subsp. trilocularis]0.0e+0044.96Show/hide
Query:  IPFLLFVLSTSVTALHYRSPRLSPVGEKFLHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDL
        +PF+  +LS    +L +       +            S  + D   +YY+Q LDHF + P+SY TF QRY+IN K+W G+ ++APIFA+LG EA I+ DL
Subjt:  IPFLLFVLSTSVTALHYRSPRLSPVGEKFLHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDL

Query:  NVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGA
          +GF  DN  +  ALLVYIEHRYYGKSVPF S +EAL+NASTLGY N+AQA+ADYA IL+H+K++  A +SP+IV+GGSYGGMLA+WFRLKYPH+ALGA
Subjt:  NVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGA

Query:  LASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRIC
        LASSAP+LYF+D  P+ GYY ++TK F+  S+ CY+TI+KSW EI+ VA++ NGL IL ++FKTC PL   F+++D+L S+YA + Q+N  P   V  +C
Subjt:  LASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRIC

Query:  DAIDGTSSVNGTLTKIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHD
        +AID                             P N            CSE+VMPI  D  D MF   PF++  FI  C   YGV PRPHW+TTY+G  D
Subjt:  DAIDGTSSVNGTLTKIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHD

Query:  TRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRM-----------PRLSRIG--EKFLHHSKAL
         +L+L+RFGSNIIFSNGL DPYS+ GVL N+S ++VA+ T  G+HC D+    + DP+WLV QR+ E +             R++R+G           L
Subjt:  TRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRM-----------PRLSRIG--EKFLHHSKAL

Query:  ELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHRASNSGLCSHSYHV
        E   + D K FYY+Q +DHF++ PESY TF QRY ++ K+W GAN+SAPILA+LG EA ++  ++ +                                 
Subjt:  ELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHRASNSGLCSHSYHV

Query:  KKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFK
                            +LA WF LKYPH+ LG+LASSAP+LYF+D  P  GYY V+T  F+E S+ CY+TIR+SW E ++VA++ NGL IL K+F+
Subjt:  KKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFK

Query:  TCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCYINEPRNATETDVGWQWQRCSEMVMPISTD-NDT
        TC+PL  S  ++++L  +YA + Q+N  P   V  +C AID   +     + +A         +   +  P+  +   VG     CSE++MPI  D +DT
Subjt:  TCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCYINEPRNATETDVGWQWQRCSEMVMPISTD-NDT

Query:  MFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQ
        MF    F++ S I  C   YGV PRPHWVTTY+G  D+ LIL+RF SNIIFSNGL DPYS+GGVL +++DS+ A+    G+H  D+ +  K DPEWL  +
Subjt:  MFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQ

Query:  RKTECLYLALEACYADESSTTFLPAIKSSQPFHWLQSGAYDDFKTFYYNQTLDHFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLN
                               P +  ++P    Q     D K FY+NQ LDHF + P+SY  F  RY I+ K+W GA  +APILA+LG E  L+ DL+
Subjt:  RKTECLYLALEACYADESSTTFLPAIKSSQPFHWLQSGAYDDFKTFYYNQTLDHFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLN

Query:  TIGFMTDNAVQFNALLVYIEHRYYGKSIPFGSRKEALKNASTLGYFNSAQAIADYAAVLIHLKKKFHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGAL
         I F+ DN  +  ALLVYIEHRYYGK++PFGS +EALKNASTLGY N+AQA+ADYA++L+H+K+K+  K SP+IV+GGSYGGMLAAWFRLKYPH+ALGAL
Subjt:  TIGFMTDNAVQFNALLVYIEHRYYGKSIPFGSRKEALKNASTLGYFNSAQAIADYAAVLIHLKKKFHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGAL

Query:  ASSAPVLYFDNITPHNGYYSIATKDFREVSETCYRTIRDSWSEIERIASKPNGLSILSKEFKACSPLNSSSELEDYLWSMYAGAAQYNHPPRYPVTRICG
        ASSAP+LYF++  P  GYY I TK  +  SE CY  IR SW EI+R+A+KPNGL ILSK+FK C+PLN+S +++D+L ++YA A QYN  P Y VT +C 
Subjt:  ASSAPVLYFDNITPHNGYYSIATKDFREVSETCYRTIRDSWSEIERIASKPNGLSILSKEFKACSPLNSSSELEDYLWSMYAGAAQYNHPPRYPVTRICG

Query:  GIDG--AYSGSGTISKIASGVFAYKGNLSCYNLEPRNDTETDVGWRWQRCSEMVMPIS-TGNDTMFPPDTFDLRSFIDYCYQSYGVSPRPHWVTTYYG--
         ID     S  G + +I +G  A  GN SCY        +T++        E+VMP+     DTMFP   F++ S+I+ C   YGV+PRPHW+TTY+G  
Subjt:  GIDG--AYSGSGTISKIASGVFAYKGNLSCYNLEPRNDTETDVGWRWQRCSEMVMPIS-TGNDTMFPPDTFDLRSFIDYCYQSYGVSPRPHWVTTYYG--

Query:  ----------------------------------------GNVWSHCLDILRANETDPQWLVKQRETE
                                                 N  SHC DI+   + DP+WLV QR+ E
Subjt:  ----------------------------------------GNVWSHCLDILRANETDPQWLVKQRETE

KAG5515637.1 hypothetical protein RHGRI_036621 [Rhododendron griersonianum]0.0e+0056.24Show/hide
Query:  LFVLSTSVTALHYRSPRLSPVGEKFLHHSRALN----SLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDL
        L +LSTS +A  ++ PRLS + E  +      N    S  S+  +T++Y QTLDHFNY+PESY TF QRY++N KYW GAN +APIF YLGAEA ID DL
Subjt:  LFVLSTSVTALHYRSPRLSPVGEKFLHHSRALN----SLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDL

Query:  NVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGA
         +IGFL DNA  F AL VYIEHR+YG+S+PF S  EA++N ST GYFNSAQAIADYA ++I+LK++L A  SPVIV+GGSYGGMLASWFRLKYPH+ALGA
Subjt:  NVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGA

Query:  LASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRIC
        LASSAPILYFDDITP++GYY++ TKDF+ VSETCYETI+KSWSEI+ VAS P+GLSIL Q+FKTC  L    EL+DYL +MYA AAQYNHPP YPVT++C
Subjt:  LASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRIC

Query:  DAIDGTSSVNGTLTKIAAGVFAYRGNVSCY-INEPRNETETDVGWRWQSCSEMVMPISS--DDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGH
          IDG +     L +I AG+ AYR N +CY ++E +  +ETD+GW WQ CSEMV+PI    DD MFPP PF+L  +I  C  LYGVPPRPHWVTTYYGGH
Subjt:  DAIDGTSSVNGTLTKIAAGVFAYRGNVSCY-INEPRNETETDVGWRWQSCSEMVMPISS--DDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGH

Query:  DTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE--------------------------------
        D +LVL RF SNIIFSNGL+DPYS  GVL ++SD+L+AV+T  GSHCLDIL A +TDP+WL+ QRK E                                
Subjt:  DTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE--------------------------------

Query:  ----------------------FRMPRLSRIGEKFLH-HSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAE
                               ++PRLS   E      SK +    S+D +TF+Y QT+DHFNY+P SY TF QRY+IN+K+WGGAN SAPI  YLG E
Subjt:  ----------------------FRMPRLSRIGEKFLH-HSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAE

Query:  APIDSAMNGIGFMTDNAIKFNALLVDIEHRASNSGL------------CSHSY---------------HVKKEFNAKYSPVIVIGGSYGGMLATWFLLKY
        APID  +  +GF+ +NA  F AL V IEHR     +             +  Y               ++KK+ +A  SP+IV GGSYGGMLA+WF LKY
Subjt:  APIDSAMNGIGFMTDNAIKFNALLVDIEHRASNSGL------------CSHSY---------------HVKKEFNAKYSPVIVIGGSYGGMLATWFLLKY

Query:  PHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPR
        PHV LG+LASSAPILYFDDITP + Y  VVTKDFREVS+ CYETIR+SWSE ++VAS PNGLS L ++FKTC+PL  +  L++YL  MYASAAQY+ PP 
Subjt:  PHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPR

Query:  YPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCYINEPRNA-TETDVGWQWQRCSEMVMPIST-DNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWV
        YPV ++CG ID    G   L KI AG+    G  +CY+N   N  +ETD+GW WQ CSEMV+PI     D+MF P  F+L+ +I  C   YGVPPRP+W 
Subjt:  YPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCYINEPRNA-TETDVGWQWQRCSEMVMPIST-DNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWV

Query:  TTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTECLYLA--LEACYADESSTT
        TTYYGG+DI L+LQRFASNIIFSNGL+DP+S GGVL ++SD+L AVY  NGSHCLDIL A + DP+WL  QRK E   +   L+  YAD  + T
Subjt:  TTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTECLYLA--LEACYADESSTT

QCE09401.1 lysosomal Pro-X carboxypeptidase [Vigna unguiculata]1.3e-30958.91Show/hide
Query:  FLLFVLSTSVTALH-YRSPRLS--PVGEKFLHHSRALNSLPS-DDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDD
        F LF+  T  T+++  + PRLS  P  +  LHH   L++  S D   T+YY Q LDHFNYRP+SY TF QRY+INFKYW GANSSAPI A+ GAE  ID 
Subjt:  FLLFVLSTSVTALH-YRSPRLS--PVGEKFLHHSRALNSLPS-DDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDD

Query:  DLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVAL
            I FLTDNA   NALLVYIEHRYYGKS+PF SR+EA +NAST+GYFNSAQAIADYA +LIH+KK LHA  SPVIVIGGSYGGMLASWFRLKYPH+A+
Subjt:  DLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVAL

Query:  GALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTR
        GALASSAPILYFDDITPQDGYY+VV++DFR  SETCY+TI KSWSEI+ VASQP GL +L Q F TCRPL+   EL+DYL +MYA+AAQYNHPPRYPVT 
Subjt:  GALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTR

Query:  ICDAIDGTSSVNGTLTKIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPIS-SDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGH
        IC  ID  S  N  L+KI AGV A RGN +C +N P NE+ET +GWRWQ+CSEMV+P+    + MF P P+   S    C +LYGV PRPHWVTTYYGGH
Subjt:  ICDAIDGTSSVNGTLTKIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPIS-SDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGH

Query:  DTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWL------VTQRKTEFRMPRLS--RIGEKFLHHSKALELPP
        + +L+LQ+FGSNIIFSNGL+DPYSI GVL NISD+LVA+      H ++ + +     +W+      ++      ++PRL   R  ++      +     
Subjt:  DTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWL------VTQRKTEFRMPRLS--RIGEKFLHHSKALELPP

Query:  SDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHRAS---------NSGLCS
        ++D KTFYY Q +DHFNYRP+SY TF QRY+I+FK+W G  S+API A+ GAEAP+D  +  +GF TDNA  F AL+V IE   +         NS    
Subjt:  SDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHRAS---------NSGLCS

Query:  HSY-----HVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPN
          Y     HVKK  +A+ SP+IVIGGSYGGMLA+WF LKYPH+ LG+LASSAPILYF+ I PQ GYY +VTKDF+E S+TCY+TIR+SWSE ++VA +PN
Subjt:  HSY-----HVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPN

Query:  GLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCY-INEPRNATETDVGWQWQRCSEM
        GLSIL K FKTC  L  S +L++YL  +Y  AAQY+ P    V  +C AID        L +I  GV +Y    SCY +NE    TET++GW+WQ CSEM
Subjt:  GLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCY-INEPRNATETDVGWQWQRCSEM

Query:  VMPISTD-NDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSA
        VMPI  + ND+MFPP  F+++ F+  C++LYGV P+PHWVTTYYGG+D+ LIL RFASNIIFSNGL+DPYS GGVL NIS+S+ AV   NG HCLDI S 
Subjt:  VMPISTD-NDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSA

Query:  DKMDPEWLATQRKTE
         + DPEWL  QR  E
Subjt:  DKMDPEWLATQRKTE

TrEMBL top hitse value%identityAlignment
A0A4D6N970 Lysosomal Pro-X carboxypeptidase6.3e-31058.91Show/hide
Query:  FLLFVLSTSVTALH-YRSPRLS--PVGEKFLHHSRALNSLPS-DDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDD
        F LF+  T  T+++  + PRLS  P  +  LHH   L++  S D   T+YY Q LDHFNYRP+SY TF QRY+INFKYW GANSSAPI A+ GAE  ID 
Subjt:  FLLFVLSTSVTALH-YRSPRLS--PVGEKFLHHSRALNSLPS-DDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDD

Query:  DLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVAL
            I FLTDNA   NALLVYIEHRYYGKS+PF SR+EA +NAST+GYFNSAQAIADYA +LIH+KK LHA  SPVIVIGGSYGGMLASWFRLKYPH+A+
Subjt:  DLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVAL

Query:  GALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTR
        GALASSAPILYFDDITPQDGYY+VV++DFR  SETCY+TI KSWSEI+ VASQP GL +L Q F TCRPL+   EL+DYL +MYA+AAQYNHPPRYPVT 
Subjt:  GALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTR

Query:  ICDAIDGTSSVNGTLTKIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPIS-SDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGH
        IC  ID  S  N  L+KI AGV A RGN +C +N P NE+ET +GWRWQ+CSEMV+P+    + MF P P+   S    C +LYGV PRPHWVTTYYGGH
Subjt:  ICDAIDGTSSVNGTLTKIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPIS-SDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGH

Query:  DTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWL------VTQRKTEFRMPRLS--RIGEKFLHHSKALELPP
        + +L+LQ+FGSNIIFSNGL+DPYSI GVL NISD+LVA+      H ++ + +     +W+      ++      ++PRL   R  ++      +     
Subjt:  DTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWL------VTQRKTEFRMPRLS--RIGEKFLHHSKALELPP

Query:  SDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHRAS---------NSGLCS
        ++D KTFYY Q +DHFNYRP+SY TF QRY+I+FK+W G  S+API A+ GAEAP+D  +  +GF TDNA  F AL+V IE   +         NS    
Subjt:  SDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHRAS---------NSGLCS

Query:  HSY-----HVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPN
          Y     HVKK  +A+ SP+IVIGGSYGGMLA+WF LKYPH+ LG+LASSAPILYF+ I PQ GYY +VTKDF+E S+TCY+TIR+SWSE ++VA +PN
Subjt:  HSY-----HVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPN

Query:  GLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCY-INEPRNATETDVGWQWQRCSEM
        GLSIL K FKTC  L  S +L++YL  +Y  AAQY+ P    V  +C AID        L +I  GV +Y    SCY +NE    TET++GW+WQ CSEM
Subjt:  GLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCY-INEPRNATETDVGWQWQRCSEM

Query:  VMPISTD-NDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSA
        VMPI  + ND+MFPP  F+++ F+  C++LYGV P+PHWVTTYYGG+D+ LIL RFASNIIFSNGL+DPYS GGVL NIS+S+ AV   NG HCLDI S 
Subjt:  VMPISTD-NDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSA

Query:  DKMDPEWLATQRKTE
         + DPEWL  QR  E
Subjt:  DKMDPEWLATQRKTE

A0A5A7SW17 Lysosomal Pro-X carboxypeptidase-like0.0e+0079.5Show/hide
Query:  EFRMPRLSRIGEKFLHHSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNA
        +FR+PRLS IGEKFL+HSKALELPPSDDFKTFY+NQT+DHFNYRPESYTTF QRYIINFKYWGGANSSAPILAYLG EAPIDSAMN IGFMTDNA+KFNA
Subjt:  EFRMPRLSRIGEKFLHHSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNA

Query:  LLVDIEHR-----------------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITP
        LLV IEHR                 AS  G  + +           HVK EFNAKYSPVIVIGGSYGGMLATWF LKYPHV LG+LASSAPILYF+DITP
Subjt:  LLVDIEHR-----------------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITP

Query:  QNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSK
        QNGYY  VTKDFREVSQTCYETIRESWSE E VASQPNGLS+LDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHP  YPVTRIC AIDRT+S NGTL K
Subjt:  QNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSK

Query:  IAAGVFAYRGNLSCYINEPRNATETDVGWQWQRCSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSN
        IAAGVFAYRGNLSCYINEP N TET VGWQWQRCSEMVMPIST NDTMFPPRTFD ESF IYCNQLYGV PRPHWVTTYYGG D++LIL RFASNIIFSN
Subjt:  IAAGVFAYRGNLSCYINEPRNATETDVGWQWQRCSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSN

Query:  GLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTECLYLALEACYADESSTTFLPAIKSSQPFHWLQSGAYDDFKTFYYNQTLD
        GLKDPYSIGGVLHNISDSLPAVY  NGSHCLDILS+++MDPEWL TQRKTE         +  +     L  ++S+Q  ++    A DDFKTFYYNQ+LD
Subjt:  GLKDPYSIGGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTECLYLALEACYADESSTTFLPAIKSSQPFHWLQSGAYDDFKTFYYNQTLD

Query:  HFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNTIGFMTDNAVQFNALLVYIEHRYYGKSIPFGSRKEALKNASTLGYFNSAQAIA
        HFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLN IGFMTDNAV+F+ALLVYIEHRYYGKS+PFGSR+EALKNASTLGYF+SAQAIA
Subjt:  HFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNTIGFMTDNAVQFNALLVYIEHRYYGKSIPFGSRKEALKNASTLGYFNSAQAIA

Query:  DYAAVLIHLKKKFHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPVLYFDNITPHNGYYSIATKDFREVSETCYRTIRDSWSEIERIASKPNG
        DYAAVL+HLK+K+HAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAP+LYF++ITPHNGYYSIATKDFREVSETCY TIRDSWS+IE IASKPNG
Subjt:  DYAAVLIHLKKKFHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPVLYFDNITPHNGYYSIATKDFREVSETCYRTIRDSWSEIERIASKPNG

Query:  LSILSKEFKACSPLNSSSELEDYLWSMYAGAAQYNHPPRYPVTRICGGIDGAYSGSGTISKIASGVFAYKGNLSCYNLEPRNDTETDVGWRWQRCSEMVM
        LSILSKEFK CSPLNSSS+LEDYLWSMYAGAAQYNHPPRYPVTRICGGIDGA  GSG ISK+A+GVFAYKGNL CYN+ PRNDTETDVGWRWQRCSEMVM
Subjt:  LSILSKEFKACSPLNSSSELEDYLWSMYAGAAQYNHPPRYPVTRICGGIDGAYSGSGTISKIASGVFAYKGNLSCYNLEPRNDTETDVGWRWQRCSEMVM

Query:  PISTGNDTMFPPDTFDLRSFIDYCYQSYGVSPRPHWVTTYYGGN------------------------------------------VWSHCLDILRANET
        P+ST NDTMFPP TFDLRSFIDYCYQ YGVSPRPHWVTTYYGGN                                            SHCLDILRANET
Subjt:  PISTGNDTMFPPDTFDLRSFIDYCYQSYGVSPRPHWVTTYYGGN------------------------------------------VWSHCLDILRANET

Query:  DPQWLVKQRETE
        DPQWLV+QRE E
Subjt:  DPQWLVKQRETE

A0A6N2KQP0 Uncharacterized protein1.5e-29357.97Show/hide
Query:  FNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDD
        F+ +L +   RYYGKS+PF SR+EA ++AS LGYFNSAQAIADYA I+IH+K++L A YSPVIVIGGSYGGMLASWFRLKYPH+ALGALASSAPILYFDD
Subjt:  FNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDD

Query:  ITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAIDGTSSVNGT
        ITPQDGY+++V++ FR  S TCY+TIK SW+EI+ +AS+ NGLS+L ++FKTC PL    +L+D+L SMYA  AQYN PP YPV ++C  IDG    +  
Subjt:  ITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAIDGTSSVNGT

Query:  LTKIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPIS-SDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNII
        L++I  G+ AY GN+SC++N   +E+ET VGWRWQ+CSE+ +PI   ++ MFPP PFDL  +I  C  LYGVP RPHWVTTYYGGH  +L+LQRFGSNII
Subjt:  LTKIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPIS-SDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNII

Query:  FSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR-------------------------------------MPRLSRI
        FSNGL+DPYS  GVL NIS+++VAV T  GSHCLDIL A ETDPEWLV QRK E +                                     +PRLS  
Subjt:  FSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR-------------------------------------MPRLSRI

Query:  GEKFL--HHSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHR
        G +    H  +       +DF+TF+YNQT+DHFNYRPESY TF QRY+IN KYWGGAN SAP+L YLGAEAPID  ++ +GF+ DNA++F++LLV IEHR
Subjt:  GEKFL--HHSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHR

Query:  -----------------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVV
                         AS  G  + +           H+K++  AKYSPVIVIGGSYGGMLA+WF LKYPH+ +G+LASSAPILYFDDITP + YY++V
Subjt:  -----------------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVV

Query:  TKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAY
        ++ FRE S TCY+TI+ SW+E +++AS+ NGLS+L ++FKTC+PL  +++L+N+L  MYASAAQYN PP YPV ++C  ID    G+  LS+I  G+ AY
Subjt:  TKDFREVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAY

Query:  RGNLSCYINEPRNATETDVGWQWQRCSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSI
         GNLSCY+N     +ET VGW+WQ CSE+ +PI   N++MFPP  FDLE +I  C  LYGVP RPHWVTTYYGGH I LILQRFASNIIFSNGL+DPYS 
Subjt:  RGNLSCYINEPRNATETDVGWQWQRCSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSI

Query:  GGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTECLYLA--LEACYADES
        GGVL NISD++ AV   NGSHCLDIL A + DPEWL  QRK E   +   ++  YAD S
Subjt:  GGVLHNISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTECLYLA--LEACYADES

A0A7J6H8P8 Uncharacterized protein4.5e-29058.69Show/hide
Query:  PRLSPVGEK-FLHHSRALNSLPS--DDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNAL
        PRLSP G++ FLH+   L S  S  DDF+T++YNQTLDHFNYRPESY+TF QRY++N K+W   N   PIFAYLGAE PID D+  IGFLTDNA  F AL
Subjt:  PRLSPVGEK-FLHHSRALNSLPS--DDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNAL

Query:  LVYIEHRYYGKSVPFRSRDEA-LRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITP
        +V+IEHRYYGKS+PF SR+EA L NASTLGYFNSAQA+ DYA IL+H+K    A  SP+IVIGGSYGGMLASWFRLKYPH+ALGALASSAPILYFDDITP
Subjt:  LVYIEHRYYGKSVPFRSRDEA-LRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITP

Query:  QDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAID-GTSSVNGTLT
        Q+GYY++VTKDFR  SE CYETI KSWS+I+ VA+ PNGLSIL  +FKTC PL+   EL+DY  ++YATAAQYN PP+YPV  IC+AID    + +  L 
Subjt:  QDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAID-GTSSVNGTLT

Query:  KIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPISSDDD-MFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFS
        KI +G+ AY  N +CY+N P+  TETD GW+WQ+CSEMV+PI  ++D MF  YPF +  FI  C   YGVPPRPHWVT+YYGGHD +L+L RFGSNIIFS
Subjt:  KIAAGVFAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPISSDDD-MFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFS

Query:  NGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRMPRLSRIGEK-FLHHSKAL-ELPPSDDFKTFYYNQTIDHFNYRPES
        NGL+DPYS  GVL NISDS++A+ TT GSHCLDIL    +DP+WL+       ++PRL    ++ FLH+ + L E+  S D +TF+YNQT+DHFNYR ES
Subjt:  NGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRMPRLSRIGEK-FLHHSKAL-ELPPSDDFKTFYYNQTIDHFNYRPES

Query:  YTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALL----------------------------------VDIEHRASNSG
        YT F QRY++N KYWGG N  API AYLGAEA ID  +N IGF+ +NA  F AL+                                    I++  ++ G
Subjt:  YTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALL----------------------------------VDIEHRASNSG

Query:  LCS----HSYHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQ
          +    H   + K + A+YSP+IVIG SYGGMLA+WF LKYPH+   +LASSAPILYFDDITPQN Y+ +V+KDFREVS++CY+TI +SWSE +KVA+ 
Subjt:  LCS----HSYHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREVSQTCYETIRESWSETEKVASQ

Query:  PNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGT-LSKIAAGVFAY--RGNLSCYINEPRNATETDVGWQWQR
         +GLSIL K+F TC  L S+++L+NYL  +Y SAAQYN PP YPV  +CG ID   + N T L KI +G+ AY    N +CY+N+P N TETD GW WQ 
Subjt:  PNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGT-LSKIAAGVFAY--RGNLSCYINEPRNATETDVGWQWQR

Query:  CSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDI
        CSEMV PI   N+TMF  + FDL+ +I  C   YG+ PRPHW TTYYGGH +
Subjt:  CSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDI

F6GW68 Uncharacterized protein0.0e+0060.98Show/hide
Query:  LHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSV
        L  S   + L SDDF+T++YNQTLDHFNYRPESY TF QRY++NFKYW GAN+SAPIFAYLGAEA +D DL  +GF  DNA+QF ALLVYIEHRYYG+S+
Subjt:  LHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSV

Query:  PFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRG
        PF SR+EAL+NAST GYFNSAQAIADYA +L ++KK+L A  SPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQ+GYY++VTKDFR 
Subjt:  PFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRG

Query:  VSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAIDGTSSVNGTLTKIAAGVFAYRGNVSC
         SE+CY TI++SWSEI+ VAS+PNGLSIL ++F+TC  L    EL+DYL +MYA AAQYNHPPRYPVT +C  IDG    +  L++I AGV AYRGN SC
Subjt:  VSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAIDGTSSVNGTLTKIAAGVFAYRGNVSC

Query:  YINEPRNETETDVGWRWQSCSEMVMPI--SSDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLH
        Y N   N TET  GWRWQ+CSEMVMPI    +D MFPP PF+L +FI  C  LY VPPRPHW+TTYYGGHD +L+L RF SNIIFSNGL+DPYS AGVL 
Subjt:  YINEPRNETETDVGWRWQSCSEMVMPI--SSDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLH

Query:  NISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE----------------------------------------------FRMPRLSRIGEKFLHH
        NIS +++A++T  GSHCLDIL A  TDPEWL+ QRKTE                                              F +PRL  +    L +
Subjt:  NISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE----------------------------------------------FRMPRLSRIGEKFLHH

Query:  SKALELPPS--DDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHR-------
         +   +  S   D KTF+Y QT+DHFNYRPESY TF QRY++NFK+WGGA + API AYLGAEAP+D  +  IGF+ DNA +FNALL+ IEHR       
Subjt:  SKALELPPS--DDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLGAEAPIDSAMNGIGFMTDNAIKFNALLVDIEHR-------

Query:  ----------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREV
                  AS  G  + +           HVKK  +A+ SPVIVIGGSYGGMLA+WF LKYPH+ LG+LASSAPILYFD+I P+ GYY++VTKDFRE 
Subjt:  ----------ASNSGLCSHS----------YHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFREV

Query:  SQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCY
        S++CY TIR SWSE +++AS+PNGLSIL K FKTC+ L SS +L++YL  +YA AAQYN PP YPVT +C  I+       TL +I  G+ A  G  SCY
Subjt:  SQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCY

Query:  INEPRN-ATETDVGWQWQRCSEMVMPIS-TDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLH
          +  N  TET +GW+WQ+CSEMV+PI    NDTMF P  F+L  FI  CN LY V PRPHWVTTYYGG DI LIL RFASNIIFSNGL+DPYS GGVL 
Subjt:  INEPRN-ATETDVGWQWQRCSEMVMPIS-TDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLH

Query:  NISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTE
        NISD+L AVY  +GSHCLDIL + K DP+WL  QRK E
Subjt:  NISDSLPAVYITNGSHCLDILSADKMDPEWLATQRKTE

SwissProt top hitse value%identityAlignment
P42785 Lysosomal Pro-X carboxypeptidase4.9e-7636.89Show/hide
Query:  PRLSPVGEKFLHHSRALNSLP--SDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALL
        P L  +G   LH      SLP  + ++   Y+ Q +DHF +   +  TF QRY++  KYW    +   I  Y G E  I    N  GF+ D A +  A+L
Subjt:  PRLSPVGEKFLHHSRALNSLP--SDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALL

Query:  VYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKEL-HANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQ
        V+ EHRYYG+S+PF   D + +++  L +  S QA+AD+A ++ HLK+ +  A   PVI IGGSYGGMLA+WFR+KYPH+ +GALA+SAPI  F+D+ P 
Subjt:  VYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKEL-HANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQ

Query:  DGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPL--QGYFELEDYLWSMYATAAQYNHP---------PRYPVTRICDAIDG
          +  +VT DFR     C E+I +SW  I  +++  +GL  L      C PL  Q    L+D++   +   A  ++P         P +P+  +C  +  
Subjt:  DGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPL--QGYFELEDYLWSMYATAAQYNHP---------PRYPVTRICDAIDG

Query:  TSSVNGTLTK---IAAGV-FAYRGNVSCY-INEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHD
         +  +  L +    A  V + Y G V C  I+E    +   +GW +Q+C+E+VMP  ++  DDMF P+ ++L      C + +GV PRP W+TT YGG +
Subjt:  TSSVNGTLTK---IAAGV-FAYRGNVSCY-INEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHD

Query:  TRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR
                 +NI+FSNG  DP+S  GV  +I+D+LVAV  + G+H LD+   N  DP  ++  R  E R
Subjt:  TRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR

Q2TA14 Lysosomal Pro-X carboxypeptidase1.6e-7136.05Show/hide
Query:  YYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYF
        Y  Q +DHF +  +   TF QRY+I   YW        I  Y G E  I    N  GF+ D A +  A+LV+ EHRYYG+S+PF +  ++  ++  L + 
Subjt:  YYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYF

Query:  NSAQAIADYATILIHLKKEL-HANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIE
         + QA+AD+A ++ +LK+ +  A    VI +GGSYGGMLA+WFR+KYPH+ +GALASSAPI  F+D+ P D +  +VT DF      C E+I++SW  I 
Subjt:  NSAQAIADYATILIHLKKEL-HANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIE

Query:  TVASQPNGLSILDQEFKTCRPL---QGYFELEDYLWSMYATAAQYNHP---------PRYPVTRIC-----DAIDGTSSVNGTLTKIAAGVFAYRGNVSC
         +A +  GL  L +    C PL   Q    L+D++   +   A  ++P         P +PV  +C       +  T  V      +    + Y G   C
Subjt:  TVASQPNGLSILDQEFKTCRPL---QGYFELEDYLWSMYATAAQYNHP---------PRYPVTRIC-----DAIDGTSSVNGTLTKIAAGVFAYRGNVSC

Query:  Y-INEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVL
          ++E    +   +GW +Q+C+EMVMP  SD  DDMF P+ +++  +   C + +GV PRP W+ T YGG +         +NIIFSNG  DP+S  GV 
Subjt:  Y-INEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVL

Query:  HNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR
         +I+D+L+A+    G+H LD+  +N  DP  +   R  E +
Subjt:  HNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR

Q5RBU7 Lysosomal Pro-X carboxypeptidase2.2e-7636.89Show/hide
Query:  PRLSPVGEKFLHHSRALNSLP--SDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALL
        P L  +G   LH      SLP  + ++   Y+ Q +DHF +   +  TF QRY++  KYW    +   I  Y G E  I    N  GF+ D A +  A+L
Subjt:  PRLSPVGEKFLHHSRALNSLP--SDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALL

Query:  VYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKEL-HANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQ
        V+ EHRYYG+S+PF   D   +++  L +  S QA+AD+A ++ HLK+ +  A   PVI IGGSYGGMLA+WFR+KYPH+ +GALA+SAPI  F+D+ P 
Subjt:  VYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKEL-HANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQ

Query:  DGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPL--QGYFELEDYLWSMYATAAQYNHP---------PRYPVTRICDAIDG
          +  +VT DFR     C E+I++SW  I  +++  +GL  L      C PL  Q    L+D++   +   A  ++P         P +P+  +C  +  
Subjt:  DGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPL--QGYFELEDYLWSMYATAAQYNHP---------PRYPVTRICDAIDG

Query:  TSSVNGTLTK---IAAGV-FAYRGNVSCY-INEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHD
         +  +  L +    A  V + Y G V C  I+E    +   +GW +Q+C+E+VMP  ++  DDMF P+ ++L      C + +GV PRP W+TT YGG +
Subjt:  TSSVNGTLTK---IAAGV-FAYRGNVSCY-INEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHD

Query:  TRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR
                 +NI+FSNG  DP+S  GV  +I+D+LVAV  + G+H LD+   N  DP  ++  R  E R
Subjt:  TRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR

Q7TMR0 Lysosomal Pro-X carboxypeptidase9.9e-7735.19Show/hide
Query:  IPFLLFVLSTSVTALHYRSPRLSPVGEKFLHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDL
        + FLL   +T++       PRL  +G   L  S   +   +  +   Y+ Q +DHF +      TF QRY++  K+W    +   I  Y G E  I    
Subjt:  IPFLLFVLSTSVTALHYRSPRLSPVGEKFLHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDL

Query:  NVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKEL-HANYSPVIVIGGSYGGMLASWFRLKYPHVALG
        N  GF+ D A +  A+LV+ EHRYYG+S+PF    ++ +++  L +  S QA+AD+A ++ HL+K +  A   PVI IGGSYGGMLA+WFR+KYPH+ +G
Subjt:  NVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKEL-HANYSPVIVIGGSYGGMLASWFRLKYPHVALG

Query:  ALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPL--QGYFELEDYLWSMYATAAQYNHP------
        ALA+SAPI   D + P   +  +VT DFR     C E+I+KSW+ I+ ++   +GL  L      C PL  +    L+ ++   +   A  N+P      
Subjt:  ALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPL--QGYFELEDYLWSMYATAAQYNHP------

Query:  ---PRYPVTRICDAIDGTSSVNGTLT----KIAAGVFAYRGNVSCY-INEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLY
           P +P+  +C  +   +  +  L     +  +  + Y G  +C  I++    +   +GW +Q+C+EMVMP  ++  DDMF P+ +DL  + + C   +
Subjt:  ---PRYPVTRICDAIDGTSSVNGTLT----KIAAGVFAYRGNVSCY-INEPRNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLY

Query:  GVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR
        GV PRPHW+TT YGG +         SNIIFSNG  DP+S  GV  +I+D+LVA+    G+H LD+   N  DP  ++  R  E +
Subjt:  GVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFR

Q9UHL4 Dipeptidyl peptidase 26.2e-6332.53Show/hide
Query:  MFSSPWIPFLLFVLSTSVTALHYRSPRLSPVGEKFLHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEA
        M S+PW P LL  L+  +  L   + R                  P   F+  ++ Q LDHFN+      TFPQR++++ ++WV      PIF Y G E 
Subjt:  MFSSPWIPFLLFVLSTSVTALHYRSPRLSPVGEKFLHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEA

Query:  PIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYP
         +    N   F+ + A +  ALLV+ EHRYYGKS+PF ++     +   L      QA+AD+A +L  L+++L A  +P I  GGSYGGML+++ R+KYP
Subjt:  PIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYP

Query:  HVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELED---YLWSMYATAAQYNHP
        H+  GALA+SAP+L    +   + ++  VT DF G S  C + +++++ +I+ +  Q      +  EF TC+PL    +L     +  + +   A  ++P
Subjt:  HVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELED---YLWSMYATAAQYNHP

Query:  ---------PRYPVTRICDAIDGTSSVNGTLTKIAAGVFAYRGNVSCY-----INEPRNETETDVG-----WRWQSCSEMVMPISSDD--DMFPPYPFDL
                 P  PV   CD +   +     L  +A  V+   G+  CY      +   + T    G     W +Q+C+E+ +  +S++  DMFP  PF  
Subjt:  ---------PRYPVTRICDAIDGTSSVNGTLTKIAAGVFAYRGNVSCY-----INEPRNETETDVG-----WRWQSCSEMVMPISSDD--DMFPPYPFDL

Query:  GSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE
             YC   +GV PRP W+ T + G D R       SNIIFSNG  DP++  G+  N+S S++AV    G+H LD+  ++  DP  +V  RK E
Subjt:  GSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE

Arabidopsis top hitse value%identityAlignment
AT1G08220.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: mitochondrial proton-transporting ATP synthase complex assembly; LOCATED IN: mitochondrial inner membrane; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 7 growth stages; CONTAINS InterPro DOMAIN/s: ATPase assembly factor ATP10, mitochondria (InterPro:IPR007849); Has 168 Blast hits to 168 proteins in 86 species: Archae - 6; Bacteria - 0; Metazoa - 2; Fungi - 107; Plants - 30; Viruses - 0; Other Eukaryotes - 23 (source: NCBI BLink).1.7e-7662.72Show/hide
Query:  FFGCLQLGNKTAIEKERARLADEMNRGYFADISELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASILPMATLLCLS
        F    + GNK AIE ERARL DEMNRGYFAD+ E K+HGGKIAAANK +IPA +A+KFP   V++S+GK+LKLPI  ++N V+  + + ++P  +L+CLS
Subjt:  FFGCLQLGNKTAIEKERARLADEMNRGYFADISELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDANVVEGNSSASILPMATLLCLS

Query:  FRANSQAMIDSWSAPFLNAFSSSNNVQLYEVSFIDQWLLCRNPIKKVLLRLMRKSKGNARNDSLQRQIVYSFGDHYYFRKELKILNLLTGYIFLLDKFGR
        FRA+SQ MI SWS PFL +F +  ++QL+EVSFID+WLL   PI+K+LLR+++K   N  N  LQRQ+ Y+FGDHYYFRKE+K+LNLLTGYI LLDK GR
Subjt:  FRANSQAMIDSWSAPFLNAFSSSNNVQLYEVSFIDQWLLCRNPIKKVLLRLMRKSKGNARNDSLQRQIVYSFGDHYYFRKELKILNLLTGYIFLLDKFGR

Query:  IRWQGFGLATQEEVSSLLSCASLLLEEK
        IRWQGFG AT EEVS LLSC SLLLE++
Subjt:  IRWQGFGLATQEEVSSLLSCASLLLEEK

AT2G24280.1 alpha/beta-Hydrolases superfamily protein4.6e-11445.74Show/hide
Query:  SRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFR
        S++ + LP   F+T Y+ Q LDHF++ P+SY  F Q+Y+IN ++W       PIF Y G E  ID   +  GF+ D A +F ALLV+IEHR+YG+S PF 
Subjt:  SRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFR

Query:  SRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSE
         +    ++A TLGY NS QA+ADYA ++  LK+ L +  SPV+V GGSYGGMLA+WFRLKYPH+ +GALASSAPIL+FD+I P   +Y  +++DF+  S 
Subjt:  SRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSE

Query:  TCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHP---------PRYPVTRICDAIDGTSSVNGTLTKIAAGV---
         C++ IK+SW E+E V++  NGL  L ++F+TC+ L   +   D+L   +   A  N+P         P YPV ++C  IDG    +  L +  A     
Subjt:  TCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHP---------PRYPVTRICDAIDGTSSVNGTLTKIAAGV---

Query:  FAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPIS-SDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDP
        + Y G+  C+  E + +     GW++Q+C+EMVMP+S S+  M PPY  D  +F   C   YGV PRPHW+TT +GG     VL+RFGSNIIFSNG++DP
Subjt:  FAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPIS-SDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDP

Query:  YSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE
        +S  GVL NIS S+VA+ T  G+H  D+  A + DPEWL  QR+ E
Subjt:  YSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTE

AT5G22860.1 Serine carboxypeptidase S28 family protein1.5e-15254.24Show/hide
Query:  SSPWIPFLLFVLSTSVTAL----HYRSPRLSPVGEKFLHH--SRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYL
        S P+   +LF+ STS + L    H +  RL  +  K L +    +   +   + K YY+NQTLDHF + PESY TF QRY I+  +W GA ++API A+L
Subjt:  SSPWIPFLLFVLSTSVTAL----HYRSPRLSPVGEKFLHH--SRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYL

Query:  GAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFR
        G E+ +D DL  IGFL DN  + NALLVYIEHRYYG+++PF S +EAL+NASTLGY N+AQA+ADYA IL+H+K++   N+SP+IVIGGSYGGMLA+WFR
Subjt:  GAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFR

Query:  LKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNH
        LKYPH+ALGALASSAP+LYF+D  P+ GYY +VTK F+  SE CY TI+ SW EI+ VA +PNGLSIL ++FKTC PL G F+++D+L ++YA A QYN 
Subjt:  LKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNH

Query:  PPRYPVTRICDAIDGT--SSVNGTLTKIAAGVFAYRGNVSCYINEP-RNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPP
         P + V ++C+AI+    +     L +I AGV A  GN +CY  +     T  ++ WRWQSCSE+VMP+  D  D MFP  PF++ S+I  C   +GV P
Subjt:  PPRYPVTRICDAIDGT--SSVNGTLTKIAAGVFAYRGNVSCYINEP-RNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPP

Query:  RPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRM
        RPHW+TTY+G  + +L+LQ+FGSNIIFSNGL DPYS+ GVL +ISD+LVA+ T  GSHCLDI   ++ DPEWLV QR+ E ++
Subjt:  RPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRM

AT5G22860.2 Serine carboxypeptidase S28 family protein3.9e-13753.99Show/hide
Query:  SSPWIPFLLFVLSTSVTAL----HYRSPRLSPVGEKFLHH--SRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYL
        S P+   +LF+ STS + L    H +  RL  +  K L +    +   +   + K YY+NQTLDHF + PESY TF QRY I+  +W GA ++API A+L
Subjt:  SSPWIPFLLFVLSTSVTAL----HYRSPRLSPVGEKFLHH--SRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYL

Query:  GAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFR
        G E+ +D DL  IGFL DN  + NALLVYIEHRYYG+++PF S +EAL+NASTLGY N+AQA+ADYA IL+H+K++   N+SP+IVIGGSYGGMLA+WFR
Subjt:  GAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFR

Query:  LKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNH
        LKYPH+ALGALASSAP+LYF+D  P+ GYY +VTK F+  SE CY TI+ SW EI+ VA +PNGLSIL ++FKTC PL G F+++D+L ++YA A QYN 
Subjt:  LKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNH

Query:  PPRYPVTRICDAIDGT--SSVNGTLTKIAAGVFAYRGNVSCYINEP-RNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPP
         P + V ++C+AI+    +     L +I AGV A  GN +CY  +     T  ++ WRWQSCSE+VMP+  D  D MFP  PF++ S+I  C   +GV P
Subjt:  PPRYPVTRICDAIDGT--SSVNGTLTKIAAGVFAYRGNVSCYINEP-RNETETDVGWRWQSCSEMVMPISSD--DDMFPPYPFDLGSFISYCNRLYGVPP

Query:  RPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAG
        RPHW+TTY+G  + +L+LQ+FGSNIIFSNGL DPYS+ G
Subjt:  RPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAG

AT5G65760.1 Serine carboxypeptidase S28 family protein5.9e-10945.1Show/hide
Query:  FKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNAST
        ++T +++Q LDHF++       F QRY+IN  +W+GA++  PIF Y G E  I+      GF+ D A +F ALLV+ EHRYYG+S+P+ SR+EA +NA+T
Subjt:  FKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDLNVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNAST

Query:  LGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWS
        L Y  + QA+AD+A  +  LK+ L A   PV++ GGSYGGMLA+W RLKYPH+A+GALASSAPIL F+D+ P + +Y + + DF+  S +C+ TIK SW 
Subjt:  LGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYTVVTKDFRGVSETCYETIKKSWS

Query:  EIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHP---------PRYPVTRICDAIDGTSSVNGTLTKIAAGV---FAYRGNVSCYI
         I     + NGL  L + F  CR L    +L D+L S Y+  A  ++P         P +P+  +C  IDG  S    L +I AG+   + Y GNV C+ 
Subjt:  EIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHP---------PRYPVTRICDAIDGTSSVNGTLTKIAAGV---FAYRGNVSCYI

Query:  NEPRNETETDVGWRWQSCSEMVMPISS--DDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNI
         +  ++     GW WQ+C+EMVMP+SS  ++ MFP Y F+  S+   C   + V PRP WVTT +GGHD    L+ FGSNIIFSNGL DP+S   VL N+
Subjt:  NEPRNETETDVGWRWQSCSEMVMPISS--DDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNI

Query:  SDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRM
        SD++VA+ T  G+H LD+  +   DP+WLV QR+ E R+
Subjt:  SDSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTTTCCAATGTTTTCTTCCCCATGGATTCCTTTTTTACTTTTTGTTCTTTCAACCTCTGTTACTGCTTTGCATTATAGATCCCCACGGCTTAGTCCTGTTGGTGA
AAAGTTTCTACATCATTCTCGAGCTCTGAATTCACTTCCTTCGGATGATTTCAAGACTTATTATTACAATCAAACACTTGATCATTTCAATTATAGGCCTGAAAGCTACA
CAACATTCCCTCAAAGATATATAATCAACTTCAAGTACTGGGTTGGTGCGAATTCGAGCGCGCCGATTTTTGCTTACTTGGGCGCTGAAGCACCAATAGATGATGATTTA
AATGTTATTGGGTTTTTGACAGATAATGCCATTCAGTTTAATGCTCTTCTAGTTTATATTGAGCATCGGTACTATGGGAAATCAGTACCATTTAGATCAAGGGATGAAGC
ATTGAGAAATGCAAGCACTCTTGGATACTTTAATTCAGCACAAGCAATAGCAGATTATGCAACCATTCTTATACATTTAAAAAAGGAGCTTCATGCTAATTATTCTCCTG
TGATTGTTATTGGTGGATCATATGGAGGAATGTTGGCTTCATGGTTCCGTCTTAAATACCCTCATGTAGCATTAGGAGCTCTTGCATCTTCAGCTCCAATTCTTTATTTC
GATGATATCACACCGCAGGATGGATACTATACTGTTGTCACCAAGGATTTCAGAGGTGTCAGTGAAACTTGCTATGAAACTATTAAGAAATCGTGGTCAGAGATTGAAAC
AGTTGCTTCTCAGCCAAACGGTCTGTCCATTCTGGACCAAGAGTTCAAAACATGCCGTCCTTTACAAGGATACTTTGAGTTGGAAGACTACTTGTGGTCCATGTATGCTA
CTGCAGCCCAATATAACCACCCGCCTAGATATCCAGTCACCAGGATCTGTGATGCCATTGATGGAACTTCTTCTGTGAATGGAACACTTACCAAAATAGCTGCAGGTGTA
TTTGCTTACAGAGGGAATGTATCCTGTTATATTAATGAGCCCAGAAATGAAACAGAAACTGATGTAGGATGGAGATGGCAGTCATGCAGTGAGATGGTGATGCCAATAAG
TTCAGATGATGATATGTTTCCACCGTACCCTTTTGACCTTGGAAGTTTCATCAGTTATTGCAATAGATTGTATGGTGTTCCTCCCAGACCCCATTGGGTTACCACCTACT
ATGGAGGCCATGATACACGGCTCGTCCTTCAGAGATTTGGTAGCAATATAATTTTTTCCAATGGACTCAAAGATCCTTATAGTATTGCTGGGGTATTGCACAACATATCA
GACAGTCTCGTTGCGGTGTATACAACTTATGGATCTCATTGCTTGGACATCTTAAAGGCGAATGAAACTGATCCTGAATGGTTAGTGACACAGAGAAAGACTGAGTTTAG
AATGCCAAGGCTTAGTCGTATTGGTGAAAAGTTTCTACATCATTCTAAAGCTCTGGAATTACCGCCTTCTGATGATTTCAAGACATTTTATTACAATCAAACAATTGATC
ATTTCAACTATAGGCCTGAAAGCTACACAACATTTCTTCAAAGATATATAATCAACTTCAAGTACTGGGGCGGTGCAAATTCGAGCGCTCCAATTCTTGCTTACTTGGGT
GCTGAAGCCCCAATAGATTCTGCTATGAATGGTATTGGGTTTATGACAGATAACGCCATCAAGTTCAATGCTCTTCTAGTTGATATTGAGCATCGCGCAAGCAATAGCGG
ATTATGCAGCCATTCTTATCATGTAAAAAAGGAGTTTAATGCTAAGTATTCTCCTGTGATTGTTATTGGTGGATCATATGGAGGAATGTTGGCTACATGGTTTCTTCTTA
AATATCCTCATGTGCCACTAGGATCTCTAGCATCTTCTGCTCCAATTCTTTACTTTGATGATATCACACCGCAAAATGGATACTATGCTGTTGTCACCAAGGATTTTAGA
GAAGTCAGTCAGACTTGCTATGAAACTATTAGGGAGTCATGGTCTGAAACAGAAAAAGTTGCCTCTCAACCCAATGGCCTTTCCATTCTTGACAAAGAGTTCAAAACATG
CAGTCCTTTAAGAAGTTCCACACAGCTGGAAAACTACTTGTGGTTCATGTATGCTAGTGCAGCCCAATACAACCACCCGCCAAGATATCCAGTCACCAGGATCTGTGGTG
CCATTGATCGAACTTTTTCTGGAAATGGAACACTTAGCAAAATAGCTGCAGGGGTATTTGCTTATAGAGGGAATCTCTCCTGTTATATTAATGAGCCTAGAAATGCAACC
GAAACTGATGTAGGATGGCAATGGCAGAGATGCAGTGAAATGGTGATGCCAATAAGTACAGACAATGATACTATGTTTCCACCTCGCACGTTTGATCTCGAAAGCTTCAT
CATTTACTGCAATCAATTATATGGCGTTCCTCCTAGGCCTCATTGGGTTACCACCTATTATGGAGGCCATGATATATATCTCATCCTTCAGAGATTTGCTAGCAACATTA
TCTTTTCCAATGGACTCAAAGATCCTTATAGCATTGGCGGGGTATTACACAATATTTCAGACAGTCTCCCAGCAGTGTATATAACTAATGGGTCTCATTGCTTAGACATC
CTAAGTGCGGATAAAATGGACCCGGAATGGTTGGCTACACAAAGGAAGACCGAGTGTCTTTATCTAGCACTTGAAGCATGTTATGCAGATGAAAGCTCTACAACTTTCTT
GCCTGCAATCAAATCAAGTCAACCATTTCATTGGCTTCAAAGCGGTGCTTATGATGATTTCAAGACATTTTATTATAATCAAACACTGGATCACTTCAACTATAGGCCTG
AAAGCTACACATGCTTCCCTCATAGATATATAATCAATTTTAAGTATTGGGGTGGTGCAAATTCTAGCGCTCCGATTCTTGCCTACTTGGGAGCCGAAGGTCCACTAGAA
GGCGATTTGAACACTATAGGGTTCATGACTGATAATGCTGTTCAATTTAATGCTCTTCTTGTTTATATCGAGCACCGATATTATGGGAAATCAATACCTTTTGGATCAAG
GAAAGAGGCACTTAAGAATGCAAGCACTCTAGGTTATTTCAACTCAGCTCAAGCAATAGCAGATTATGCAGCTGTTCTTATACATTTAAAAAAGAAGTTTCATGCTAAAG
ATTCTCCTGTAATTGTCCTTGGTGGATCATATGGAGGAATGTTGGCTGCATGGTTCCGTCTTAAATATCCTCATGTGGCACTAGGAGCTCTTGCATCTTCGGCTCCAGTT
CTATACTTCGACAATATCACGCCACATAATGGATACTACTCTATTGCCACCAAGGATTTTAGAGAAGTTAGTGAGACTTGCTACAGAACTATTCGGGATTCGTGGTCTGA
AATTGAAAGAATTGCTTCTAAGCCTAATGGCCTTTCCATTCTTAGCAAAGAGTTCAAAGCATGCAGTCCTTTGAATAGTTCCTCTGAGCTGGAAGACTACTTGTGGTCTA
TGTATGCGGGTGCAGCCCAATACAACCACCCACCAAGATATCCGGTCACTAGAATCTGTGGTGGCATTGATGGAGCTTATTCTGGAAGTGGAACTATTAGCAAAATAGCT
TCAGGTGTATTTGCTTATAAAGGAAATCTATCCTGCTACAATCTTGAACCAAGAAACGATACTGAAACCGATGTAGGATGGAGATGGCAGAGATGCAGTGAGATGGTGAT
GCCAATAAGCACAGGCAATGATACTATGTTTCCTCCAGACACTTTTGACCTTAGAAGCTTCATAGATTACTGCTATCAGTCATACGGCGTTTCTCCGAGGCCTCACTGGG
TCACCACCTATTATGGAGGCAATGTATGGTCCCATTGTTTGGACATTTTACGGGCAAACGAAACCGATCCACAATGGCTAGTGAAACAAAGAGAGACAGAGGGACAAGCA
GATACTATAGGCATTGCAATGCAAAAAGCTTTGCCAGAGAAATGGAGGGAATATGTATCACTATCGCTTGTGCAAAATTTGAAAAAAATCCCACATGAATACTTTGCAAT
AATACTCAAGTCAAGAAGAATACTAACAGGGCCCAGTAGTAGTCCGGTATTACACATCAACATCCAGCCACCGGTAGACAATATGAAGAAGCAGGTCATTCCAGCGGAGA
TCAGAGATGACCGTACAGTTCTCCGATTAGGGTTCATCGGAGGCTCAACAGAAACTCTCTCTAGTTACAGGGAAAAATTCCTTATTTTTCCTTCCCACCATTTAGCTCAG
CTGACTTCCAATCGCTTCCTCGACATTTATCAGATTCTTCTTAATGCAGTTGTATTCGGTTACAAAGGGGTAATGTTTCGGATTAAGATCCATTTTTTCGGATGTTTGCA
GCTTGGAAACAAAACAGCCATTGAGAAAGAGCGCGCTCGGCTTGCAGATGAAATGAATAGAGGATACTTTGCTGATATTTCAGAGCTAAAGCAACATGGCGGAAAGATTG
CAGCAGCTAACAAGATTCTAATTCCGGCTATGGCTGCTGTAAAATTTCCGGAGTTTGAAGTGAGTTATTCTGATGGTAAAACGTTGAAGCTACCGATTAAATTTGATGCT
AATGTGGTAGAAGGCAATAGTTCGGCATCGATCTTGCCAATGGCCACATTACTGTGTCTTTCTTTCAGAGCTAACTCCCAGGCCATGATTGATTCTTGGAGTGCCCCTTT
TCTCAATGCCTTTTCTAGTTCAAACAATGTCCAGTTATATGAGGTTTCATTTATAGATCAGTGGCTCTTATGTCGAAACCCAATTAAGAAAGTGCTTCTTCGGCTAATGA
GGAAGTCCAAAGGCAATGCACGGAATGATTCACTTCAAAGGCAGATTGTATACTCGTTTGGCGACCATTATTACTTCAGAAAGGAGCTAAAAATATTAAATCTTCTCACT
GGGTATATATTCCTGCTTGACAAATTTGGTAGAATAAGATGGCAAGGCTTTGGATTGGCAACTCAAGAGGAGGTGTCATCTCTTCTTTCATGCGCGTCACTTCTTTTGGA
AGAAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATAACAAGCACCCTGGCATGAGGTTTCCAATGTTTTCTTCCCCATGGATTCCTTTTTTACTTTTTGTTCTTTCAACCTCTGTTACTGCTTTGCATTATAGATCCCCACGG
CTTAGTCCTGTTGGTGAAAAGTTTCTACATCATTCTCGAGCTCTGAATTCACTTCCTTCGGATGATTTCAAGACTTATTATTACAATCAAACACTTGATCATTTCAATTA
TAGGCCTGAAAGCTACACAACATTCCCTCAAAGATATATAATCAACTTCAAGTACTGGGTTGGTGCGAATTCGAGCGCGCCGATTTTTGCTTACTTGGGCGCTGAAGCAC
CAATAGATGATGATTTAAATGTTATTGGGTTTTTGACAGATAATGCCATTCAGTTTAATGCTCTTCTAGTTTATATTGAGCATCGGTACTATGGGAAATCAGTACCATTT
AGATCAAGGGATGAAGCATTGAGAAATGCAAGCACTCTTGGATACTTTAATTCAGCACAAGCAATAGCAGATTATGCAACCATTCTTATACATTTAAAAAAGGAGCTTCA
TGCTAATTATTCTCCTGTGATTGTTATTGGTGGATCATATGGAGGAATGTTGGCTTCATGGTTCCGTCTTAAATACCCTCATGTAGCATTAGGAGCTCTTGCATCTTCAG
CTCCAATTCTTTATTTCGATGATATCACACCGCAGGATGGATACTATACTGTTGTCACCAAGGATTTCAGAGGTGTCAGTGAAACTTGCTATGAAACTATTAAGAAATCG
TGGTCAGAGATTGAAACAGTTGCTTCTCAGCCAAACGGTCTGTCCATTCTGGACCAAGAGTTCAAAACATGCCGTCCTTTACAAGGATACTTTGAGTTGGAAGACTACTT
GTGGTCCATGTATGCTACTGCAGCCCAATATAACCACCCGCCTAGATATCCAGTCACCAGGATCTGTGATGCCATTGATGGAACTTCTTCTGTGAATGGAACACTTACCA
AAATAGCTGCAGGTGTATTTGCTTACAGAGGGAATGTATCCTGTTATATTAATGAGCCCAGAAATGAAACAGAAACTGATGTAGGATGGAGATGGCAGTCATGCAGTGAG
ATGGTGATGCCAATAAGTTCAGATGATGATATGTTTCCACCGTACCCTTTTGACCTTGGAAGTTTCATCAGTTATTGCAATAGATTGTATGGTGTTCCTCCCAGACCCCA
TTGGGTTACCACCTACTATGGAGGCCATGATACACGGCTCGTCCTTCAGAGATTTGGTAGCAATATAATTTTTTCCAATGGACTCAAAGATCCTTATAGTATTGCTGGGG
TATTGCACAACATATCAGACAGTCTCGTTGCGGTGTATACAACTTATGGATCTCATTGCTTGGACATCTTAAAGGCGAATGAAACTGATCCTGAATGGTTAGTGACACAG
AGAAAGACTGAGTTTAGAATGCCAAGGCTTAGTCGTATTGGTGAAAAGTTTCTACATCATTCTAAAGCTCTGGAATTACCGCCTTCTGATGATTTCAAGACATTTTATTA
CAATCAAACAATTGATCATTTCAACTATAGGCCTGAAAGCTACACAACATTTCTTCAAAGATATATAATCAACTTCAAGTACTGGGGCGGTGCAAATTCGAGCGCTCCAA
TTCTTGCTTACTTGGGTGCTGAAGCCCCAATAGATTCTGCTATGAATGGTATTGGGTTTATGACAGATAACGCCATCAAGTTCAATGCTCTTCTAGTTGATATTGAGCAT
CGCGCAAGCAATAGCGGATTATGCAGCCATTCTTATCATGTAAAAAAGGAGTTTAATGCTAAGTATTCTCCTGTGATTGTTATTGGTGGATCATATGGAGGAATGTTGGC
TACATGGTTTCTTCTTAAATATCCTCATGTGCCACTAGGATCTCTAGCATCTTCTGCTCCAATTCTTTACTTTGATGATATCACACCGCAAAATGGATACTATGCTGTTG
TCACCAAGGATTTTAGAGAAGTCAGTCAGACTTGCTATGAAACTATTAGGGAGTCATGGTCTGAAACAGAAAAAGTTGCCTCTCAACCCAATGGCCTTTCCATTCTTGAC
AAAGAGTTCAAAACATGCAGTCCTTTAAGAAGTTCCACACAGCTGGAAAACTACTTGTGGTTCATGTATGCTAGTGCAGCCCAATACAACCACCCGCCAAGATATCCAGT
CACCAGGATCTGTGGTGCCATTGATCGAACTTTTTCTGGAAATGGAACACTTAGCAAAATAGCTGCAGGGGTATTTGCTTATAGAGGGAATCTCTCCTGTTATATTAATG
AGCCTAGAAATGCAACCGAAACTGATGTAGGATGGCAATGGCAGAGATGCAGTGAAATGGTGATGCCAATAAGTACAGACAATGATACTATGTTTCCACCTCGCACGTTT
GATCTCGAAAGCTTCATCATTTACTGCAATCAATTATATGGCGTTCCTCCTAGGCCTCATTGGGTTACCACCTATTATGGAGGCCATGATATATATCTCATCCTTCAGAG
ATTTGCTAGCAACATTATCTTTTCCAATGGACTCAAAGATCCTTATAGCATTGGCGGGGTATTACACAATATTTCAGACAGTCTCCCAGCAGTGTATATAACTAATGGGT
CTCATTGCTTAGACATCCTAAGTGCGGATAAAATGGACCCGGAATGGTTGGCTACACAAAGGAAGACCGAGTGTCTTTATCTAGCACTTGAAGCATGTTATGCAGATGAA
AGCTCTACAACTTTCTTGCCTGCAATCAAATCAAGTCAACCATTTCATTGGCTTCAAAGCGGTGCTTATGATGATTTCAAGACATTTTATTATAATCAAACACTGGATCA
CTTCAACTATAGGCCTGAAAGCTACACATGCTTCCCTCATAGATATATAATCAATTTTAAGTATTGGGGTGGTGCAAATTCTAGCGCTCCGATTCTTGCCTACTTGGGAG
CCGAAGGTCCACTAGAAGGCGATTTGAACACTATAGGGTTCATGACTGATAATGCTGTTCAATTTAATGCTCTTCTTGTTTATATCGAGCACCGATATTATGGGAAATCA
ATACCTTTTGGATCAAGGAAAGAGGCACTTAAGAATGCAAGCACTCTAGGTTATTTCAACTCAGCTCAAGCAATAGCAGATTATGCAGCTGTTCTTATACATTTAAAAAA
GAAGTTTCATGCTAAAGATTCTCCTGTAATTGTCCTTGGTGGATCATATGGAGGAATGTTGGCTGCATGGTTCCGTCTTAAATATCCTCATGTGGCACTAGGAGCTCTTG
CATCTTCGGCTCCAGTTCTATACTTCGACAATATCACGCCACATAATGGATACTACTCTATTGCCACCAAGGATTTTAGAGAAGTTAGTGAGACTTGCTACAGAACTATT
CGGGATTCGTGGTCTGAAATTGAAAGAATTGCTTCTAAGCCTAATGGCCTTTCCATTCTTAGCAAAGAGTTCAAAGCATGCAGTCCTTTGAATAGTTCCTCTGAGCTGGA
AGACTACTTGTGGTCTATGTATGCGGGTGCAGCCCAATACAACCACCCACCAAGATATCCGGTCACTAGAATCTGTGGTGGCATTGATGGAGCTTATTCTGGAAGTGGAA
CTATTAGCAAAATAGCTTCAGGTGTATTTGCTTATAAAGGAAATCTATCCTGCTACAATCTTGAACCAAGAAACGATACTGAAACCGATGTAGGATGGAGATGGCAGAGA
TGCAGTGAGATGGTGATGCCAATAAGCACAGGCAATGATACTATGTTTCCTCCAGACACTTTTGACCTTAGAAGCTTCATAGATTACTGCTATCAGTCATACGGCGTTTC
TCCGAGGCCTCACTGGGTCACCACCTATTATGGAGGCAATGTATGGTCCCATTGTTTGGACATTTTACGGGCAAACGAAACCGATCCACAATGGCTAGTGAAACAAAGAG
AGACAGAGGGACAAGCAGATACTATAGGCATTGCAATGCAAAAAGCTTTGCCAGAGAAATGGAGGGAATATGTATCACTATCGCTTGTGCAAAATTTGAAAAAAATCCCA
CATGAATACTTTGCAATAATACTCAAGTCAAGAAGAATACTAACAGGGCCCAGTAGTAGTCCGGTATTACACATCAACATCCAGCCACCGGTAGACAATATGAAGAAGCA
GGTCATTCCAGCGGAGATCAGAGATGACCGTACAGTTCTCCGATTAGGGTTCATCGGAGGCTCAACAGAAACTCTCTCTAGTTACAGGGAAAAATTCCTTATTTTTCCTT
CCCACCATTTAGCTCAGCTGACTTCCAATCGCTTCCTCGACATTTATCAGATTCTTCTTAATGCAGTTGTATTCGGTTACAAAGGGGTAATGTTTCGGATTAAGATCCAT
TTTTTCGGATGTTTGCAGCTTGGAAACAAAACAGCCATTGAGAAAGAGCGCGCTCGGCTTGCAGATGAAATGAATAGAGGATACTTTGCTGATATTTCAGAGCTAAAGCA
ACATGGCGGAAAGATTGCAGCAGCTAACAAGATTCTAATTCCGGCTATGGCTGCTGTAAAATTTCCGGAGTTTGAAGTGAGTTATTCTGATGGTAAAACGTTGAAGCTAC
CGATTAAATTTGATGCTAATGTGGTAGAAGGCAATAGTTCGGCATCGATCTTGCCAATGGCCACATTACTGTGTCTTTCTTTCAGAGCTAACTCCCAGGCCATGATTGAT
TCTTGGAGTGCCCCTTTTCTCAATGCCTTTTCTAGTTCAAACAATGTCCAGTTATATGAGGTTTCATTTATAGATCAGTGGCTCTTATGTCGAAACCCAATTAAGAAAGT
GCTTCTTCGGCTAATGAGGAAGTCCAAAGGCAATGCACGGAATGATTCACTTCAAAGGCAGATTGTATACTCGTTTGGCGACCATTATTACTTCAGAAAGGAGCTAAAAA
TATTAAATCTTCTCACTGGGTATATATTCCTGCTTGACAAATTTGGTAGAATAAGATGGCAAGGCTTTGGATTGGCAACTCAAGAGGAGGTGTCATCTCTTCTTTCATGC
GCGTCACTTCTTTTGGAAGAAAAATGAGAAATTATCCTCCTCCATAAAATGATTCACTTCCAGAAGTTTTGTTCGACGACGATGCAAGGAAAATTAGCAAATACTAGTAT
CTATGAATAAAGATATAATTTTCAAATGGTGTGAATTAACACCAAGGAAAGATTTGTGCGAGCTTAATTTCCTGAAAGCAACTATTTCACATATATATAATATCAAGTCG
ACAATGACTGATTTTGTTTTTGTTTGTATTTTTTCTTCAATAGTTGCGGAGTCGGGTCGAATTATCGATTTTTGGAATGATAATTGGTGGCTTATTTGAAATC
Protein sequenceShow/hide protein sequence
MRFPMFSSPWIPFLLFVLSTSVTALHYRSPRLSPVGEKFLHHSRALNSLPSDDFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWVGANSSAPIFAYLGAEAPIDDDL
NVIGFLTDNAIQFNALLVYIEHRYYGKSVPFRSRDEALRNASTLGYFNSAQAIADYATILIHLKKELHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYF
DDITPQDGYYTVVTKDFRGVSETCYETIKKSWSEIETVASQPNGLSILDQEFKTCRPLQGYFELEDYLWSMYATAAQYNHPPRYPVTRICDAIDGTSSVNGTLTKIAAGV
FAYRGNVSCYINEPRNETETDVGWRWQSCSEMVMPISSDDDMFPPYPFDLGSFISYCNRLYGVPPRPHWVTTYYGGHDTRLVLQRFGSNIIFSNGLKDPYSIAGVLHNIS
DSLVAVYTTYGSHCLDILKANETDPEWLVTQRKTEFRMPRLSRIGEKFLHHSKALELPPSDDFKTFYYNQTIDHFNYRPESYTTFLQRYIINFKYWGGANSSAPILAYLG
AEAPIDSAMNGIGFMTDNAIKFNALLVDIEHRASNSGLCSHSYHVKKEFNAKYSPVIVIGGSYGGMLATWFLLKYPHVPLGSLASSAPILYFDDITPQNGYYAVVTKDFR
EVSQTCYETIRESWSETEKVASQPNGLSILDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPPRYPVTRICGAIDRTFSGNGTLSKIAAGVFAYRGNLSCYINEPRNAT
ETDVGWQWQRCSEMVMPISTDNDTMFPPRTFDLESFIIYCNQLYGVPPRPHWVTTYYGGHDIYLILQRFASNIIFSNGLKDPYSIGGVLHNISDSLPAVYITNGSHCLDI
LSADKMDPEWLATQRKTECLYLALEACYADESSTTFLPAIKSSQPFHWLQSGAYDDFKTFYYNQTLDHFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLE
GDLNTIGFMTDNAVQFNALLVYIEHRYYGKSIPFGSRKEALKNASTLGYFNSAQAIADYAAVLIHLKKKFHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPV
LYFDNITPHNGYYSIATKDFREVSETCYRTIRDSWSEIERIASKPNGLSILSKEFKACSPLNSSSELEDYLWSMYAGAAQYNHPPRYPVTRICGGIDGAYSGSGTISKIA
SGVFAYKGNLSCYNLEPRNDTETDVGWRWQRCSEMVMPISTGNDTMFPPDTFDLRSFIDYCYQSYGVSPRPHWVTTYYGGNVWSHCLDILRANETDPQWLVKQRETEGQA
DTIGIAMQKALPEKWREYVSLSLVQNLKKIPHEYFAIILKSRRILTGPSSSPVLHINIQPPVDNMKKQVIPAEIRDDRTVLRLGFIGGSTETLSSYREKFLIFPSHHLAQ
LTSNRFLDIYQILLNAVVFGYKGVMFRIKIHFFGCLQLGNKTAIEKERARLADEMNRGYFADISELKQHGGKIAAANKILIPAMAAVKFPEFEVSYSDGKTLKLPIKFDA
NVVEGNSSASILPMATLLCLSFRANSQAMIDSWSAPFLNAFSSSNNVQLYEVSFIDQWLLCRNPIKKVLLRLMRKSKGNARNDSLQRQIVYSFGDHYYFRKELKILNLLT
GYIFLLDKFGRIRWQGFGLATQEEVSSLLSCASLLLEEK