CuGenDBv2

Sgr011931 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr011931
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: lysosomal Pro-X carboxypeptidase
Genome location: tig00153145:26449..43756
RNA-Seq Expression: Sgr011931
Synteny: Sgr011931
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004180 - carboxypeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
GO:0008239 - dipeptidyl-peptidase activity (molecular function)
InterPro domains:
IPR008758 - Peptidase S28
IPR029058 - Alpha/Beta hydrolase fold
IPR042269 - Serine carboxypeptidase S28, SKS domain
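
The genome location in this record is written as `contig:start..end`. As a minimal sketch, the Python below splits such a string into its parts and computes the span of the gene on the contig. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Location` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'contig:start..end' location (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# contig name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a location string such as 'tig00153145:26449..43756'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["contig"], start, end)


loc = parse_location("tig00153145:26449..43756")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out as 17308 bases; with a half-open convention it would be one less.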


Homology
GenBank top hits (e value, % identity, alignment)
KAG6596591.1 Lysosomal Pro-X carboxypeptidase, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.7e-255; % identity: 84.43)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG
        MA PST LS + +L LHF+SSFSK +  F  SLLLRPQ  PI S   R YR  +FTQILDHFNFNPQ+YQ FQQRYLINDT+WGGAAENAPIFVYTGNEG
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG

Query:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY
        +IEWFAQNTG +LESAP+FRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITDLK+NLTAIDSPVVVFGGSYGGMLAAWFRLKY
Subjt:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY

Query:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS
        PH++IGA+ASSAPILQFENITSPY+FNNIITQDFKSESQNCYRVIKGSW  I+  ANQPGGP++LRKSFKFCK  +S+ IQNWLY A  YTAMTDYPTPS
Subjt:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS

Query:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL
        NFLNPLPAYPVKQMCKAIDD T GNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG+WTWQACTE+ILP G NT++SIFPASTW+YA+RV++C  L
Subjt:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL

Query:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        FDVEPRR WITT FGG NIERVLK+FGSNIIFFNGLRDPWSGGGVLKNIS+SIIAIVAKEGAHHVDLRFS P DPKW+KDVRKQ+LN+I+DWLSQYYLDL
Subjt:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

Query:  A
        A
Subjt:  A

KAG7028128.1 Lysosomal Pro-X carboxypeptidase [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.7e-255; % identity: 84.43)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG
        MA PST LS + +L LHF+SSFSK +  F  SLLLRPQ  PI S   R YR  +FTQILDHFNFNPQ+YQ FQQRYLINDT+WGGAAENAPIFVYTGNEG
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG

Query:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY
        +IEWFAQNTG +LESAP+FRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITDLK+NLTAIDSPVVVFGGSYGGMLAAWFRLKY
Subjt:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY

Query:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS
        PH++IGA+ASSAPILQFENITSPY+FNNIITQDFKSESQNCYRVIKGSW  I+  ANQPGGP++LRKSFKFCK  +S+ IQNWLY A  YTAMTDYPTPS
Subjt:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS

Query:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL
        NFLNPLPAYPVKQMCKAIDD T GNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG+WTWQACTE+ILP G NT++SIFPASTW+YA+RV++C  L
Subjt:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL

Query:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        FDVEPRR WITT FGG NIERVLK+FGSNIIFFNGLRDPWSGGGVLKNIS+SIIAIVAKEGAHHVDLRFS P DPKW+KDVRKQ+LN+I+DWLSQYYLDL
Subjt:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

Query:  A
        A
Subjt:  A

XP_022144768.1 lysosomal Pro-X carboxypeptidase-like isoform X1 [Momordica charantia] (e value: 1.9e-256; % identity: 84.75)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPI--HSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGN
        M  PSTFL F+ ++ LHFTSSFS F    PPSLLLRP++      S  +RLYR NYFTQILDHFNFNPQ+Y  FQQRYLINDT+WGGAAENAPIFVYTGN
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPI--HSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGN

Query:  EGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRL
        EG+IEWFAQNTGL+LESAPRFRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITDLK NLTA+DSPVVVFGGSYGGMLAAWFRL
Subjt:  EGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRL

Query:  KYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPT
        KYPHISIGAVASSAPIL FENITSPYAFNN+ITQDFKSESQNCY VIK SW  IE AAN+PGGPEMLR SFKFCK+ +SE I++WLY A  YTAMTDYPT
Subjt:  KYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPT

Query:  PSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCS
          NFLNPLPAYPVKQMCKAIDD TAGNDTFAKLYGAANVYYNY+GTATCFDLDDDSDPHDLG+WTWQACTEMILP GGNTK+SIFPAS W+Y +R++FC 
Subjt:  PSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCS

Query:  RLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYL
        RLF VEPRR WITT FGGH IERVLK+FGSNIIFFNGLRDPWS GGVLKNIS+SIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLN+IQDWLSQYY 
Subjt:  RLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYL

Query:  DLAHN
        DLA N
Subjt:  DLAHN

XP_022144769.1 lysosomal Pro-X carboxypeptidase-like isoform X2 [Momordica charantia] (e value: 4.6e-255; % identity: 84.75)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPI--HSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGN
        M  PSTFL F+ ++ LHFTSSFS F    PPSLLLRP++      S  +RLYR NYFTQILDHFNFNPQ+Y  FQQRYLINDT+WGGAAENAPIFVYTGN
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPI--HSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGN

Query:  EGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRL
        EG+IEWFAQNTGL+LESAPRFRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITDLK NLTA+DSPVVVFGGSYGGMLAAWFRL
Subjt:  EGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRL

Query:  KYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPT
        KYPHISIGAVASSAPIL FENITSPYAFNN+ITQDFKSESQNCY VIK SW  IE AAN+PGGPEMLR SFKFCK+ +SE I++WLY A  YTAMTDYPT
Subjt:  KYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPT

Query:  PSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCS
          NFLNPLPAYPVKQMCKAIDD TAGNDTFAKLYGAANVYYNY+GTATCFDLDDDSDPHDLG+WTWQACTEMILP GGNTK+SIFPAS W+Y +R++FC 
Subjt:  PSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCS

Query:  RLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYL
        RLF VEPRR WITT FGGH IERVLK+FGSNIIFFNGLRDPWS GGVLKNIS+SIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLN+IQDWLSQYY 
Subjt:  RLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYL

Query:  DLAHN
        DLA N
Subjt:  DLAHN

XP_023539018.1 lysosomal Pro-X carboxypeptidase [Cucurbita pepo subsp. pepo] (e value: 3.2e-256; % identity: 84.83)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG
        MA PST LS + ++ LHF+SSFSK +  F  SLLLRPQ  PI S   R YR  +FTQILDHFNFNPQ+YQ FQQRYLINDT+WGGAAENAPIFVYTGNEG
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG

Query:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY
        +IEWFAQNTG +LESAP+FRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITDLK+NLTAIDSPVVVFGGSYGGMLAAWFRLKY
Subjt:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY

Query:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS
        PH++IGAVASSAPILQFENITSPY+FNNIITQDFKSESQNCYRVIKGSW  I+  ANQPGGP++LRKSFKFCK  +S+ IQNWLY A  YTAMTDYPTPS
Subjt:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS

Query:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL
        NFLNPLPAYPVKQMCKAIDD T GNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG+WTWQACTE+ILP G NT++SIFPASTW+YA+RVN+C  L
Subjt:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL

Query:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        FDVEPRR WITT FGG NIERVLK+FGSNIIFFNGLRDPWSGGGVLKNIS+SIIAIVAKEGAHHVDLRFS P DPKWLKDVRKQ+LN+IQDWL+QYYLDL
Subjt:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

Query:  A
        A
Subjt:  A
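
In the alignment blocks above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the Python below counts identical positions from query and midline pairs and turns them into a percentage. The function names are hypothetical, and whether CuGenDB derives its reported % identity exactly this way is an assumption; the reported value covers the whole alignment, so a single block will not reproduce it.

```python
from typing import Iterable, Tuple


def count_identities(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block."""
    # Trailing spaces on the midline are often stripped; pad to the query length.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Percent identity accumulated over several (query, midline) blocks."""
    identical = columns = 0
    for query, midline in blocks:
        block_identical, block_columns = count_identities(query, midline)
        identical += block_identical
        columns += block_columns
    if columns == 0:
        raise ValueError("no aligned columns supplied")
    return 100.0 * identical / columns


# First ten columns of the first GenBank hit above.
blocks = [("MAFPSTFLSF", "MA PST LS ")]
print(count_identities(*blocks[0]))        # (7, 10)
print(round(percent_identity(blocks), 2))  # 70.0
```

Feeding every block of one hit into `percent_identity` gives a figure that can be compared against the % identity listed for that hit.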

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B7I8 lysosomal Pro-X carboxypeptidase-like (e value: 9.0e-249; % identity: 82.63)
Query:  AFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRL-YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG
        A PS  LS + +LSLHFTSSFSK    F  SLLLRPQ  PI   D  L Y+  +FTQILDHFNFNPQ+YQ FQQRYLINDT+WGGAA N+PIFVYTGNEG
Subjt:  AFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRL-YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG

Query:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY
        +IEWFAQNTG LL+SAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLG+LSSTQALADYATLITDLKKNL+A+DSPV+VFGGSYGGMLAAWFRLKY
Subjt:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY

Query:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS
        PHI++GA+ASSAPILQFENITSPYAF+NI+TQDFKSESQNCYRVIK SW LI+  +  P GP++LRKSFKFCK  E+E I+NWL  A  YTAMTDYPTPS
Subjt:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS

Query:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL
        NFLNPLPAYPVKQMCKAIDD   GNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG+WTWQACTEMILP GGNTKESIFPASTW++A+R++FC   
Subjt:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL

Query:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        FDVEPRRIWI T FGGHNIERVLK+FGSNIIFFNGLRDPWSGGGVLKNIS++IIAIVAKEGAHHVDLRFS+ EDPKWLKDVR+Q+L++I+DWLSQYYLDL
Subjt:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

Query:  A
        A
Subjt:  A

A0A6J1CT96 lysosomal Pro-X carboxypeptidase-like isoform X2 (e value: 2.2e-255; % identity: 84.75)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPI--HSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGN
        M  PSTFL F+ ++ LHFTSSFS F    PPSLLLRP++      S  +RLYR NYFTQILDHFNFNPQ+Y  FQQRYLINDT+WGGAAENAPIFVYTGN
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPI--HSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGN

Query:  EGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRL
        EG+IEWFAQNTGL+LESAPRFRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITDLK NLTA+DSPVVVFGGSYGGMLAAWFRL
Subjt:  EGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRL

Query:  KYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPT
        KYPHISIGAVASSAPIL FENITSPYAFNN+ITQDFKSESQNCY VIK SW  IE AAN+PGGPEMLR SFKFCK+ +SE I++WLY A  YTAMTDYPT
Subjt:  KYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPT

Query:  PSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCS
          NFLNPLPAYPVKQMCKAIDD TAGNDTFAKLYGAANVYYNY+GTATCFDLDDDSDPHDLG+WTWQACTEMILP GGNTK+SIFPAS W+Y +R++FC 
Subjt:  PSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCS

Query:  RLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYL
        RLF VEPRR WITT FGGH IERVLK+FGSNIIFFNGLRDPWS GGVLKNIS+SIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLN+IQDWLSQYY 
Subjt:  RLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYL

Query:  DLAHN
        DLA N
Subjt:  DLAHN

A0A6J1CU66 lysosomal Pro-X carboxypeptidase-like isoform X1 (e value: 9.0e-257; % identity: 84.75)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPI--HSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGN
        M  PSTFL F+ ++ LHFTSSFS F    PPSLLLRP++      S  +RLYR NYFTQILDHFNFNPQ+Y  FQQRYLINDT+WGGAAENAPIFVYTGN
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPI--HSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGN

Query:  EGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRL
        EG+IEWFAQNTGL+LESAPRFRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITDLK NLTA+DSPVVVFGGSYGGMLAAWFRL
Subjt:  EGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRL

Query:  KYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPT
        KYPHISIGAVASSAPIL FENITSPYAFNN+ITQDFKSESQNCY VIK SW  IE AAN+PGGPEMLR SFKFCK+ +SE I++WLY A  YTAMTDYPT
Subjt:  KYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPT

Query:  PSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCS
          NFLNPLPAYPVKQMCKAIDD TAGNDTFAKLYGAANVYYNY+GTATCFDLDDDSDPHDLG+WTWQACTEMILP GGNTK+SIFPAS W+Y +R++FC 
Subjt:  PSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCS

Query:  RLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYL
        RLF VEPRR WITT FGGH IERVLK+FGSNIIFFNGLRDPWS GGVLKNIS+SIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLN+IQDWLSQYY 
Subjt:  RLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYL

Query:  DLAHN
        DLA N
Subjt:  DLAHN

A0A6J1FCY6 lysosomal Pro-X carboxypeptidase (e value: 1.9e-254; % identity: 84.43)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG
        MA PST LS + +L LHF+SSFSK +  F  SLLLRPQ  PI S   R YR  +FTQILDHFNFNPQ+YQ FQQRYLINDT+WGGA ENAPIFVYTGNEG
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG

Query:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY
        +IEWFAQNTG +LESAP+FRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITDLK+NLTAIDSPVVVFGGSYGGMLAAWFRLKY
Subjt:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY

Query:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS
        PH++IGA+ASSAPILQFENITSPY+FNNIITQDFKSESQNCYRVIKGSW  I+  ANQPGGP++LRKSFKFCK  +S+ IQNWLY A  YTAMTDYPTPS
Subjt:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS

Query:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL
        NFLNPLPAYPVKQMCKAIDD T GNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG+WTWQACTE+ILP G NT++SIFPASTW+YA+RV++C  L
Subjt:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL

Query:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        FDVEPRR WITT FGG NIERVLK+FGSNIIFFNGLRDPWSGGGVLKNIS+SIIAIVAKEGAHHVDLRFS P DPKW+KDVRKQ+LN+IQDWLSQYYLDL
Subjt:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

Query:  A
        A
Subjt:  A

A0A6J1KYN4 lysosomal Pro-X carboxypeptidase (e value: 3.9e-252; % identity: 83.63)
Query:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG
        MA  ST LS + +L LHF+SSFSK +  F  S LLR Q  PI S   R YR  +FTQILDHFNFNPQ+YQ FQQRYLINDT+WGGAAENAPIFVYTGNEG
Subjt:  MAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEG

Query:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY
        +IEWFAQNTG +LESAP+FRALVVFIEHRFYGKSIPFGGDEDVAN NSSTLG+LSSTQALADYATLITD K+NLTAIDSPVVVFGGSYGGMLAAWFRLKY
Subjt:  DIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKY

Query:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS
        PH++IGAVASSAPILQFENITSPY+FNNIITQDFKSESQNCYRVIKGSW  I+  ANQPGGP++LR SFKFCK  +S+ IQNWLY A  YTAMTDYPTPS
Subjt:  PHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPS

Query:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL
        NFLNPLPAYPVKQMCKAIDD T GNDTFAKLYGAANVYYNY+GTATCFDLDDDSDPHDLG+WTWQACTE+ILP G NT++SIFPASTW+YA+RV++C  L
Subjt:  NFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRL

Query:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        FDVEPRR WITT FGG NIERVLK+FGSNIIFFNGLRDPWSGGGVLKNIS+SIIAIVAKEGAHHVDLRFS   DPKWLKDVRKQ+LN+IQDWLSQYYLDL
Subjt:  FDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

Query:  A
        A
Subjt:  A

SwissProt top hits (e value, % identity, alignment)
P42785 Lysosomal Pro-X carboxypeptidase (e value: 1.8e-100; % identity: 41.1)
Query:  YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSS
        Y + YF Q +DHF FN    + F QRYL+ D +W        I  YTGNEGDI WF  NTG + + A   +A++VF EHR+YG+S+PFG   D +  +S 
Subjt:  YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSS

Query:  TLGYLSSTQALADYATLITDLKKNLT-AIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGS
         L +L+S QALAD+A LI  LK+ +  A + PV+  GGSYGGMLAAWFR+KYPH+ +GA+A+SAPI QFE++     F  I+T DF+    +C   I  S
Subjt:  TLGYLSSTQALADYATLITDLKKNLT-AIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGS

Query:  WDLIERAANQPGGPEMLRKSFKFCKRTESEIIQ---NWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAK-LYGAANVYYNYTGT
        WD I R +N   G + L  +   C    S+ IQ   +W+   +   AM DYP  SNFL PLPA+P+K +C+ + +    +    + ++ A NVYYNY+G 
Subjt:  WDLIERAANQPGGPEMLRKSFKFCKRTESEIIQ---NWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAK-LYGAANVYYNYTGT

Query:  ATCFDLDDDSDPHDLG--NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSG
          C ++ + +    LG   W++QACTE+++P   N  + +F   +WN     + C + + V PR  WITT +GG NI        +NI+F NG  DPWSG
Subjt:  ATCFDLDDDSDPHDLG--NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSG

Query:  GGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYY
        GGV K+I+ +++A+   EGAHH+DLR     DP  +   R  ++  +++W+  +Y
Subjt:  GGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYY

Q2TA14 Lysosomal Pro-X carboxypeptidase (e value: 1.4e-100; % identity: 41.83)
Query:  YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSS
        Y I Y  Q +DHF FN    + F+QRYLI D +W    +   I  YTGNEGDI WF  NTG + + A   +A++VF EHR+YG+S+PFG D   + S+S 
Subjt:  YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSS

Query:  TLGYLSSTQALADYATLITDLKKNLT-AIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGS
         L +L++ QALAD+A LI  LK+ +  A +  V+  GGSYGGMLAAWFR+KYPH+ +GA+ASSAPI QF ++     F  I+T DF     NC   I+ S
Subjt:  TLGYLSSTQALADYATLITDLKKNLT-AIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGS

Query:  WDLIERAANQPGGPEMLRKSFKFC----KRTESEIIQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAK-LYGAANVYYNYTG
        WD I R A +  G   L ++   C    K  + + +++W+   +   AM DYP  SNFL PLPA+PVK +C+        +    + ++ A NVYYNY+G
Subjt:  WDLIERAANQPGGPEMLRKSFKFC----KRTESEIIQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAK-LYGAANVYYNYTG

Query:  TATCFDLDDDSDPHDLG--NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWS
         A C ++ + +    LG   W++QACTEM++P   +  + +F   +WN     + C + + V PR  WI T +GG NI        +NIIF NG  DPWS
Subjt:  TATCFDLDDDSDPHDLG--NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWS

Query:  GGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        GGGV K+I+ +++AIV   GAHH+DLR S   DP  ++  R  ++  ++ W+S +Y+ L
Subjt:  GGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

Q5RBU7 Lysosomal Pro-X carboxypeptidase (e value: 1.0e-100; % identity: 41.1)
Query:  YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSS
        Y + YF Q +DHF FN    + F QRYL+ D +W        I  YTGNEGDI WF  NTG + + A   +A++VF EHR+YG+S+PFG   D    +S 
Subjt:  YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSS

Query:  TLGYLSSTQALADYATLITDLKKNLT-AIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGS
         L +L+S QALAD+A LI  LK+ +  A + PV+  GGSYGGMLAAWFR+KYPH+ +GA+A+SAPI QFE++     F  I+T DF+    +C   I+ S
Subjt:  TLGYLSSTQALADYATLITDLKKNLT-AIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGS

Query:  WDLIERAANQPGGPEMLRKSFKFCKRTESEIIQ---NWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAK-LYGAANVYYNYTGT
        WD I R +N   G + L  +   C    S+ IQ   +W+   +   AM DYP  SNFL PLPA+P+K +C+ + +    +    + ++ A NVYYNY+G 
Subjt:  WDLIERAANQPGGPEMLRKSFKFCKRTESEIIQ---NWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAK-LYGAANVYYNYTGT

Query:  ATCFDLDDDSDPHDLG--NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSG
          C ++ + +    LG   W++QACTE+++P   N  + +F   +WN     + C + + V PR  WITT +GG NI        +NI+F NG  DPWSG
Subjt:  ATCFDLDDDSDPHDLG--NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSG

Query:  GGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYY
        GGV K+I+ +++A+   EGAHH+DLR     DP  +   R  ++  +++W+  +Y
Subjt:  GGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYY

Q7TMR0 Lysosomal Pro-X carboxypeptidase (e value: 7.7e-96; % identity: 39.57)
Query:  RLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSN
        R Y + YF Q +DHF F     + F+QRYL+ D HW        I  YTGNEGDI WF  NTG + + A   +A++VF EHR+YG+S+PFG D   +  +
Subjt:  RLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSN

Query:  SSTLGYLSSTQALADYATLITDLKKNLT-AIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIK
        S  L +L+S QALAD+A LI  L+K +  A   PV+  GGSYGGMLAAWFR+KYPHI +GA+A+SAPI Q + +     F  I+T DF+     C   I+
Subjt:  SSTLGYLSSTQALADYATLITDLKKNLT-AIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIK

Query:  GSWDLIERAANQPGGPEMLRKSFKFCKRTESE---IIQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAK-LYGAANVYYNYT
         SW++I++ +    G + L      C    SE    ++ W+   +   AM +YP   NFL PLPA+P+K++C+ + +    +    + ++ A +VYYNY+
Subjt:  GSWDLIERAANQPGGPEMLRKSFKFCKRTESE---IIQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAK-LYGAANVYYNYT

Query:  GTATCFDLDDDSDPHDLGN--WTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPW
        G A C ++   +    LG+  W++QACTEM++P   N  + +F    W+     N C   + V+PR  W+TT +GG NI        SNIIF NG  DPW
Subjt:  GTATCFDLDDDSDPHDLGN--WTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPW

Query:  SGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        SGGGV ++I+ +++AI   +GAHH+DLR     DP  +   R  ++  ++ W+  +Y ++
Subjt:  SGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

Q9EPB1 Dipeptidyl peptidase 2 (e value: 2.6e-88; % identity: 39.43)
Query:  YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSS
        +R NYF Q +DHFNF   + + F QR+L++D  W       PIF YTGNEGDI   A N+G ++E A +  AL+VF EHR+YGKS+PFG    V ++   
Subjt:  YRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSS

Query:  TLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSW
            L+  QALAD+A L+  L+ NL   D+P + FGGSYGGML+A+ R+KYPH+  GA+A+SAP++    + +P  F   +T DF  +S  C + ++ ++
Subjt:  TLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSW

Query:  DLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLY----IAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTA
          I+    Q G  + + ++F  C+   S      L+     AF   AM DYP P+NFL PLPA PVK  C+ +    +       L   A + YN +G  
Subjt:  DLIERAANQPGGPEMLRKSFKFCKRTESEIIQNWLY----IAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTA

Query:  TCFDL----DDDSDPHDLGN------WTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNG
         CFD+       +DP   G       W +QACTE+ L    N    +FP   ++   R  +C   + V PR  W+ T F G ++     K  SNIIF NG
Subjt:  TCFDL----DDDSDPHDLGN------WTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNG

Query:  LRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLS
          DPW+GGG+ +N+STSIIA+  + GAHH+DLR S  EDP  + +VRK +  +I++W++
Subjt:  LRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLS

Arabidopsis top hits (e value, % identity, alignment)
AT2G18080.1 Serine carboxypeptidase S28 family protein (e value: 1.6e-27; % identity: 27.43)
Query:  LSFYFYLSLHFTS--SFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRIN---YFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDI
        L F F L   FT+  SFS  +       LL P  +  +    R Y      +F Q LDH   +P  ++ F+QRY     ++   + + P+F+    EG  
Subjt:  LSFYFYLSLHFTS--SFSKFTPRFPPSLLLRPQKLPIHSPDDRLYRIN---YFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDI

Query:  EWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLT--------AIDSPVVVFGGSYGGMLAA
           A +   +L  A +F+A VV +EHR+YGKS PF     +A  N   L YLSS QAL D A+     +++L           D+P   FG SY G L+A
Subjt:  EWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLT--------AIDSPVVVFGGSYGGMLAA

Query:  WFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFK-FCKRTESEIIQNWLYIAFFYTAM
        WFRLK+PH++ G++ASSA       + + Y F+    Q  +S  Q C   ++ +  L+E       G ++  K+ K     TE ++  ++LY+      M
Subjt:  WFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFK-FCKRTESEIIQNWLYIAFFYTAM

Query:  T-DYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEM----ILPIGGNTKESIFPASTW
           Y  P     PL           +   T   +   +++G     YN           D +       W +QACTE+    + P     K     +   
Subjt:  T-DYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEM----ILPIGGNTKESIFPASTW

Query:  NYANRVNFCSRLF--DVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSII---AIVAKEGAHHVDLRFSTPED---------
        N    ++ C  LF  DV P+       +GG  +        + IIF NG  DPW      K  ST  +    I  +   H  D+R   P+          
Subjt:  NYANRVNFCSRLF--DVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSII---AIVAKEGAHHVDLRFSTPED---------

Query:  ----PKWLKDVRKQQLNVIQDWLSQ
            P ++  VR+Q +  I  WLS+
Subjt:  ----PKWLKDVRKQQLNVIQDWLSQ

AT2G24280.1 alpha/beta-Hydrolases superfamily protein (e value: 3.6e-149; % identity: 53.75)
Query:  LRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKS
        LR +K    S  +  +   YF Q LDHF+F P +Y++F Q+YLIN+  W    +  PIFVYTGNEGDI+WFA NTG +L+ AP+FRAL+VFIEHRFYG+S
Subjt:  LRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKS

Query:  IPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDF
         PFG     ++ ++ TLGYL+S QALADYA LI  LK+NL++  SPVVVFGGSYGGMLAAWFRLKYPHI+IGA+ASSAPIL F+NI    +F + I+QDF
Subjt:  IPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDF

Query:  KSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEI-IQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYG
        K  S NC++VIK SW+ +E  +    G + L K F+ CK   S+   ++WL  AF YTAM +YPT +NF+ PLP YPV+QMCK ID    G+    + + 
Subjt:  KSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTESEI-IQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYG

Query:  AANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFF
        AA++YYNY+G+  CF+++  +D H L  W +QACTEM++P+  + +  + P    + A +   C   + V+PR  WITTEFGG  IE VLK+FGSNIIF 
Subjt:  AANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFF

Query:  NGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        NG++DPWS GGVLKNIS+SI+A+V K+GAHH DLR +T +DP+WLK+ R+Q++ +I+ W+S+YY DL
Subjt:  NGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

AT5G22860.1 Serine carboxypeptidase S28 family protein (e value: 2.2e-106; % identity: 41.58)
Query:  DDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVAN
        D+   ++ YF Q LDHF F P++Y  FQQRY I+ THWGGA  NAPI  + G E  ++      G L ++ PR  AL+V+IEHR+YG+++PFG  E+ A 
Subjt:  DDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVAN

Query:  SNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVI
         N+STLGYL++ QALADYA ++  +K+  +   SP++V GGSYGGMLAAWFRLKYPHI++GA+ASSAP+L FE+    + +  I+T+ FK  S+ CY  I
Subjt:  SNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVI

Query:  KGSWDLIERAANQPGGPEMLRKSFKFCKRTESEI-IQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGT
        + SW  I+R A +P G  +L K FK C        I++  ++   Y     Y       N  P + V ++C AI +    N  +  L           G 
Subjt:  KGSWDLIERAANQPGGPEMLRKSFKFCKRTESEI-IQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGT

Query:  ATCFDLDDDSDPHDLG-NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGG
         TC+D    + P +    W WQ+C+E+++P+G + ++++FP + +N  + ++ C     V PR  WITT FG   ++ +L+KFGSNIIF NGL DP+S G
Subjt:  ATCFDLDDDSDPHDLG-NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGG

Query:  GVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL
        GVL++IS +++AI  K G+H +D+   + EDP+WL   R++++ VI  W+S Y  DL
Subjt:  GVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQDWLSQYYLDL

AT5G22860.2 Serine carboxypeptidase S28 family protein (e value: 7.6e-91; % identity: 41.65)
Query:  DDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVAN
        D+   ++ YF Q LDHF F P++Y  FQQRY I+ THWGGA  NAPI  + G E  ++      G L ++ PR  AL+V+IEHR+YG+++PFG  E+ A 
Subjt:  DDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVAN

Query:  SNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVI
         N+STLGYL++ QALADYA ++  +K+  +   SP++V GGSYGGMLAAWFRLKYPHI++GA+ASSAP+L FE+    + +  I+T+ FK  S+ CY  I
Subjt:  SNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVI

Query:  KGSWDLIERAANQPGGPEMLRKSFKFCKRTESEI-IQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGT
        + SW  I+R A +P G  +L K FK C        I++  ++   Y     Y       N  P + V ++C AI +    N  +  L           G 
Subjt:  KGSWDLIERAANQPGGPEMLRKSFKFCKRTESEI-IQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGT

Query:  ATCFDLDDDSDPHDLG-NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGG
         TC+D    + P +    W WQ+C+E+++P+G + ++++FP + +N  + ++ C     V PR  WITT FG   ++ +L+KFGSNIIF NGL DP+S G
Subjt:  ATCFDLDDDSDPHDLG-NWTWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGG

Query:  G
        G
Subjt:  G

AT5G65760.1 Serine carboxypeptidase S28 family protein (e value: 1.7e-146; % identity: 50.39)
Query:  MAFPSTFLSFYFYLSLHFTS-----SFSKFTPRFPPSLLLRPQKLPIHSPDDR---LYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPI
        MA     L  + + +L F S     S SK  PRFP       +        DR    Y   +F+Q LDHF+F       F QRYLIN  HW GA+   PI
Subjt:  MAFPSTFLSFYFYLSLHFTS-----SFSKFTPRFPPSLLLRPQKLPIHSPDDR---LYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPI

Query:  FVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGML
        F+Y GNEGDIEWFA N+G + + AP+F AL+VF EHR+YG+S+P+G  E+ A  N++TL YL++ QALAD+A  +TDLK+NL+A   PVV+FGGSYGGML
Subjt:  FVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGML

Query:  AAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTES-EIIQNWLYIAFFYT
        AAW RLKYPHI+IGA+ASSAPILQFE++  P  F +I + DFK ES +C+  IK SWD I     +  G   L K+F FC+   S + + +WL  A+ Y 
Subjt:  AAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLIERAANQPGGPEMLRKSFKFCKRTES-EIIQNWLYIAFFYT

Query:  AMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYA
        AM DYP P++F+ PLP +P++++C+ ID   +      ++Y   +VYYNYTG   CF LDD  DPH L  W WQACTEM++P+  N + S+FP   +NY+
Subjt:  AMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNWTWQACTEMILPIGGNTKESIFPASTWNYA

Query:  NRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQD
        +    C   F V PR  W+TTEFGGH+I   LK FGSNIIF NGL DPWSGG VLKN+S +I+A+V KEGAHH+DLR STPEDPKWL D R+ ++ +IQ 
Subjt:  NRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTPEDPKWLKDVRKQQLNVIQD

Query:  WLSQYYLD
        W+  Y ++
Subjt:  WLSQYYLD


Sequences
CDS sequence
ATGACTTCGTCTCAGCCAGTTCTCTCCCGAGTTCTTGTTCAAGCTCTCGAACCTTCTCCGAAGTTTCCTTTCAACTTCAAGCTCTCCAGCCACAGACTCAATTGCAGCCT
CCACCACTTCCTGCTCCTTGCTTTTCCATGCTTCCTTCTCCTCAGCAAAGCACCTCATCAGATAGCTTATGTCATTCTGCTCATATCTTTGCTCTTGGATAAGCTGATTG
ATCTGCAATCGAGCCCTCTCCAGCTCGGCATAAGACCTGATCAAGAAGACAAGAGAAAACCTAGAAATATCCGAACGAGGAAGCCAAAAAGCAGAAACCCACCTCGGAAA
CAGGGCTATGTGACGGATCGGAGAGATGGGGAGGCAAAGAACCAGAATGAACCGACCGTGTCGTTTTCTCTCTGGCCTTCGTTTCCTTTCTCGACTTCCTCTCGTCCAAG
ACCAGCCCCTCCTTCAGCCTCGTCGACGGCAACTCATTCATCTCCCACAGCGTCGCCGCCAGCTTCCTCGCCGATACCGGAGCTTGATTGGACCGACCACTGCCACACTT
ATACAGCTCGTACTTGGGGGACTCAGTGGAACGAAATGCAGATGCTGGAGATCTCGATCGCGAGCAGAGCCAAAATCAAGAGCAACGCCAAATGGGTTTCTGAATCAACA
CTGAAATACAAAGTTCAAAAGAAAGCCCACGAAGAAACTGCGAGTAATATAGAAGAGGAAGCAATTCAAATAGCCATAATACAGACTGGTTTAAAACTCTCTCACGGCCG
AATAAAGGGCAAATTATTGAACGAGGGCGCCACTTTTGTTCCGACTCCAAGTCCCACCTATCATCTCTCTGTTCCCTATTTCAACCTCACATGGCAGTTCCCACAGAGCT
CACTCACTCCAACAATGGCGTTTCCATCCACTTTTCTTTCCTTCTACTTTTACCTTTCCCTCCATTTTACTTCTTCTTTTTCCAAATTTACCCCTCGCTTCCCTCCTTCC
CTACTTCTCCGCCCTCAAAAACTTCCGATCCATTCCCCCGACGATCGCCTCTACCGAATCAACTACTTCACTCAGATTCTCGATCACTTCAACTTCAATCCCCAAGCCTA
CCAAATGTTTCAGCAGCGCTACCTGATCAACGACACTCATTGGGGCGGCGCCGCCGAGAATGCTCCGATCTTCGTTTACACCGGAAACGAAGGCGATATTGAATGGTTCG
CGCAGAACACGGGTTTATTACTCGAATCTGCGCCACGTTTCCGAGCTCTCGTCGTTTTTATCGAGCATCGATTTTACGGGAAATCGATTCCGTTTGGGGGAGATGAAGAT
GTGGCTAATAGTAATTCGAGTACGCTTGGATATCTAAGCTCCACGCAGGCGTTGGCAGATTATGCAACTCTGATCACCGACTTGAAGAAGAATTTGACGGCGATTGATTC
CCCGGTCGTCGTGTTTGGTGGCTCTTATGGAGGAATGCTGGCAGCATGGTTTAGGTTGAAATACCCTCACATTTCCATTGGAGCTGTAGCATCATCTGCTCCCATCCTCC
AATTTGAAAACATTACTTCTCCTTATGCCTTCAACAACATCATCACCCAAGATTTCAAGAGTGAGAGCCAGAATTGTTACAGAGTGATCAAAGGCTCATGGGACCTGATT
GAAAGGGCAGCTAACCAACCAGGAGGACCCGAGATGCTTCGCAAGTCATTCAAATTTTGCAAAAGGACGGAGTCCGAAATTATTCAAAATTGGCTCTACATTGCATTTTT
CTACACGGCGATGACAGATTATCCCACGCCCTCTAATTTTTTAAATCCGCTGCCAGCTTATCCAGTTAAACAGATGTGCAAGGCGATTGACGATCGGACGGCTGGGAACG
ACACGTTTGCAAAGTTGTATGGTGCGGCTAACGTCTACTATAACTACACTGGAACGGCGACGTGTTTTGATCTGGACGATGATTCAGATCCTCACGACCTTGGAAATTGG
ACTTGGCAGGCGTGTACGGAGATGATATTGCCAATAGGGGGCAACACCAAGGAGAGCATATTTCCAGCTTCTACATGGAACTATGCCAACCGAGTCAACTTCTGCAGTCG
CTTGTTTGACGTGGAGCCTCGCCGTATTTGGATCACCACAGAGTTTGGGGGCCATAACATTGAGAGAGTTTTGAAGAAGTTTGGAAGCAACATAATCTTCTTCAATGGCT
TGAGAGATCCTTGGAGTGGTGGAGGGGTGCTGAAAAACATATCCACGAGTATAATAGCTATCGTAGCAAAAGAAGGAGCTCATCATGTAGACTTGAGATTCTCAACTCCT
GAAGATCCAAAATGGCTGAAAGACGTGAGAAAGCAGCAGCTCAACGTCATTCAGGATTGGCTTTCCCAATATTACCTCGACTTGGCCCATAACTGA
mRNA sequence
ATGACTTCGTCTCAGCCAGTTCTCTCCCGAGTTCTTGTTCAAGCTCTCGAACCTTCTCCGAAGTTTCCTTTCAACTTCAAGCTCTCCAGCCACAGACTCAATTGCAGCCT
CCACCACTTCCTGCTCCTTGCTTTTCCATGCTTCCTTCTCCTCAGCAAAGCACCTCATCAGATAGCTTATGTCATTCTGCTCATATCTTTGCTCTTGGATAAGCTGATTG
ATCTGCAATCGAGCCCTCTCCAGCTCGGCATAAGACCTGATCAAGAAGACAAGAGAAAACCTAGAAATATCCGAACGAGGAAGCCAAAAAGCAGAAACCCACCTCGGAAA
CAGGGCTATGTGACGGATCGGAGAGATGGGGAGGCAAAGAACCAGAATGAACCGACCGTGTCGTTTTCTCTCTGGCCTTCGTTTCCTTTCTCGACTTCCTCTCGTCCAAG
ACCAGCCCCTCCTTCAGCCTCGTCGACGGCAACTCATTCATCTCCCACAGCGTCGCCGCCAGCTTCCTCGCCGATACCGGAGCTTGATTGGACCGACCACTGCCACACTT
ATACAGCTCGTACTTGGGGGACTCAGTGGAACGAAATGCAGATGCTGGAGATCTCGATCGCGAGCAGAGCCAAAATCAAGAGCAACGCCAAATGGGTTTCTGAATCAACA
CTGAAATACAAAGTTCAAAAGAAAGCCCACGAAGAAACTGCGAGTAATATAGAAGAGGAAGCAATTCAAATAGCCATAATACAGACTGGTTTAAAACTCTCTCACGGCCG
AATAAAGGGCAAATTATTGAACGAGGGCGCCACTTTTGTTCCGACTCCAAGTCCCACCTATCATCTCTCTGTTCCCTATTTCAACCTCACATGGCAGTTCCCACAGAGCT
CACTCACTCCAACAATGGCGTTTCCATCCACTTTTCTTTCCTTCTACTTTTACCTTTCCCTCCATTTTACTTCTTCTTTTTCCAAATTTACCCCTCGCTTCCCTCCTTCC
CTACTTCTCCGCCCTCAAAAACTTCCGATCCATTCCCCCGACGATCGCCTCTACCGAATCAACTACTTCACTCAGATTCTCGATCACTTCAACTTCAATCCCCAAGCCTA
CCAAATGTTTCAGCAGCGCTACCTGATCAACGACACTCATTGGGGCGGCGCCGCCGAGAATGCTCCGATCTTCGTTTACACCGGAAACGAAGGCGATATTGAATGGTTCG
CGCAGAACACGGGTTTATTACTCGAATCTGCGCCACGTTTCCGAGCTCTCGTCGTTTTTATCGAGCATCGATTTTACGGGAAATCGATTCCGTTTGGGGGAGATGAAGAT
GTGGCTAATAGTAATTCGAGTACGCTTGGATATCTAAGCTCCACGCAGGCGTTGGCAGATTATGCAACTCTGATCACCGACTTGAAGAAGAATTTGACGGCGATTGATTC
CCCGGTCGTCGTGTTTGGTGGCTCTTATGGAGGAATGCTGGCAGCATGGTTTAGGTTGAAATACCCTCACATTTCCATTGGAGCTGTAGCATCATCTGCTCCCATCCTCC
AATTTGAAAACATTACTTCTCCTTATGCCTTCAACAACATCATCACCCAAGATTTCAAGAGTGAGAGCCAGAATTGTTACAGAGTGATCAAAGGCTCATGGGACCTGATT
GAAAGGGCAGCTAACCAACCAGGAGGACCCGAGATGCTTCGCAAGTCATTCAAATTTTGCAAAAGGACGGAGTCCGAAATTATTCAAAATTGGCTCTACATTGCATTTTT
CTACACGGCGATGACAGATTATCCCACGCCCTCTAATTTTTTAAATCCGCTGCCAGCTTATCCAGTTAAACAGATGTGCAAGGCGATTGACGATCGGACGGCTGGGAACG
ACACGTTTGCAAAGTTGTATGGTGCGGCTAACGTCTACTATAACTACACTGGAACGGCGACGTGTTTTGATCTGGACGATGATTCAGATCCTCACGACCTTGGAAATTGG
ACTTGGCAGGCGTGTACGGAGATGATATTGCCAATAGGGGGCAACACCAAGGAGAGCATATTTCCAGCTTCTACATGGAACTATGCCAACCGAGTCAACTTCTGCAGTCG
CTTGTTTGACGTGGAGCCTCGCCGTATTTGGATCACCACAGAGTTTGGGGGCCATAACATTGAGAGAGTTTTGAAGAAGTTTGGAAGCAACATAATCTTCTTCAATGGCT
TGAGAGATCCTTGGAGTGGTGGAGGGGTGCTGAAAAACATATCCACGAGTATAATAGCTATCGTAGCAAAAGAAGGAGCTCATCATGTAGACTTGAGATTCTCAACTCCT
GAAGATCCAAAATGGCTGAAAGACGTGAGAAAGCAGCAGCTCAACGTCATTCAGGATTGGCTTTCCCAATATTACCTCGACTTGGCCCATAACTGA
Protein sequence
MTSSQPVLSRVLVQALEPSPKFPFNFKLSSHRLNCSLHHFLLLAFPCFLLLSKAPHQIAYVILLISLLLDKLIDLQSSPLQLGIRPDQEDKRKPRNIRTRKPKSRNPPRK
QGYVTDRRDGEAKNQNEPTVSFSLWPSFPFSTSSRPRPAPPSASSTATHSSPTASPPASSPIPELDWTDHCHTYTARTWGTQWNEMQMLEISIASRAKIKSNAKWVSEST
LKYKVQKKAHEETASNIEEEAIQIAIIQTGLKLSHGRIKGKLLNEGATFVPTPSPTYHLSVPYFNLTWQFPQSSLTPTMAFPSTFLSFYFYLSLHFTSSFSKFTPRFPPS
LLLRPQKLPIHSPDDRLYRINYFTQILDHFNFNPQAYQMFQQRYLINDTHWGGAAENAPIFVYTGNEGDIEWFAQNTGLLLESAPRFRALVVFIEHRFYGKSIPFGGDED
VANSNSSTLGYLSSTQALADYATLITDLKKNLTAIDSPVVVFGGSYGGMLAAWFRLKYPHISIGAVASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWDLI
ERAANQPGGPEMLRKSFKFCKRTESEIIQNWLYIAFFYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDRTAGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGNW
TWQACTEMILPIGGNTKESIFPASTWNYANRVNFCSRLFDVEPRRIWITTEFGGHNIERVLKKFGSNIIFFNGLRDPWSGGGVLKNISTSIIAIVAKEGAHHVDLRFSTP
EDPKWLKDVRKQQLNVIQDWLSQYYLDLAHN