CuGenDBv2

Lsi01G009570 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G009570
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: UPF0481 protein At3g47200
Genome location: chr01:7765683..7771431
RNA-Seq Expression: Lsi01G009570
Synteny: Lsi01G009570
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
IPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR004158 - Protein of unknown function DUF247, plant
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase
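
The genome location above can be split into a sequence name and two coordinates. The following is a minimal Python sketch, not part of the database record; the `Location` type and the `parse_location` helper are hypothetical names, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'chr01:7765683..7771431'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("chr01:7765683..7771431")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span covered by this gene model is `end - start + 1` base pairs.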


Homology
GenBank top hits    e value    %identity    Alignment
XP_008440314.1 PREDICTED: UPF0481 protein At3g47200 [Cucumis melo]    1.2e-278    94.8
Query:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEED--LESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH
        ++SVDSHGK ELQL + KQIQSESH+VIIE+ED KLEED   ESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH
Subjt:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEED--LESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH

Query:  HGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI
        HGKRRLRQMERHKWRSLYHILER+  DIK+YLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI
Subjt:  HGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI

Query:  QRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW
        QRDMIMLENQ+PLFVLDRL+ELQLGD YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW
Subjt:  QRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW

Query:  IKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA
        +KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA
Subjt:  IKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA

Query:  YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Subjt:  YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

XP_011657877.1 UPF0481 protein At3g47200 [Cucumis sativus]    5.3e-279    94.21
Query:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEE---DLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPY
        ++SVDSHGK ELQLQ+ KQIQSESHHVI+E+EDQKLEE   +LESP SEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQI+SLGPY
Subjt:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEE---DLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPY

Query:  HHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHS
        HHGKRRLRQMERHKWRSLYHIL+R+ QDIK+YLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHS
Subjt:  HHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHS

Query:  IQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKV
        IQRDMIMLENQ+PLFVLDRL+ELQLGD YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKV
Subjt:  IQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKV

Query:  WIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV
        W+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV
Subjt:  WIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV

Query:  AYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPP
        +YLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPP
Subjt:  AYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPP

Query:  N
        N
Subjt:  N

XP_023003973.1 UPF0481 protein At3g47200-like [Cucurbita maxima]    6.1e-275    92.43
Query:  NRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGP
        N N  +SVDSHGKP+LQLQ+ +QIQSESHHVI+EDEDQKLEED ESPESEWVI+IKEKL+QAHQDEVESSWAKLCIYKVPHYLKDG+DKAVVPQI+SLGP
Subjt:  NRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGP

Query:  YHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH
        YHHGKRRLRQMERHKWRSLYHILER   DI IYLDAMKELEE AR+CYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH
Subjt:  YHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH

Query:  SIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPK
        SIQRDMIMLENQ+PLFVLDRL+ +QLG+ YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAFDPLG QDGLHCLDVFRRSLLRSG KLAPK
Subjt:  SIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPK

Query:  VWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED
        VWIKRRSHA+RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED
Subjt:  VWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED

Query:  VAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP
        VAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRA+LKHNYFSNPWAIISL+AAVVLLLLTFAQ FYGVY YY+P
Subjt:  VAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP

Query:  PN
        PN
Subjt:  PN

XP_023518140.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]    2.3e-274    92.23
Query:  NRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGP
        N N ++SVDSHGKPELQLQ+ +QIQSESHHVI+EDEDQKLEED ESPESEWVI+IKEKL+QAHQDEVESSWAKLCIYKVPHYLKDG+DKAVVPQI+SLGP
Subjt:  NRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGP

Query:  YHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH
        YHHGKRRLRQMERHKWRSLYHILER   DI IYLDAMKELEE AR+CYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH
Subjt:  YHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH

Query:  SIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPK
        SIQRDMIMLENQ+PLFVLDRL+ LQLG+ YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAFDPLG QDGLHCLDVFRRSLLRSG KLAPK
Subjt:  SIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPK

Query:  VWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED
        VWIKRRSH +RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED
Subjt:  VWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED

Query:  VAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP
        VAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQ+SEDVN YYNHRWNAWRA+LKHNYFSNPWAIISL+AAVVLLLLTFAQ FYGVY YY+P
Subjt:  VAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP

Query:  PN
        PN
Subjt:  PN

XP_038880921.1 UPF0481 protein At3g47200-like [Benincasa hispida]    7.4e-281    95.78
Query:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHG
        ++SVDS GK E QLQ+LKQIQSESHHVIIEDEDQKLEED ESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQI+SLGPYHHG
Subjt:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHG

Query:  KRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQR
        KRRLRQMERHKWRSLYHILER  QDIK+YLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQR
Subjt:  KRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQR

Query:  DMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIK
        DMIMLENQ+PLFVLDRL+ LQLGD YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+K
Subjt:  DMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIK

Query:  RRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYL
        RRSHANRVADKRRQQLIHCVKELKEAG+RF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYL
Subjt:  RRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYL

Query:  HYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        HYCGIIEHWLGSDEEVA+LFNRLCQEVVYDIN+SYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
Subjt:  HYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
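
In alignments laid out this way, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below is illustrative only and is not how the database computed its %identity column; `midline_stats` is a hypothetical helper, and the input is a short fragment copied from the last block above, so its percentage will differ from the value reported for the full alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identical and conservative positions from a query line and its midline.

    Assumes both strings cover the same alignment columns (equal length).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m == q and m.isalpha())
    # '+' marks a conservative (positively scoring) substitution.
    conservative = midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identical": identical,
        "positive": identical + conservative,
        "percent_identity": 100.0 * identical / length,
    }


# Fragment of the final Query/midline pair in the hit above.
query_fragment = "HYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQ"
midline_fragment = "HYCGIIEHWLGSDEEVA+LFNRLCQEVVYDIN+SYLSQ"

stats = midline_stats(query_fragment, midline_fragment)
print(stats)
```

Summing these counts over every block of a hit, rather than over one fragment, would give figures comparable to the %identity column.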

TrEMBL top hits    e value    %identity    Alignment
A0A0A0KID5 Uncharacterized protein    2.6e-279    94.21
Query:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEE---DLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPY
        ++SVDSHGK ELQLQ+ KQIQSESHHVI+E+EDQKLEE   +LESP SEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQI+SLGPY
Subjt:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEE---DLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPY

Query:  HHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHS
        HHGKRRLRQMERHKWRSLYHIL+R+ QDIK+YLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHS
Subjt:  HHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHS

Query:  IQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKV
        IQRDMIMLENQ+PLFVLDRL+ELQLGD YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKV
Subjt:  IQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKV

Query:  WIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV
        W+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV
Subjt:  WIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV

Query:  AYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPP
        +YLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPP
Subjt:  AYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPP

Query:  N
        N
Subjt:  N

A0A1S3B0V1 UPF0481 protein At3g47200    5.7e-279    94.8
Query:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEED--LESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH
        ++SVDSHGK ELQL + KQIQSESH+VIIE+ED KLEED   ESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH
Subjt:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEED--LESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH

Query:  HGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI
        HGKRRLRQMERHKWRSLYHILER+  DIK+YLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI
Subjt:  HGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI

Query:  QRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW
        QRDMIMLENQ+PLFVLDRL+ELQLGD YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW
Subjt:  QRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW

Query:  IKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA
        +KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA
Subjt:  IKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA

Query:  YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Subjt:  YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

A0A5D3CR40 UPF0481 protein    5.7e-279    94.8
Query:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEED--LESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH
        ++SVDSHGK ELQL + KQIQSESH+VIIE+ED KLEED   ESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH
Subjt:  SSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEED--LESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYH

Query:  HGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI
        HGKRRLRQMERHKWRSLYHILER+  DIK+YLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI
Subjt:  HGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSI

Query:  QRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW
        QRDMIMLENQ+PLFVLDRL+ELQLGD YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW
Subjt:  QRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW

Query:  IKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA
        +KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA
Subjt:  IKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVA

Query:  YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Subjt:  YLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

A0A6J1KY55 UPF0481 protein At3g47200-like    2.9e-275    92.43
Query:  NRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGP
        N N  +SVDSHGKP+LQLQ+ +QIQSESHHVI+EDEDQKLEED ESPESEWVI+IKEKL+QAHQDEVESSWAKLCIYKVPHYLKDG+DKAVVPQI+SLGP
Subjt:  NRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGP

Query:  YHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH
        YHHGKRRLRQMERHKWRSLYHILER   DI IYLDAMKELEE AR+CYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH
Subjt:  YHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH

Query:  SIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPK
        SIQRDMIMLENQ+PLFVLDRL+ +QLG+ YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAFDPLG QDGLHCLDVFRRSLLRSG KLAPK
Subjt:  SIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPK

Query:  VWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED
        VWIKRRSHA+RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED
Subjt:  VWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHED

Query:  VAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP
        VAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRA+LKHNYFSNPWAIISL+AAVVLLLLTFAQ FYGVY YY+P
Subjt:  VAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP

Query:  PN
        PN
Subjt:  PN

A0A803NR13 Uncharacterized protein    9.1e-285    55.89
Query:  LALLSFFFLSISASAL--SRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDNLNFIDDHNS-ENRTYKVGLNKFADLTNDEYRAVYLGTRSPPAR
        L L+SF  ++ S++ +  S R+D E++EIY  W+ ++ K YNG+ E ++RFQIFKDNL F+D HN+ ENR+YK+GLN+FADLTN+EYR  +LGTRS   R
Subjt:  LALLSFFFLSISASAL--SRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDNLNFIDDHNS-ENRTYKVGLNKFADLTNDEYRAVYLGTRSPPAR

Query:  RVMKAKSASRRYA--VNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDN
        RVMKA++ASRRYA   N+  +LP+SVDWR  GAV                                                                  
Subjt:  RVMKAKSASRRYA--VNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDN

Query:  GGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLV
                               N KVV+I GYEDV   DE+ALK A+AHQP+SVAIEA G ALQLYQSGVFTG+CG+ LDHGV  VGYGTENGVDYWLV
Subjt:  GGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLV

Query:  RNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKN---GNNPTTSYLICWKLIMSFDVSIHVGLKFGLLLYLEIVHVNDSVQYQFVLVEIQCFCL
        RNSWGT WGE+GYFKLERN+  T NGKCGIA+ ASYPVKN    + P+  YL+                                               
Subjt:  RNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKN---GNNPTTSYLICWKLIMSFDVSIHVGLKFGLLLYLEIVHVNDSVQYQFVLVEIQCFCL

Query:  WGMRVLQNPVKVQFSSVSMTSSVDYPKNVQNNSTNRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDE-----DQKLEEDLESPESEWVITIKEKLNQ
                                           R ++++V           +++Q  SES  V++++E     D  +EE L+SP+SEWVI+I EKL Q
Subjt:  WGMRVLQNPVKVQFSSVSMTSSVDYPKNVQNNSTNRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDE-----DQKLEEDLESPESEWVITIKEKLNQ

Query:  AHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVE
        A QD+   SWAKLCIYKVPHYL++G+DKA  PQI+SLGPYHHGKRRLRQM+RHKWRSL   L+R NQDIK+YLD++KE+EE+ R CYEG  + SSNEFVE
Subjt:  AHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVE

Query:  MMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGD--LYQKGLVAELALRFFDPLTPNDEPLTKSSLN
        MMVLDGCFVLELFRGAA GFK LGYPRNDPIFAMRGSMHSIQRDMIMLENQ+PLF+L+RL+ LQLG   L  KGL+++L L+FFDPL P DEPL+KS   
Subjt:  MMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGD--LYQKGLVAELALRFFDPLTPNDEPLTKSSLN

Query:  KLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLI
                           GLHCLDVFRRSLLR GP+  P+VW+KR SHA+RVADKRRQQLIHCV EL+EAG++F+K+KTDRFWDI F NG+++IPRLLI
Subjt:  KLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLI

Query:  HDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWR
        HDGT+SLFLNLIA+EQCH D  NDITSYV+FMDNLI+S EDV YLHYCGIIEHWLGSD EV+DLFNRLCQEVV+DINDSYLS+LS DVN+YYNHRWNAWR
Subjt:  HDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWR

Query:  ATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        A+L+HNYFSNPWAIIS VAAVVLLLLTFAQ FYGVY++Y PPN
Subjt:  ATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

SwissProt top hits    e value    %identity    Alignment
P25776 Oryzain alpha chain    1.4e-125    62.64
Query:  AAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDNLNFIDDHNSEN----RTYKVGLNKFADLTNDEYRAVYLGT
        AA +LL LLS     +S  +   RS+ E R +Y  W A+HGK+YN + E E+R+  F+DNL +ID+HN+       ++++GLN+FADLTN+EYR  YLG 
Subjt:  AAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDNLNFIDDHNSEN----RTYKVGLNKFADLTNDEYRAVYLGT

Query:  RSPPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQF
        R+ P R     +  S RY   + + LPESVDWR++GAVA +K+QG CGSCWAFS IAAVEGINQIVTG+LISLSEQELV+CD  YN GCNGGLMDYAF F
Subjt:  RSPPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQF

Query:  IIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVD
        II+NGG+DTE+DYPY+G D +CD  RKNAKVV ID YEDV  N E +L+KA+A+QPVSVAIEAGG A QLY SG+FTGKCG+ALDHGV AVGYGTENG D
Subjt:  IIDNGGLDTEEDYPYEGVDGQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVD

Query:  YWLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNP
        YW+VRNSWG  WGE GY ++ERN+K  ++GKCGIA+  SYP+K G NP
Subjt:  YWLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGNNP

P43297 Cysteine proteinase RD21A    1.9e-125    64.63
Query:  SALSRRSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKDNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAV
        S    RS+ EV  IY+ WL KHGKA   N + E+++RF+IFKDNL F+D+HN +N +Y++GL +FADLTNDEYR+ YLG +          +  S RY  
Subjt:  SALSRRSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKDNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAV

Query:  NNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDG
           D LPES+DWR +GAVA VK+QG CGSCWAFSTI AVEGINQIVTG+LI+LSEQELV+CD  YN GCNGGLMDYAF+FII NGG+DT++DYPY+GVDG
Subjt:  NNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDG

Query:  QCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKL
         CD  RKNAKVV ID YEDV    EE+LKKA+AHQP+S+AIEAGG A QLY SG+F G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY ++
Subjt:  QCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKL

Query:  ERNVKHTTNGKCGIAMMASYPVKNGNNP
         RN+  +++GKCGIA+  SYP+KNG NP
Subjt:  ERNVKHTTNGKCGIAMMASYPVKNGNNP

Q94B08 Germination-specific cysteine protease 1    1.3e-126    66.87
Query:  RSDGEVREIYDLWLAKHGKAYNG----IEEREKRFQIFKDNLNFIDDHNSENR--TYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRY--A
        R+D EVR IY  W A+HGK  N     I +++KRF IFKDNL FID HN +N+  TYK+GL KF DLTNDEYR +YLG R+ PARR+ KAK+ +++Y  A
Subjt:  RSDGEVREIYDLWLAKHGKAYNG----IEEREKRFQIFKDNLNFIDDHNSENR--TYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRY--A

Query:  VNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVD
        VN ++ +PE+VDWR +GAV P+K+QG+CGSCWAFST AAVEGIN+IVTGELISLSEQELV+CDK YN GCNGGLMDYAFQFI+ NGGL+TE+DYPY G  
Subjt:  VNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVD

Query:  GQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFK
        G+C+   KN++VV+IDGYEDV   DE ALKKAI++QPVSVAIEAGG   Q YQSG+FTG CG+ LDH VVAVGYG+ENGVDYW+VRNSWG  WGE+GY +
Subjt:  GQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFK

Query:  LERNVKHTTNGKCGIAMMASYPVKNGNNP
        +ERN+  + +GKCGIA+ ASYPVK   NP
Subjt:  LERNVKHTTNGKCGIAMMASYPVKNGNNP

Q9FMH8 Probable cysteine protease RD21B    1.3e-126    67.69
Query:  RSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKDNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNNR
        RSD EV  IY+ W+ +HGK     NG+  E+++RF+IFKDNL FID+HN++N +YK+GL +FADLTN+EYR++YLG +  P +RV+K    S RY     
Subjt:  RSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKDNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAKSASRRYAVNNR

Query:  DRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCD
        D LP+SVDWR  GAVA VK+QGSCGSCWAFSTI AVEGIN+IVTG+LISLSEQELV+CD  YN GCNGGLMDYAF+FII NGG+DTE DYPY+  DG+CD
Subjt:  DRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQCD

Query:  PTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKLERN
          RKNAKVV ID YEDV  N E +LKKA+AHQP+SVAIEAGG A QLY SGVF G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY K+ RN
Subjt:  PTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKLERN

Query:  VKHTTNGKCGIAMMASYPVKNGNNP
        ++  T GKCGIAM ASYP+K G NP
Subjt:  VKHTTNGKCGIAMMASYPVKNGNNP

Q9LT78 Probable cysteine protease RD21C    2.5e-122    63.22
Query:  LALLSFFFLSISASALS------RRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDNLNFIDDHNS-ENRTYKVGLNKFADLTNDEYRAVYLGTRS
        LALL F  L IS S  S       R++ E R +Y+ WL ++ K YNG+ E+E+RF+IFKDNL F+++H+S  NRTY+VGL +FADLTNDE+RA+YL ++ 
Subjt:  LALLSFFFLSISASALS------RRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDNLNFIDDHNS-ENRTYKVGLNKFADLTNDEYRAVYLGTRS

Query:  PPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFII
           R  +K +    +Y     D LP+++DWR++GAV PVK+QGSCGSCWAFS I AVEGINQI TGELISLSEQELV+CD  YN GC GGLMDYAF+FII
Subjt:  PPARRVMKAKSASRRYAVNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFII

Query:  DNGGLDTEEDYPYEGVD-GQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDY
        +NGG+DTEEDYPY   D   C+  +KN +VV IDGYEDV  NDE++LKKA+A+QP+SVAIEAGG A QLY SGVFTG CG++LDHGVVAVGYG+E G DY
Subjt:  DNGGLDTEEDYPYEGVD-GQCDPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDY

Query:  WLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVK-NGNNP
        W+VRNSWG+ WGE GYFKLERN+K  ++GKCG+AMMASYP K +G+NP
Subjt:  WLVRNSWGTEWGEDGYFKLERNVKHTTNGKCGIAMMASYPVK-NGNNP

Arabidopsis top hits    e value    %identity    Alignment
AT3G50120.1 Plant protein of unknown function (DUF247)    3.2e-205    71.27
Query:  LEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKE
        ++E  +    +WVI+I +KL QAH+D+  + W KLCIY+VP+YL++ ++K+  PQ +SLGPYHHGK+RLR M+RHKWR++  +L+R NQ IK+Y+DAM+E
Subjt:  LEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKE

Query:  LEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELA
        LEE+AR CYEGP S SSNEF+EM+VLDGCFVLELFRGA EGF +LGY RNDP+FAMRGSMHSIQRDM+MLENQ+PLFVL+RL+ELQLG   Q GLVA+LA
Subjt:  LEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELA

Query:  LRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKT
        +RFFDPL P DEPLTKS  +KLE+SL    +FDP      LHCLDVFRRSLLRS PK  P++  KR S   RVADKRRQQLIHCV ELKEAGI+F+++KT
Subjt:  LRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKT

Query:  DRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSY
        DRFWD+ F NG ++IPRLLIHDGT+SLFLNLIAFEQCH+D SNDITSY++FMDNLIDSHEDV+YLHYCGIIEHWLGSD EVADLFNRLCQEVV+D  DSY
Subjt:  DRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSY

Query:  LSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        LS+LS +VNRYY+H+WNAWRATLKH YF+NPWAI+S  AAV+LL+LTF+Q+FY VYAYYKPP+
Subjt:  LSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

AT3G50130.1 Plant protein of unknown function (DUF247)    4.5e-175    62.55
Query:  QKLEEDLESPE---SEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYL
        QKL++  + PE    EWVI+I++K+ QA +++  +SW KLCIY+VP YL++   K+  PQ +SLGP+HHG + L  M+RHKWR++  ++ R   DI++Y+
Subjt:  QKLEEDLESPE---SEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYL

Query:  DAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGL
        DAMKELE+RAR CYEGP   SSN+F EM+VLDGCFVLELFRGA EGF +LGY RNDP+FAMRGSMHSIQRDM+MLENQ+PLFVL+RL+E+QLG  +Q GL
Subjt:  DAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGL

Query:  VAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQD--GLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGI
        V+ LA+RFFDPL P DEPLTK+     + SL     F+P+  +D   LHCLDVFRR+LLR      P++   R S   RVADKR+QQLIHCV EL+EAGI
Subjt:  VAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQD--GLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGI

Query:  RFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVV
        +F+ +KTDRFWDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH+D SNDITSY++FMDNLIDS EDV YLHYCGIIEHWLG+D EVADLFNRLCQEV 
Subjt:  RFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVV

Query:  YDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        +D  +SYLSQLS  V+R Y+ +WN  +A LKH YF+NPWA  S  AA+VLL+LT  Q+F+  Y Y+ PP+
Subjt:  YDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

AT3G50140.1 Plant protein of unknown function (DUF247)    8.1e-169    60.93
Query:  QKLEEDLESPE---SEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYL
        QKL+   + PE    EWVI IK+K+ Q  +D   +SW K+CIY+VP  LK  +  +  PQ +SLGPYHHG   LR M+ HKWR++  +++R  Q I++Y+
Subjt:  QKLEEDLESPE---SEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYL

Query:  DAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGL
        DAMKELEERAR CYEGP   SSN+F +M+VLDGCFVL+LFRGA EGF +LGY RNDP+FAMRGSMHSI+RDM+MLENQ+PLFVL+RL+ELQLG  YQ GL
Subjt:  DAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGL

Query:  VAELALRFFDPLTPNDEPLTKSSLNKLESSL-GNATAFDPLG--YQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAG
        VA+LA+RFF+PL P     T  S  K+E+S   N   F+P+    ++ LHCLDVFRRSLL+   K  P++   R S    VADKR+QQL+HCV EL+EAG
Subjt:  VAELALRFFDPLTPNDEPLTKSSLNKLESSL-GNATAFDPLG--YQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAG

Query:  IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEV
        I+F+++K+DRFWDI F NG ++IP+LLIHDGT+SLF NLIA+EQCH+D +NDITSY++FMDNLIDS ED+ YLHY  IIEHWLG+D EVAD+FNRLCQEV
Subjt:  IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEV

Query:  VYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
         +D+ ++YLS+LS  V+RYYN +WN  +ATLKH YFSNPWA  S  AAV+LLLLT  Q+F+  Y Y+KPP+
Subjt:  VYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

AT3G50150.1 Plant protein of unknown function (DUF247)    6.5e-158    55.27
Query:  KNVQNNSTNRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESP---ESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDK
        ++   +   R +S  V  H   E+  + L   Q+  +HV    E  K+E   E P     EWVI+IK+K+ +A   +  +SW KLCIY+VP YL++ + K
Subjt:  KNVQNNSTNRNVSSSVDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESP---ESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDK

Query:  AVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSF-SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPR
        + +PQ +S+GPYHHGK  LR MERHKWR++  I+ R   +I++Y+DAMKELEE AR CY+GP    +SNEF EM+VLDGCFVLELF+G  +GF+++GY R
Subjt:  AVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSF-SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPR

Query:  NDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRR
        NDP+FA RG MHSIQRDMIMLENQ+PLFVLDRL+ LQ G   Q G+VAE+A+RFF  L P  E LTKS     E SL +    D LG   GLHCLDVF R
Subjt:  NDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRR

Query:  SLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYV
        SL++S      +   +   + +    +++QQLIHCV EL+ AG+ F +K+T + WDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH   SN+ITSY+
Subjt:  SLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYV

Query:  VFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFA
        +FMDNLI+S +DV+YLH+ GIIEHWLGSD EVADLFNRLC+EV++D  D YLSQLS +VNRYY+ +WN+ +ATL+  YF+NPWA  S  AAV+LL LTF 
Subjt:  VFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFA

Query:  QAFYGVYAYYKP
        Q+F+ VYAYYKP
Subjt:  QAFYGVYAYYKP

AT3G50170.1 Plant protein of unknown function (DUF247)    e value: 7.6e-183    %identity: 62.58
Query:  HGKPEL-------QLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHH
        HG PE+        +Q  KQ  SES   ++E+  +      E+    WVI+I++KL QA +D+  + W KLCIY+VPHYL++ + K+  PQ +SLGPYHH
Subjt:  HGKPEL-------QLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHH

Query:  GKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQ
        GK+RLR MERHKWR+L  +L+R  Q I++Y +AM+ELEE+AR CYEGP S S NEF EM+VLDGCFVLELFRG  EGF ++GY RNDP+FAMRG MHSIQ
Subjt:  GKRRLRQMERHKWRSLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQ

Query:  RDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWI
        RDMIMLENQ+PLFVLDRL+ELQLG   Q G+VA +A++FFDPL P  E LTK   +KL + L    + D LG +  LHCLDVFRRSLL+S P    +  +
Subjt:  RDMIMLENQVPLFVLDRLIELQLGDLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWI

Query:  KRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAY
        KR +   RV DKR+QQL+HCV EL+EAG++F+K+KTDRFWDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH++ SN ITSY++FMDNLI+S EDV+Y
Subjt:  KRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAY

Query:  LHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP
        LHYCGIIEHWLGSD EVADLFNRLCQEVV+D  DS+LS+LS DVNRYYN +WN  +ATL H YF+NPWA  S  AAV+LLLLT  Q+FY VYAYYKP
Subjt:  LHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP


Sequences
CDS sequence
ATGGCCGCCGCCCCCTCTTTGCTCGCCCTCCTCTCCTTCTTCTTCCTTTCCATTTCCGCCTCCGCACTCAGCCGCCGGAGCGACGGCGAGGTTAGAGAAATCTATGACCT
GTGGCTGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAGCGGGAGAAGAGGTTTCAGATCTTCAAGGATAATCTGAACTTTATCGATGATCATAATTCTGAGAATC
GCACGTATAAGGTTGGATTGAACAAGTTCGCCGATCTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCCCCTGCTCGACGAGTCATGAAGGCAAAG
TCCGCCAGCCGCCGATACGCCGTCAACAACCGCGATCGGTTGCCGGAATCTGTTGATTGGAGGTCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAG
TTGTTGGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAGATCGTTACTGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAACTGTGACAAAAAGTACA
ATTCAGGTTGCAATGGAGGTCTTATGGACTATGCCTTCCAATTCATTATTGACAATGGCGGCTTGGACACTGAGGAAGATTATCCTTATGAGGGCGTCGATGGTCAATGC
GATCCCACCAGGAAAAATGCCAAGGTTGTTAACATCGACGGATACGAGGATGTCCTTGCGAATGACGAGGAAGCATTGAAGAAGGCCATTGCTCACCAACCAGTTAGCGT
CGCCATTGAAGCTGGTGGCTTGGCTTTGCAACTTTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCGTCGCTGTTGGTTACGGCACAG
AGAACGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGAATGGGGTGAGGATGGCTACTTCAAACTAGAGCGCAATGTAAAGCACACTACCAATGGGAAGTGT
GGGATCGCAATGATGGCTTCTTACCCTGTTAAGAATGGCAACAACCCAACAACATCATACTTAATTTGTTGGAAGTTGATTATGTCTTTTGATGTTTCAATTCATGTTGG
TCTGAAGTTTGGCCTATTGCTATACCTAGAAATTGTGCATGTAAATGATTCAGTTCAATATCAATTTGTCCTTGTTGAAATACAGTGTTTTTGTTTATGGGGGATGAGAG
TTCTTCAAAACCCAGTCAAAGTGCAATTTAGTTCTGTTTCTATGACCTCATCGGTCGATTACCCAAAAAATGTGCAAAACAATTCAACCAACCGTAATGTGTCCAGCTCT
GTTGATTCTCATGGAAAACCAGAACTCCAGCTCCAGAAATTAAAACAGATTCAATCAGAATCACATCATGTTATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCT
CGAATCACCGGAATCAGAATGGGTTATCACCATCAAGGAAAAGCTTAACCAAGCTCATCAAGATGAAGTAGAGAGTTCATGGGCGAAGCTCTGCATTTACAAGGTCCCCC
ACTACCTGAAAGATGGTGAAGACAAAGCTGTTGTTCCTCAGATTATCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGG
TCGCTTTACCACATCCTAGAGAGAGCAAATCAGGACATAAAGATTTATCTGGATGCCATGAAAGAACTTGAAGAAAGAGCTCGTAATTGTTATGAAGGACCATTCAGTTT
TAGCAGCAATGAATTTGTGGAAATGATGGTGCTCGATGGTTGCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATC
CAATCTTCGCAATGCGTGGCTCAATGCATTCGATCCAGAGGGATATGATAATGCTAGAAAATCAGGTGCCCCTGTTTGTATTGGATCGGCTGATAGAGCTTCAGCTTGGT
GACCTTTACCAGAAAGGGCTCGTAGCCGAATTAGCACTCAGATTCTTCGATCCGTTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATC
TCTCGGAAATGCAACTGCCTTTGACCCGCTTGGTTATCAAGACGGACTTCATTGCCTCGATGTTTTTCGGCGAAGTCTCCTCCGGTCTGGCCCAAAATTAGCACCGAAAG
TGTGGATCAAACGGCGGTCTCATGCGAATCGGGTGGCCGATAAACGGAGGCAGCAATTGATTCACTGCGTGAAAGAGTTGAAAGAGGCAGGGATTAGATTCCAGAAGAAG
AAAACTGATCGGTTTTGGGACATAAATTTCAACAATGGGGTTATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTTGA
ACAATGTCATCTTGATTGCAGCAACGACATAACCTCCTATGTGGTTTTCATGGATAATCTAATCGATTCTCATGAAGACGTTGCTTACCTCCATTACTGTGGAATAATAG
AGCATTGGCTTGGTAGTGATGAGGAGGTTGCAGACCTTTTCAATCGTCTCTGTCAAGAGGTAGTTTATGACATCAATGATAGCTATTTGTCCCAATTGTCTGAGGATGTG
AATCGCTACTACAACCATAGATGGAATGCTTGGAGAGCAACTTTAAAACACAACTACTTCAGCAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAGTTCTTTTGTT
GCTTACTTTTGCACAAGCCTTCTATGGAGTTTATGCTTATTACAAGCCCCCAAATTGA
mRNA sequence
ACTTTTGAGTTCCTTTAGGCTATAAATACCCATAGCCACTTCCCCACTTTCTCCATAACTCAGAATCGAACAATCTTCCCTGTTCTTGAAGCAACCTTCGCCATGGCCGC
CGCCCCCTCTTTGCTCGCCCTCCTCTCCTTCTTCTTCCTTTCCATTTCCGCCTCCGCACTCAGCCGCCGGAGCGACGGCGAGGTTAGAGAAATCTATGACCTGTGGCTGG
CGAAGCACGGCAAGGCCTATAACGGAATCGAAGAGCGGGAGAAGAGGTTTCAGATCTTCAAGGATAATCTGAACTTTATCGATGATCATAATTCTGAGAATCGCACGTAT
AAGGTTGGATTGAACAAGTTCGCCGATCTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCCCCTGCTCGACGAGTCATGAAGGCAAAGTCCGCCAG
CCGCCGATACGCCGTCAACAACCGCGATCGGTTGCCGGAATCTGTTGATTGGAGGTCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAGTTGTTGGG
CATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAGATCGTTACTGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAACTGTGACAAAAAGTACAATTCAGGT
TGCAATGGAGGTCTTATGGACTATGCCTTCCAATTCATTATTGACAATGGCGGCTTGGACACTGAGGAAGATTATCCTTATGAGGGCGTCGATGGTCAATGCGATCCCAC
CAGGAAAAATGCCAAGGTTGTTAACATCGACGGATACGAGGATGTCCTTGCGAATGACGAGGAAGCATTGAAGAAGGCCATTGCTCACCAACCAGTTAGCGTCGCCATTG
AAGCTGGTGGCTTGGCTTTGCAACTTTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCGTCGCTGTTGGTTACGGCACAGAGAACGGA
GTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGAATGGGGTGAGGATGGCTACTTCAAACTAGAGCGCAATGTAAAGCACACTACCAATGGGAAGTGTGGGATCGC
AATGATGGCTTCTTACCCTGTTAAGAATGGCAACAACCCAACAACATCATACTTAATTTGTTGGAAGTTGATTATGTCTTTTGATGTTTCAATTCATGTTGGTCTGAAGT
TTGGCCTATTGCTATACCTAGAAATTGTGCATGTAAATGATTCAGTTCAATATCAATTTGTCCTTGTTGAAATACAGTGTTTTTGTTTATGGGGGATGAGAGTTCTTCAA
AACCCAGTCAAAGTGCAATTTAGTTCTGTTTCTATGACCTCATCGGTCGATTACCCAAAAAATGTGCAAAACAATTCAACCAACCGTAATGTGTCCAGCTCTGTTGATTC
TCATGGAAAACCAGAACTCCAGCTCCAGAAATTAAAACAGATTCAATCAGAATCACATCATGTTATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCTCGAATCAC
CGGAATCAGAATGGGTTATCACCATCAAGGAAAAGCTTAACCAAGCTCATCAAGATGAAGTAGAGAGTTCATGGGCGAAGCTCTGCATTTACAAGGTCCCCCACTACCTG
AAAGATGGTGAAGACAAAGCTGTTGTTCCTCAGATTATCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGGTCGCTTTA
CCACATCCTAGAGAGAGCAAATCAGGACATAAAGATTTATCTGGATGCCATGAAAGAACTTGAAGAAAGAGCTCGTAATTGTTATGAAGGACCATTCAGTTTTAGCAGCA
ATGAATTTGTGGAAATGATGGTGCTCGATGGTTGCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATCCAATCTTC
GCAATGCGTGGCTCAATGCATTCGATCCAGAGGGATATGATAATGCTAGAAAATCAGGTGCCCCTGTTTGTATTGGATCGGCTGATAGAGCTTCAGCTTGGTGACCTTTA
CCAGAAAGGGCTCGTAGCCGAATTAGCACTCAGATTCTTCGATCCGTTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATCTCTCGGAA
ATGCAACTGCCTTTGACCCGCTTGGTTATCAAGACGGACTTCATTGCCTCGATGTTTTTCGGCGAAGTCTCCTCCGGTCTGGCCCAAAATTAGCACCGAAAGTGTGGATC
AAACGGCGGTCTCATGCGAATCGGGTGGCCGATAAACGGAGGCAGCAATTGATTCACTGCGTGAAAGAGTTGAAAGAGGCAGGGATTAGATTCCAGAAGAAGAAAACTGA
TCGGTTTTGGGACATAAATTTCAACAATGGGGTTATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTTGAACAATGTC
ATCTTGATTGCAGCAACGACATAACCTCCTATGTGGTTTTCATGGATAATCTAATCGATTCTCATGAAGACGTTGCTTACCTCCATTACTGTGGAATAATAGAGCATTGG
CTTGGTAGTGATGAGGAGGTTGCAGACCTTTTCAATCGTCTCTGTCAAGAGGTAGTTTATGACATCAATGATAGCTATTTGTCCCAATTGTCTGAGGATGTGAATCGCTA
CTACAACCATAGATGGAATGCTTGGAGAGCAACTTTAAAACACAACTACTTCAGCAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAGTTCTTTTGTTGCTTACTT
TTGCACAAGCCTTCTATGGAGTTTATGCTTATTACAAGCCCCCAAATTGA
Protein sequence
MAAAPSLLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKDNLNFIDDHNSENRTYKVGLNKFADLTNDEYRAVYLGTRSPPARRVMKAK
SASRRYAVNNRDRLPESVDWRSRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVNCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGVDGQC
DPTRKNAKVVNIDGYEDVLANDEEALKKAIAHQPVSVAIEAGGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTEWGEDGYFKLERNVKHTTNGKC
GIAMMASYPVKNGNNPTTSYLICWKLIMSFDVSIHVGLKFGLLLYLEIVHVNDSVQYQFVLVEIQCFCLWGMRVLQNPVKVQFSSVSMTSSVDYPKNVQNNSTNRNVSSS
VDSHGKPELQLQKLKQIQSESHHVIIEDEDQKLEEDLESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIISLGPYHHGKRRLRQMERHKWR
SLYHILERANQDIKIYLDAMKELEERARNCYEGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQVPLFVLDRLIELQLG
DLYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKK
KTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDV
NRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN