; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G16930 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G16930
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUPF0481 protein At3g47200
Genome locationClcChr01:29722517..29727531
RNA-Seq ExpressionClc01G16930
SyntenyClc01G16930
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR004158 - Protein of unknown function DUF247, plant
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008440314.1 PREDICTED: UPF0481 protein At3g47200 [Cucumis melo]2.3e-29593.81Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVES
        MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+VIIE+ED KLEEDP  ESPE+EWV+TIKEKL+QAHQDEVES
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVES

Query:  TWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF
        +WAKLCIYKVPHYLKDGEDKAVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLD MKELEE+ARNCYEGP S SSNEFVEMMVLDGCF
Subjt:  TWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF

Query:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT
        VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Subjt:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT

Query:  AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN
        AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLN
Subjt:  AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN

Query:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGN
        LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF N
Subjt:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGN

Query:  PWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        PWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Subjt:  PWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

XP_011657877.1 UPF0481 protein At3g47200 [Cucumis sativus]3.6e-29694.19Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKL-EEDP--ESPETEWVVTIKEKLHQAHQDEVE
        MVAVFNKELLSWYLITLKLRETVESGLPRNS+SANSVDSHGK E+QLQE KQIQSESHHVI+E+EDQKL EEDP  ESP +EWV+TIKEKL+QAHQDEVE
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKL-EEDP--ESPETEWVVTIKEKLHQAHQDEVE

Query:  STWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGC
        S+WAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHIL+R+KQDIK+YLD MKELEE+ARNCYEGP S SSNEFVEMMVLDGC
Subjt:  STWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGC

Query:  FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA
        FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN 
Subjt:  FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA

Query:  TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFL
        TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFL
Subjt:  TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFL

Query:  NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFG
        NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV+YLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF 
Subjt:  NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFG

Query:  NPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        NPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Subjt:  NPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

XP_023003973.1 UPF0481 protein At3g47200-like [Cucurbita maxima]6.0e-29192.66Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW
        MVAVFNKELLSWYLITLKL+ETVESGLPRNS S NSVDSHGKP++QLQE +QIQSESHHVI+EDEDQKLEED ESPE+EWV++IKEKL QAHQDEVES+W
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDG+DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLD MKELEE AR+CYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL +QLG++YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLG QDGLHCLDVFRRSLLRSG KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRA+LKHNYF NPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW

Query:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        AIISL+AAVVLLLLTFAQ FYGVY YY+PPN
Subjt:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

XP_023518140.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]1.0e-29092.66Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW
        MVAVFNKELLSWYLITLKL+ETVESGLPRNS SANSVDSHGKPE+QLQE +QIQSESHHVI+EDEDQKLEED ESPE+EWV++IKEKL QAHQDEVES+W
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDG+DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLD MKELEE AR+CYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL LQLG++YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLG QDGLHCLDVFRRSLLRSG KLAPKVWIKRRSH +RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQ+SEDVN YYNHRWNAWRA+LKHNYF NPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW

Query:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        AIISL+AAVVLLLLTFAQ FYGVY YY+PPN
Subjt:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

XP_038880921.1 UPF0481 protein At3g47200-like [Benincasa hispida]5.4e-30096.05Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW
        MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDS GK E QLQE KQIQSESHHVIIEDEDQKLEEDPESPE+EWV+TIKEKL+QAHQDEVES+W
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER KQDIK+YLD MKELEE+ARNCYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL LQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELKEAG+RF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDIN+SYLSQLSEDVNRYYNHRWNAWRATLKHNYF NPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW

Query:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
Subjt:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

TrEMBL top hitse value%identityAlignment
A0A0A0KID5 Uncharacterized protein1.7e-29694.19Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKL-EEDP--ESPETEWVVTIKEKLHQAHQDEVE
        MVAVFNKELLSWYLITLKLRETVESGLPRNS+SANSVDSHGK E+QLQE KQIQSESHHVI+E+EDQKL EEDP  ESP +EWV+TIKEKL+QAHQDEVE
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKL-EEDP--ESPETEWVVTIKEKLHQAHQDEVE

Query:  STWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGC
        S+WAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHIL+R+KQDIK+YLD MKELEE+ARNCYEGP S SSNEFVEMMVLDGC
Subjt:  STWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGC

Query:  FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA
        FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN 
Subjt:  FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA

Query:  TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFL
        TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFL
Subjt:  TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFL

Query:  NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFG
        NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV+YLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF 
Subjt:  NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFG

Query:  NPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        NPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Subjt:  NPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

A0A1S3B0V1 UPF0481 protein At3g472001.1e-29593.81Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVES
        MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+VIIE+ED KLEEDP  ESPE+EWV+TIKEKL+QAHQDEVES
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVES

Query:  TWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF
        +WAKLCIYKVPHYLKDGEDKAVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLD MKELEE+ARNCYEGP S SSNEFVEMMVLDGCF
Subjt:  TWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF

Query:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT
        VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Subjt:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT

Query:  AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN
        AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLN
Subjt:  AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN

Query:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGN
        LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF N
Subjt:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGN

Query:  PWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        PWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Subjt:  PWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

A0A5D3CR40 UPF0481 protein1.1e-29593.81Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVES
        MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+VIIE+ED KLEEDP  ESPE+EWV+TIKEKL+QAHQDEVES
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVES

Query:  TWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF
        +WAKLCIYKVPHYLKDGEDKAVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLD MKELEE+ARNCYEGP S SSNEFVEMMVLDGCF
Subjt:  TWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF

Query:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT
        VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Subjt:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT

Query:  AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN
        AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLN
Subjt:  AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN

Query:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGN
        LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF N
Subjt:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGN

Query:  PWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        PWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Subjt:  PWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

A0A6J1HGP0 UPF0481 protein At3g47200-like8.4e-29192.66Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW
        MVAVFNKELLSWYLITLKL+ETVESGLPRNS SANSVDSHGKPE+QLQE +QIQSESHHVI+EDEDQKLEED ESPE+EWV++IKE L QAHQDEVES+W
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDG+DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLD MKELEE AR+CYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL LQLG++YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLG QDGLHCLDVFRRSLLRSG KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQ+SEDVN YYNHRWNAWRA+LKHNYF NPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW

Query:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        AIISL+AAVVLLLLTFAQ FYGVY YY+PPN
Subjt:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

A0A6J1KY55 UPF0481 protein At3g47200-like2.9e-29192.66Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW
        MVAVFNKELLSWYLITLKL+ETVESGLPRNS S NSVDSHGKP++QLQE +QIQSESHHVI+EDEDQKLEED ESPE+EWV++IKEKL QAHQDEVES+W
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDG+DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLD MKELEE AR+CYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL +QLG++YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLG QDGLHCLDVFRRSLLRSG KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRA+LKHNYF NPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW

Query:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        AIISL+AAVVLLLLTFAQ FYGVY YY+PPN
Subjt:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

SwissProt top hitse value%identityAlignment
P25776 Oryzain alpha chain1.4e-12058.12Show/hide
Query:  ATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNSEN----RTYKVGLNMFADLTNDEYRAVYLGTR
        A  LL LLS      S  +   RS+ E R +Y  W A+HGK+YN + E E+R+  F++NL +IDEHN+       ++++GLN FADLTN+EYR  YLG R
Subjt:  ATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNSEN----RTYKVGLNMFADLTNDEYRAVYLGTR

Query:  SPPARRVMKAKTASRRYAVNIRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFI
        + P R     +  S RY     + LPESVDWRT+GAVA +K+QG CGSCWAFS IAAVEGINQIVTG+LISLSEQELV CD  YN GCNGGLMDYAF FI
Subjt:  SPPARRVMKAKTASRRYAVNIRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFI

Query:  IDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDY
        I+NGG+DTE+DYPY+G D +CD  RKNAKVV+ID YEDV  + E +L+KAVA+QPVSVAIEA G A QLY SG+FTGKCG+ALDHGV AVGYGTENG DY
Subjt:  IDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDY

Query:  WLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNG--------CETSATSSLMEVDYVFRCFNS--CCTVVSF
        W+VRNSWG  WGE GY ++ERN+K  ++GKCGIA+  SYP+K G           S T      D  + C +S  CC +  +
Subjt:  WLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNG--------CETSATSSLMEVDYVFRCFNS--CCTVVSF

P43297 Cysteine proteinase RD21A8.5e-12359.5Show/hide
Query:  RSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDR
        RS+ EV  IY+ WL KHGKA   N + E+++RF+IFK+NL F+DEHN +N +Y++GL  FADLTNDEYR+ YLG +          +  S RY   + D 
Subjt:  RSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDR

Query:  LPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPT
        LPES+DWR +GAVA VK+QG CGSCWAFSTI AVEGINQIVTG+LI+LSEQELV CD  YN GCNGGLMDYAF+FII NGG+DT++DYPY+G DG CD  
Subjt:  LPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPT

Query:  RKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVK
        RKNAKVV+ID YEDVP   E++LKKAVAHQP+S+AIEA G A QLY SG+F G CG+ LDHGV+AVGYGTENG DYW+VRNSWG  WGE GY ++ RN+ 
Subjt:  RKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVK

Query:  HTTNGKCGIAMMASYPVKNG--------CETSATSSLMEVDYVFRC--FNSCCTVVSF
         +++GKCGIA+  SYP+KNG           S      + D  + C   N+CC +  +
Subjt:  HTTNGKCGIAMMASYPVKNG--------CETSATSSLMEVDYVFRC--FNSCCTVVSF

Q94B08 Germination-specific cysteine protease 11.3e-12361.05Show/hide
Query:  MATATTLLALLSFFFLSNSASALTR--------------RSDGEVREIYDLWLAKHGKAYNG----IEEREKRFQIFKENLNFIDEHNSENR--TYKVGL
        MA +T +L+LL  + + + AS                  R+D EVR IY  W A+HGK  N     I +++KRF IFK+NL FID HN +N+  TYK+GL
Subjt:  MATATTLLALLSFFFLSNSASALTR--------------RSDGEVREIYDLWLAKHGKAYNG----IEEREKRFQIFKENLNFIDEHNSENR--TYKVGL

Query:  NMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDR-LPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSC
          F DLTNDEYR +YLG R+ PARR+ KAK  +++Y+  +  + +PE+VDWR +GAV P+K+QG+CGSCWAFST AAVEGIN+IVTGELISLSEQELV C
Subjt:  NMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDR-LPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSC

Query:  DKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCG
        DK YN GCNGGLMDYAFQFI+ NGGL+TE+DYPY GF G+C+   KN++VVSIDGYEDVP  DE ALKKA+++QPVSVAIEA G   Q YQSG+FTG CG
Subjt:  DKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCG

Query:  SALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVK
        + LDH V+AVGYG+ENGVDYW+VRNSWG  WGE+GY ++ERN+  + +GKCGIA+ ASYPVK
Subjt:  SALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVK

Q9FMH8 Probable cysteine protease RD21B3.4e-12461.88Show/hide
Query:  TRRSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVN
        T RSD EV  IY+ W+ +HGK     NG+  E+++RF+IFK+NL FIDEHN++N +YK+GL  FADLTN+EYR++YLG +  P +RV+K    S RY   
Subjt:  TRRSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVN

Query:  IRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQ
        + D LP+SVDWR  GAVA VK+QGSCGSCWAFSTI AVEGIN+IVTG+LISLSEQELV CD  YN GCNGGLMDYAF+FII NGG+DTE DYPY+  DG+
Subjt:  IRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQ

Query:  CDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLE
        CD  RKNAKVV+ID YEDVP + E +LKKA+AHQP+SVAIEA G A QLY SGVF G CG+ LDHGV+AVGYGTENG DYW+VRNSWG  WGE GY K+ 
Subjt:  CDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLE

Query:  RNVKHTTNGKCGIAMMASYPVKNG--------CETSATSSLMEVDYVFRC--FNSCCTVVSF
        RN++  T GKCGIAM ASYP+K G           S        D  F C   N+CC +  +
Subjt:  RNVKHTTNGKCGIAMMASYPVKNG--------CETSATSSLMEVDYVFRC--FNSCCTVVSF

Q9LT78 Probable cysteine protease RD21C3.0e-12061.93Show/hide
Query:  MATA--TTLLALLSFFFLSNSAS------ALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNS-ENRTYKVGLNMFADLTNDEY
        MAT+  +  LALL F  L  S S        T R++ E R +Y+ WL ++ K YNG+ E+E+RF+IFK+NL F++EH+S  NRTY+VGL  FADLTNDE+
Subjt:  MATA--TTLLALLSFFFLSNSAS------ALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNS-ENRTYKVGLNMFADLTNDEY

Query:  RAVYLGTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGL
        RA+YL ++    R  +K +    +Y   + D LP+++DWR +GAV PVK+QGSCGSCWAFS I AVEGINQI TGELISLSEQELV CD  YN GC GGL
Subjt:  RAVYLGTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGL

Query:  MDYAFQFIIDNGGLDTEEDYPYEGFD-GQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVG
        MDYAF+FII+NGG+DTEEDYPY   D   C+  +KN +VV+IDGYEDVP +DEK+LKKA+A+QP+SVAIEA G A QLY SGVFTG CG++LDHGV+AVG
Subjt:  MDYAFQFIIDNGGLDTEEDYPYEGFD-GQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVG

Query:  YGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKN
        YG+E G DYW+VRNSWG+ WGE GYFKLERN+K  ++GKCG+AMMASYP K+
Subjt:  YGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKN

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)1.1e-21567.23Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW
        MVAVF K++LSWYL+TLK+RE +E+    + L     +  G PEI   +Q Q    +     +     ++E P+    +WV++I +KL QAH+D+  + W
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
         KLCIY+VP+YL++ ++K+  PQ VSLGPYHHGK+RLR M+RHKWR++  +L+R  Q IK+Y+D M+ELEEKAR CYEGPLSLSSNEF+EM+VLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGA EGF +LGY RNDP+FAMRGSMHSIQRDM+MLENQLPLFVL+RLLELQLG   Q GLVA+LA+RFFDPL P DEPLTKS  +KLE+SL    +F
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DP      LHCLDVFRRSLLRS PK  P++  KR S   RVADKRRQQLIHCV ELKEAGI+F+++KTDRFWD+ F NG ++IPRLLIHDGT+SLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW
        AFEQCH+D SNDITSY++FMDNLIDSHEDV+YLHYCGIIEHWLGSD EVA+LFNRLCQEVV+D  DSYLS+LS +VNRYY+H+WNAWRATLKH YF NPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPW

Query:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        AI+S  AAV+LL+LTF+Q+FY VYAYYKPP+
Subjt:  AIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

AT3G50130.1 Plant protein of unknown function (DUF247)6.8e-17662.74Show/hide
Query:  QKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGM
        QK  + PE    EWV++I++K+ QA +++  ++W KLCIY+VP YL++   K+  PQ VSLGP+HHG + L  M+RHKWR++  ++ R K DI++Y+D M
Subjt:  QKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGM

Query:  KELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAE
        KELE++AR CYEGP+ LSSN+F EM+VLDGCFVLELFRGA EGF +LGY RNDP+FAMRGSMHSIQRDM+MLENQLPLFVL+RLLE+QLG  +Q GLV+ 
Subjt:  KELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAE

Query:  LALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQD--GLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQ
        LA+RFFDPL P DEPLTK+     + SL     F+P+  +D   LHCLDVFRR+LLR      P++   R S   RVADKR+QQLIHCV EL+EAGI+F+
Subjt:  LALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQD--GLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQ

Query:  KKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDI
         +KTDRFWDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH+D SNDITSY++FMDNLIDS EDV YLHYCGIIEHWLG+D EVA+LFNRLCQEV +D 
Subjt:  KKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDI

Query:  NDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
         +SYLSQLS  V+R Y+ +WN  +A LKH YF NPWA  S  AA+VLL+LT  Q+F+  Y Y+ PP+
Subjt:  NDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

AT3G50140.1 Plant protein of unknown function (DUF247)2.8e-16960.68Show/hide
Query:  QKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGM
        Q   + PE    EWV+ IK+K+ Q  +D   ++W K+CIY+VP  LK  +  +  PQ VSLGPYHHG   LR M+ HKWR++  +++R KQ I++Y+D M
Subjt:  QKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGM

Query:  KELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAE
        KELEE+AR CYEGP+ LSSN+F +M+VLDGCFVL+LFRGA EGF +LGY RNDP+FAMRGSMHSI+RDM+MLENQLPLFVL+RLLELQLG  YQ GLVA+
Subjt:  KELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAE

Query:  LALRFFDPLTPNDEPLTKSSLNKLESSL-GNATAFDPLG--YQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRF
        LA+RFF+PL P     T  S  K+E+S   N   F+P+    ++ LHCLDVFRRSLL+   K  P++   R S    VADKR+QQL+HCV EL+EAGI+F
Subjt:  LALRFFDPLTPNDEPLTKSSLNKLESSL-GNATAFDPLG--YQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRF

Query:  QKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYD
        +++K+DRFWDI F NG ++IP+LLIHDGT+SLF NLIA+EQCH+D +NDITSY++FMDNLIDS ED+ YLHY  IIEHWLG+D EVA++FNRLCQEV +D
Subjt:  QKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYD

Query:  INDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
        + ++YLS+LS  V+RYYN +WN  +ATLKH YF NPWA  S  AAV+LLLLT  Q+F+  Y Y+KPP+
Subjt:  INDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN

AT3G50150.1 Plant protein of unknown function (DUF247)2.6e-15957.71Show/hide
Query:  QSESHHVIIEDEDQKL---EEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYH
        Q+  +HV    E  K+   EE P     EWV++IK+K+ +A   +  ++W KLCIY+VP YL++ + K+ +PQ VS+GPYHHGK  LR MERHKWR++  
Subjt:  QSESHHVIIEDEDQKL---EEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYH

Query:  ILERAKQDIKIYLDGMKELEEKARNCYEGPLSL-SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDR
        I+ R K +I++Y+D MKELEE+AR CY+GP+ + +SNEF EM+VLDGCFVLELF+G  +GF+++GY RNDP+FA RG MHSIQRDMIMLENQLPLFVLDR
Subjt:  ILERAKQDIKIYLDGMKELEEKARNCYEGPLSL-SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDR

Query:  LLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQL
        LL LQ G   Q G+VAE+A+RFF  L P  E LTKS     E SL +    D LG   GLHCLDVF RSL++S      +   +   + +    +++QQL
Subjt:  LLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQL

Query:  IHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEV
        IHCV EL+ AG+ F +K+T + WDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH   SN+ITSY++FMDNLI+S +DV+YLH+ GIIEHWLGSD EV
Subjt:  IHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEV

Query:  AELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP
        A+LFNRLC+EV++D  D YLSQLS +VNRYY+ +WN+ +ATL+  YF NPWA  S  AAV+LL LTF Q+F+ VYAYYKP
Subjt:  AELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP

AT3G50170.1 Plant protein of unknown function (DUF247)3.7e-19060.98Show/hide
Query:  VFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEI-------QLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEV
        + NK++L+WYL++LKLR+  ++   ++S      + HG PE+        +Q  KQ  SES   ++E+  ++   D       WV++I++KL QA +D+ 
Subjt:  VFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEI-------QLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEV

Query:  ESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDG
         + W KLCIY+VPHYL++ + K+  PQ VSLGPYHHGK+RLR MERHKWR+L  +L+R KQ I++Y + M+ELEEKAR CYEGP+SLS NEF EM+VLDG
Subjt:  ESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDG

Query:  CFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN
        CFVLELFRG  EGF ++GY RNDP+FAMRG MHSIQRDMIMLENQLPLFVLDRLLELQLG   Q G+VA +A++FFDPL P  E LTK   +KL + L  
Subjt:  CFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN

Query:  ATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLF
          + D LG +  LHCLDVFRRSLL+S P    +  +KR +   RV DKR+QQL+HCV EL+EAG++F+K+KTDRFWDI F NG ++IP+LLIHDGT+SLF
Subjt:  ATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLF

Query:  LNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF
         NLIAFEQCH++ SN ITSY++FMDNLI+S EDV+YLHYCGIIEHWLGSD EVA+LFNRLCQEVV+D  DS+LS+LS DVNRYYN +WN  +ATL H YF
Subjt:  LNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF

Query:  GNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP
         NPWA  S  AAV+LLLLT  Q+FY VYAYYKP
Subjt:  GNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCGCCACCACTTTGCTCGCCCTGCTCTCCTTCTTCTTCCTATCTAATTCTGCCTCCGCTCTCACCCGCCGGAGCGATGGCGAGGTTAGAGAAATCTACGACCT
GTGGCTGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTGAACTTCATCGATGAACATAATTCGGAGAATC
GGACTTATAAGGTTGGATTGAACATGTTCGCGGATTTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCTCCTGCTCGACGAGTCATGAAGGCCAAG
ACCGCCAGCCGCCGATACGCCGTCAACATCCGCGATCGGTTGCCGGAATCTGTCGATTGGAGGACCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAG
CTGCTGGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAAATCGTCACCGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAGCTGTGACAAAAAGTACA
ATTCAGGTTGCAATGGAGGCCTTATGGACTATGCCTTCCAGTTCATCATTGACAATGGCGGCTTGGACACCGAGGAAGATTATCCTTATGAAGGCTTTGATGGTCAATGC
GATCCCACCAGGAAAAATGCCAAGGTCGTTAGCATTGACGGGTACGAGGATGTCCCTGCTGATGACGAGAAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGCGT
CGCCATTGAAGCTAGTGGCTTAGCTTTGCAACTCTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCATCGCTGTTGGTTATGGCACAG
AGAACGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGGATGGGGTGAAGATGGCTACTTCAAGCTAGAGCGCAATGTAAAGCACACTACCAATGGCAAGTGT
GGGATCGCAATGATGGCTTCTTACCCTGTTAAGAATGGTTGTGAAACATCTGCTACTTCTAGTTTGATGGAAGTTGATTATGTCTTTCGATGTTTCAATTCATGTTGTAC
AGTGGTATCATTTCTGGGGAATGAGAGTTCTTCTAAAGCCAGTCAAAGTCTTAACAATTTCAGTTCTGTGCACAAAATGGTGGCTGTGTTCAATAAAGAGTTATTGAGCT
GGTACCTGATCACCCTCAAGCTCAGAGAAACGGTAGAATCTGGACTTCCCAGAAACTCACTTTCAGCCAATTCTGTTGATTCTCATGGAAAACCAGAAATCCAGCTCCAG
GAACAGAAACAGATTCAATCAGAATCCCATCATGTTATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCCCGAATCACCTGAGACAGAATGGGTTGTCACCATCAA
GGAAAAGCTTCACCAAGCTCATCAAGATGAAGTAGAAAGTACATGGGCAAAGCTCTGCATTTACAAGGTCCCTCACTACCTGAAAGATGGTGAAGACAAAGCTGTTGTTC
CTCAGATTGTCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGGTCGCTTTATCACATCCTAGAGAGAGCAAAGCAGGAC
ATAAAGATTTATCTGGACGGCATGAAAGAACTTGAAGAAAAAGCCCGTAATTGTTATGAAGGACCGCTTAGTTTAAGCAGCAATGAATTTGTGGAAATGATGGTGCTCGA
TGGTTGCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATCCAATCTTCGCAATGCGTGGCTCAATGCATTCGATCC
AGAGGGATATGATAATGCTGGAAAATCAGTTGCCCTTGTTTGTATTGGATCGACTGCTTGAGCTTCAGCTTGGTGACCACTACCAGAAAGGACTCGTAGCCGAATTAGCA
CTCAGATTCTTCGATCCATTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATCTCTCGGAAACGCAACCGCCTTTGACCCGCTTGGTTA
TCAAGACGGACTTCATTGCCTCGATGTTTTTCGACGAAGTCTCCTCCGATCTGGCCCGAAATTAGCACCGAAAGTGTGGATCAAACGGCGGTCTCATGCGAATCGGGTGG
CCGATAAACGGAGGCAGCAATTGATTCACTGTGTGAAAGAGTTGAAAGAGGCAGGGATCAGATTTCAGAAGAAGAAAACCGATCGATTTTGGGACATAAATTTCAACAAT
GGGGTTATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTCGAACAATGTCATCTTGATTGCAGCAATGACATAACCTC
TTATGTGGTTTTCATGGATAATCTAATAGATTCTCATGAAGATGTTGCTTACCTCCATTACTGTGGAATAATAGAGCATTGGCTTGGAAGTGATGAAGAAGTTGCAGAGC
TTTTCAATCGTCTCTGTCAAGAGGTAGTTTATGATATCAATGATAGCTATCTTTCCCAATTGTCTGAGGATGTGAATCGCTACTACAACCATAGATGGAATGCTTGGAGA
GCAACTTTGAAACACAACTACTTCGGTAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAGTTCTTTTGTTGCTTACTTTTGCACAAGCCTTCTATGGAGTTTATGC
TTATTACAAACCCCCAAATTGA
mRNA sequenceShow/hide mRNA sequence
CTCACTTTTGAGTTCCTTCAGGCTATAAATAACCATAGCCTCTTCCCCACTTTCTCCATAACTCAGAATCGAACAATCTTCCCTGTTCTCCAAGCAACCTTCGCCATGGC
CACCGCCACCACTTTGCTCGCCCTGCTCTCCTTCTTCTTCCTATCTAATTCTGCCTCCGCTCTCACCCGCCGGAGCGATGGCGAGGTTAGAGAAATCTACGACCTGTGGC
TGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTGAACTTCATCGATGAACATAATTCGGAGAATCGGACT
TATAAGGTTGGATTGAACATGTTCGCGGATTTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCTCCTGCTCGACGAGTCATGAAGGCCAAGACCGC
CAGCCGCCGATACGCCGTCAACATCCGCGATCGGTTGCCGGAATCTGTCGATTGGAGGACCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAGCTGCT
GGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAAATCGTCACCGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAGCTGTGACAAAAAGTACAATTCA
GGTTGCAATGGAGGCCTTATGGACTATGCCTTCCAGTTCATCATTGACAATGGCGGCTTGGACACCGAGGAAGATTATCCTTATGAAGGCTTTGATGGTCAATGCGATCC
CACCAGGAAAAATGCCAAGGTCGTTAGCATTGACGGGTACGAGGATGTCCCTGCTGATGACGAGAAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGCGTCGCCA
TTGAAGCTAGTGGCTTAGCTTTGCAACTCTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCATCGCTGTTGGTTATGGCACAGAGAAC
GGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGGATGGGGTGAAGATGGCTACTTCAAGCTAGAGCGCAATGTAAAGCACACTACCAATGGCAAGTGTGGGAT
CGCAATGATGGCTTCTTACCCTGTTAAGAATGGTTGTGAAACATCTGCTACTTCTAGTTTGATGGAAGTTGATTATGTCTTTCGATGTTTCAATTCATGTTGTACAGTGG
TATCATTTCTGGGGAATGAGAGTTCTTCTAAAGCCAGTCAAAGTCTTAACAATTTCAGTTCTGTGCACAAAATGGTGGCTGTGTTCAATAAAGAGTTATTGAGCTGGTAC
CTGATCACCCTCAAGCTCAGAGAAACGGTAGAATCTGGACTTCCCAGAAACTCACTTTCAGCCAATTCTGTTGATTCTCATGGAAAACCAGAAATCCAGCTCCAGGAACA
GAAACAGATTCAATCAGAATCCCATCATGTTATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCCCGAATCACCTGAGACAGAATGGGTTGTCACCATCAAGGAAA
AGCTTCACCAAGCTCATCAAGATGAAGTAGAAAGTACATGGGCAAAGCTCTGCATTTACAAGGTCCCTCACTACCTGAAAGATGGTGAAGACAAAGCTGTTGTTCCTCAG
ATTGTCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGGTCGCTTTATCACATCCTAGAGAGAGCAAAGCAGGACATAAA
GATTTATCTGGACGGCATGAAAGAACTTGAAGAAAAAGCCCGTAATTGTTATGAAGGACCGCTTAGTTTAAGCAGCAATGAATTTGTGGAAATGATGGTGCTCGATGGTT
GCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATCCAATCTTCGCAATGCGTGGCTCAATGCATTCGATCCAGAGG
GATATGATAATGCTGGAAAATCAGTTGCCCTTGTTTGTATTGGATCGACTGCTTGAGCTTCAGCTTGGTGACCACTACCAGAAAGGACTCGTAGCCGAATTAGCACTCAG
ATTCTTCGATCCATTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATCTCTCGGAAACGCAACCGCCTTTGACCCGCTTGGTTATCAAG
ACGGACTTCATTGCCTCGATGTTTTTCGACGAAGTCTCCTCCGATCTGGCCCGAAATTAGCACCGAAAGTGTGGATCAAACGGCGGTCTCATGCGAATCGGGTGGCCGAT
AAACGGAGGCAGCAATTGATTCACTGTGTGAAAGAGTTGAAAGAGGCAGGGATCAGATTTCAGAAGAAGAAAACCGATCGATTTTGGGACATAAATTTCAACAATGGGGT
TATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTCGAACAATGTCATCTTGATTGCAGCAATGACATAACCTCTTATG
TGGTTTTCATGGATAATCTAATAGATTCTCATGAAGATGTTGCTTACCTCCATTACTGTGGAATAATAGAGCATTGGCTTGGAAGTGATGAAGAAGTTGCAGAGCTTTTC
AATCGTCTCTGTCAAGAGGTAGTTTATGATATCAATGATAGCTATCTTTCCCAATTGTCTGAGGATGTGAATCGCTACTACAACCATAGATGGAATGCTTGGAGAGCAAC
TTTGAAACACAACTACTTCGGTAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAGTTCTTTTGTTGCTTACTTTTGCACAAGCCTTCTATGGAGTTTATGCTTATT
ACAAACCCCCAAATTGA
Protein sequenceShow/hide protein sequence
MATATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAK
TASRRYAVNIRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQC
DPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGKC
GIAMMASYPVKNGCETSATSSLMEVDYVFRCFNSCCTVVSFLGNESSSKASQSLNNFSSVHKMVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQ
EQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQD
IKIYLDGMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELA
LRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNN
GVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWR
ATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN