; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G070890 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G070890
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionUPF0481 protein At3g47200
Genome locationCicolChr04:27692417..27702672
RNA-Seq ExpressionCcUC04G070890
SyntenyCcUC04G070890
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR004158 - Protein of unknown function DUF247, plant
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008440314.1 PREDICTED: UPF0481 protein At3g47200 [Cucumis melo]1.2e-28393.76Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLNQAHHDEVES
        MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+VIIE+ED KLEEDP  ESPE+EWV+TIKEKLNQAH DEVES
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLNQAHHDEVES

Query:  SWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF
        SWAKLCIYKVPHYLKDGEDKAVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLDAMKELEE+ARNCYEGP S SSNEFVEMMVLDGCF
Subjt:  SWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF

Query:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT
        VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLG++YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Subjt:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT

Query:  AFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN
        AFDPLGYQDGLHCLDVFRRSLLRS PKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLN
Subjt:  AFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN

Query:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN
        LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN
Subjt:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN

Query:  PWAIISLVAAVIL
        PWAIISL+AAV+L
Subjt:  PWAIISLVAAVIL

XP_011657877.1 UPF0481 protein At3g47200 [Cucumis sativus]1.8e-28494.16Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKL-EEDP--ESPETEWVVTIKEKLNQAHHDEVE
        MVAVFNKELLSWYLITLKLRETVESGLPRNS+SANSVDSHGK E+QLQE KQIQSESHHVI+E+EDQKL EEDP  ESP +EWV+TIKEKLNQAH DEVE
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKL-EEDP--ESPETEWVVTIKEKLNQAHHDEVE

Query:  SSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGC
        SSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHIL+R+KQDIK+YLDAMKELEE+ARNCYEGP S SSNEFVEMMVLDGC
Subjt:  SSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGC

Query:  FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA
        FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLG++YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN 
Subjt:  FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA

Query:  TAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFL
        TAFDPLGYQDGLHCLDVFRRSLLRS PKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFL
Subjt:  TAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFL

Query:  NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFS
        NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV+YLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFS
Subjt:  NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFS

Query:  NPWAIISLVAAVIL
        NPWAIISL+AAV+L
Subjt:  NPWAIISLVAAVIL

XP_023003973.1 UPF0481 protein At3g47200-like [Cucurbita maxima]1.6e-28093.15Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW
        MVAVFNKELLSWYLITLKL+ETVESGLPRNS S NSVDSHGKP++QLQE +QIQSESHHVI+EDEDQKLEED ESPE+EWV++IKEKL+QAH DEVESSW
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDG+DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLDAMKELEE AR+CYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL +QLGE+YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLG QDGLHCLDVFRRSLLRS  KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRA+LKHNYFSNPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW

Query:  AIISLVAAVIL
        AIISL+AAV+L
Subjt:  AIISLVAAVIL

XP_023518140.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]2.7e-28093.15Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW
        MVAVFNKELLSWYLITLKL+ETVESGLPRNS SANSVDSHGKPE+QLQE +QIQSESHHVI+EDEDQKLEED ESPE+EWV++IKEKL+QAH DEVESSW
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDG+DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLDAMKELEE AR+CYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL LQLGE+YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLG QDGLHCLDVFRRSLLRS  KLAPKVWIKRRSH +RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQ+SEDVN YYNHRWNAWRA+LKHNYFSNPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW

Query:  AIISLVAAVIL
        AIISL+AAV+L
Subjt:  AIISLVAAVIL

XP_038880921.1 UPF0481 protein At3g47200-like [Benincasa hispida]7.8e-28895.89Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW
        MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDS GK E QLQE KQIQSESHHVIIEDEDQKLEEDPESPE+EWV+TIKEKLNQAH DEVESSW
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER KQDIK+YLDAMKELEE+ARNCYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL LQLG+HYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLGYQDGLHCLDVFRRSLLRS PKLAPKVW+KRRSHANRVADKRRQQLIHCVKELKEAG+RF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDIN+SYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW

Query:  AIISLVAAVIL
        AIISLVAAV+L
Subjt:  AIISLVAAVIL

TrEMBL top hitse value%identityAlignment
A0A0A0KID5 Uncharacterized protein8.7e-28594.16Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKL-EEDP--ESPETEWVVTIKEKLNQAHHDEVE
        MVAVFNKELLSWYLITLKLRETVESGLPRNS+SANSVDSHGK E+QLQE KQIQSESHHVI+E+EDQKL EEDP  ESP +EWV+TIKEKLNQAH DEVE
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKL-EEDP--ESPETEWVVTIKEKLNQAHHDEVE

Query:  SSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGC
        SSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHIL+R+KQDIK+YLDAMKELEE+ARNCYEGP S SSNEFVEMMVLDGC
Subjt:  SSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGC

Query:  FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA
        FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLG++YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN 
Subjt:  FVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA

Query:  TAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFL
        TAFDPLGYQDGLHCLDVFRRSLLRS PKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFL
Subjt:  TAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFL

Query:  NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFS
        NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDV+YLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFS
Subjt:  NLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFS

Query:  NPWAIISLVAAVIL
        NPWAIISL+AAV+L
Subjt:  NPWAIISLVAAVIL

A0A1S3B0V1 UPF0481 protein At3g472005.7e-28493.76Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLNQAHHDEVES
        MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+VIIE+ED KLEEDP  ESPE+EWV+TIKEKLNQAH DEVES
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLNQAHHDEVES

Query:  SWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF
        SWAKLCIYKVPHYLKDGEDKAVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLDAMKELEE+ARNCYEGP S SSNEFVEMMVLDGCF
Subjt:  SWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF

Query:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT
        VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLG++YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Subjt:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT

Query:  AFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN
        AFDPLGYQDGLHCLDVFRRSLLRS PKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLN
Subjt:  AFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN

Query:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN
        LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN
Subjt:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN

Query:  PWAIISLVAAVIL
        PWAIISL+AAV+L
Subjt:  PWAIISLVAAVIL

A0A5D3CR40 UPF0481 protein5.7e-28493.76Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLNQAHHDEVES
        MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+VIIE+ED KLEEDP  ESPE+EWV+TIKEKLNQAH DEVES
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDP--ESPETEWVVTIKEKLNQAHHDEVES

Query:  SWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF
        SWAKLCIYKVPHYLKDGEDKAVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLDAMKELEE+ARNCYEGP S SSNEFVEMMVLDGCF
Subjt:  SWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCF

Query:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT
        VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLG++YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Subjt:  VLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT

Query:  AFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN
        AFDPLGYQDGLHCLDVFRRSLLRS PKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLN
Subjt:  AFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLN

Query:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN
        LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN
Subjt:  LIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSN

Query:  PWAIISLVAAVIL
        PWAIISL+AAV+L
Subjt:  PWAIISLVAAVIL

A0A6J1HGP0 UPF0481 protein At3g47200-like2.2e-28093.15Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW
        MVAVFNKELLSWYLITLKL+ETVESGLPRNS SANSVDSHGKPE+QLQE +QIQSESHHVI+EDEDQKLEED ESPE+EWV++IKE L+QAH DEVESSW
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDG+DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLDAMKELEE AR+CYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL LQLGE+YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLG QDGLHCLDVFRRSLLRS  KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQ+SEDVN YYNHRWNAWRA+LKHNYFSNPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW

Query:  AIISLVAAVIL
        AIISL+AAV+L
Subjt:  AIISLVAAVIL

A0A6J1KY55 UPF0481 protein At3g47200-like7.6e-28193.15Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW
        MVAVFNKELLSWYLITLKL+ETVESGLPRNS S NSVDSHGKP++QLQE +QIQSESHHVI+EDEDQKLEED ESPE+EWV++IKEKL+QAH DEVESSW
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
        AKLCIYKVPHYLKDG+DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLDAMKELEE AR+CYEGP S SSNEFVEMMVLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLL +QLGE+YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DPLG QDGLHCLDVFRRSLLRS  KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW
        AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRA+LKHNYFSNPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW

Query:  AIISLVAAVIL
        AIISL+AAV+L
Subjt:  AIISLVAAVIL

SwissProt top hitse value%identityAlignment
P25776 Oryzain alpha chain7.0e-11461.56Show/hide
Query:  LQATFAMATATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNSEN----RTYKVGLNMFADLTNDE
        ++ + A+A A  LL LLS      S  +   RS+ E R +Y  W A+HGK+YN + E E+R+  F++NL +IDEHN+       ++++GLN FADLTN+E
Subjt:  LQATFAMATATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNSEN----RTYKVGLNMFADLTNDE

Query:  YRAVYLGTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGG
        YR  YLG R+ P R     +  S RY     + LPESVDWR +GAVA +K+QG CGSCWAFS IAAVEGINQIVTG+LISLSEQELV CD  YN GCNGG
Subjt:  YRAVYLGTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGG

Query:  LMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVG
        LMDYAF FII+NGG+DTE+DYPY+G D +CD  RKNAKVV+ID YEDV  + E +L+KAVA+QPVSVAIEA G A QLY SG+FTGKCG+ALDHGV AVG
Subjt:  LMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVG

Query:  YGTENGVDYWLVRNSWGTGWGEDGYFKLERNVK
        YGTENG DYW+VRNSWG  WGE GY ++ERN+K
Subjt:  YGTENGVDYWLVRNSWGTGWGEDGYFKLERNVK

P43297 Cysteine proteinase RD21A9.8e-11665.55Show/hide
Query:  RSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDR
        RS+ EV  IY+ WL KHGKA   N + E+++RF+IFK+NL F+DEHN +N +Y++GL  FADLTNDEYR+ YLG +          +  S RY   + D 
Subjt:  RSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDR

Query:  LPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPT
        LPES+DWR +GAVA VK+QG CGSCWAFSTI AVEGINQIVTG+LI+LSEQELV CD  YN GCNGGLMDYAF+FII NGG+DT++DYPY+G DG CD  
Subjt:  LPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPT

Query:  RKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNV
        RKNAKVV+ID YEDVP   E++LKKAVAHQP+S+AIEA G A QLY SG+F G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY ++ RN+
Subjt:  RKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNV

Q94B08 Germination-specific cysteine protease 11.2e-11660.46Show/hide
Query:  MATATTLLALLSFFFLSNSASALTR--------------RSDGEVREIYDLWLAKHGKAYNG----IEEREKRFQIFKENLNFIDEHNSENR--TYKVGL
        MA +T +L+LL  + + + AS                  R+D EVR IY  W A+HGK  N     I +++KRF IFK+NL FID HN +N+  TYK+GL
Subjt:  MATATTLLALLSFFFLSNSASALTR--------------RSDGEVREIYDLWLAKHGKAYNG----IEEREKRFQIFKENLNFIDEHNSENR--TYKVGL

Query:  NMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDR-LPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSC
          F DLTNDEYR +YLG R+ PARR+ KAK  +++Y+  +  + +PE+VDWR +GAV P+K+QG+CGSCWAFST AAVEGIN+IVTGELISLSEQELV C
Subjt:  NMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDR-LPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSC

Query:  DKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCG
        DK YN GCNGGLMDYAFQFI+ NGGL+TE+DYPY GF G+C+   KN++VVSIDGYEDVP  DE ALKKA+++QPVSVAIEA G   Q YQSG+FTG CG
Subjt:  DKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCG

Query:  SALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNG
        + LDH VVAVGYG+ENGVDYW+VRNSWG  WGE+GY ++ERN+  + +G
Subjt:  SALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNG

Q9FMH8 Probable cysteine protease RD21B4.0e-11766.78Show/hide
Query:  TRRSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVN
        T RSD EV  IY+ W+ +HGK     NG+  E+++RF+IFK+NL FIDEHN++N +YK+GL  FADLTN+EYR++YLG +  P +RV+K    S RY   
Subjt:  TRRSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVN

Query:  IRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQ
        + D LP+SVDWR  GAVA VK+QGSCGSCWAFSTI AVEGIN+IVTG+LISLSEQELV CD  YN GCNGGLMDYAF+FII NGG+DTE DYPY+  DG+
Subjt:  IRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQ

Query:  CDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLE
        CD  RKNAKVV+ID YEDVP + E +LKKA+AHQP+SVAIEA G A QLY SGVF G CG+ LDHGVVAVGYGTENG DYW+VRNSWG  WGE GY K+ 
Subjt:  CDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLE

Query:  RNVKHTT
        RN++  T
Subjt:  RNVKHTT

Q9LT78 Probable cysteine protease RD21C1.8e-11460.41Show/hide
Query:  TFAMATATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNS-ENRTYKVGLNMFADLTNDEYRAVYL
        T A+   + LL  LS   L +  +  T R++ E R +Y+ WL ++ K YNG+ E+E+RF+IFK+NL F++EH+S  NRTY+VGL  FADLTNDE+RA+YL
Subjt:  TFAMATATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNS-ENRTYKVGLNMFADLTNDEYRAVYL

Query:  GTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAF
         ++    R  +K +    +Y   + D LP+++DWRA+GAV PVK+QGSCGSCWAFS I AVEGINQI TGELISLSEQELV CD  YN GC GGLMDYAF
Subjt:  GTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAF

Query:  QFIIDNGGLDTEEDYPYEGFD-GQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTEN
        +FII+NGG+DTEEDYPY   D   C+  +KN +VV+IDGYEDVP +DEK+LKKA+A+QP+SVAIEA G A QLY SGVFTG CG++LDHGVVAVGYG+E 
Subjt:  QFIIDNGGLDTEEDYPYEGFD-GQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTEN

Query:  GVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGNSVHKMVA
        G DYW+VRNSWG+ WGE GYFKLERN+K ++    V  M +
Subjt:  GVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGNSVHKMVA

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)9.4e-20767.51Show/hide
Query:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW
        MVAVF K++LSWYL+TLK+RE +E+    + L     +  G PEI   +Q Q    +     +     ++E P+    +WV++I +KL QAH D+  + W
Subjt:  MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEVESSW

Query:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL
         KLCIY+VP+YL++ ++K+  PQ VSLGPYHHGK+RLR M+RHKWR++  +L+R  Q IK+Y+DAM+ELEEKAR CYEGPLSLSSNEF+EM+VLDGCFVL
Subjt:  AKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVL

Query:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
        ELFRGA EGF +LGY RNDP+FAMRGSMHSIQRDM+MLENQLPLFVL+RLLELQLG   Q GLVA+LA+RFFDPL P DEPLTKS  +KLE+SL    +F
Subjt:  ELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF

Query:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI
        DP      LHCLDVFRRSLLRSSPK  P++  KR S   RVADKRRQQLIHCV ELKEAGI+F+++KTDRFWD+ F NG ++IPRLLIHDGT+SLFLNLI
Subjt:  DPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLI

Query:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW
        AFEQCH+D SNDITSY++FMDNLIDSHEDV+YLHYCGIIEHWLGSD EVA+LFNRLCQEVV+D  DSYLS+LS +VNRYY+H+WNAWRATLKH YF+NPW
Subjt:  AFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPW

Query:  AIISLVAAVIL
        AI+S  AAVIL
Subjt:  AIISLVAAVIL

AT3G50130.1 Plant protein of unknown function (DUF247)6.4e-17163.76Show/hide
Query:  QKLEEDPESPETEWVVTIKEKLNQAHHDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAM
        QK  + PE    EWV++I++K+ QA  ++  +SW KLCIY+VP YL++   K+  PQ VSLGP+HHG + L  M+RHKWR++  ++ R K DI++Y+DAM
Subjt:  QKLEEDPESPETEWVVTIKEKLNQAHHDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAM

Query:  KELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAE
        KELE++AR CYEGP+ LSSN+F EM+VLDGCFVLELFRGA EGF +LGY RNDP+FAMRGSMHSIQRDM+MLENQLPLFVL+RLLE+QLG+ +Q GLV+ 
Subjt:  KELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAE

Query:  LALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQD--GLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQ
        LA+RFFDPL P DEPLTK+     + SL     F+P+  +D   LHCLDVFRR+LLR      P++   R S   RVADKR+QQLIHCV EL+EAGI+F+
Subjt:  LALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQD--GLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQ

Query:  KKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDI
         +KTDRFWDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH+D SNDITSY++FMDNLIDS EDV YLHYCGIIEHWLG+D EVA+LFNRLCQEV +D 
Subjt:  KKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDI

Query:  NDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVIL
         +SYLSQLS  V+R Y+ +WN  +A LKH YF+NPWA  S  AA++L
Subjt:  NDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVIL

AT3G50140.1 Plant protein of unknown function (DUF247)4.4e-16462.05Show/hide
Query:  QKLEEDPESPETEWVVTIKEKLNQAHHDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAM
        Q   + PE    EWV+ IK+K+ Q   D   +SW K+CIY+VP  LK  +  +  PQ VSLGPYHHG   LR M+ HKWR++  +++R KQ I++Y+DAM
Subjt:  QKLEEDPESPETEWVVTIKEKLNQAHHDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAM

Query:  KELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAE
        KELEE+AR CYEGP+ LSSN+F +M+VLDGCFVL+LFRGA EGF +LGY RNDP+FAMRGSMHSI+RDM+MLENQLPLFVL+RLLELQLG  YQ GLVA+
Subjt:  KELEEKARNCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAE

Query:  LALRFFDPLTPNDEPLTKSSLNKLESSL-GNATAFDPLG--YQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRF
        LA+RFF+PL P     T  S  K+E+S   N   F+P+    ++ LHCLDVFRRSLL+ S K  P++   R S    VADKR+QQL+HCV EL+EAGI+F
Subjt:  LALRFFDPLTPNDEPLTKSSLNKLESSL-GNATAFDPLG--YQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRF

Query:  QKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYD
        +++K+DRFWDI F NG ++IP+LLIHDGT+SLF NLIA+EQCH+D +NDITSY++FMDNLIDS ED+ YLHY  IIEHWLG+D EVA++FNRLCQEV +D
Subjt:  QKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYD

Query:  INDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVIL
        + ++YLS+LS  V+RYYN +WN  +ATLKH YFSNPWA  S  AAVIL
Subjt:  INDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVIL

AT3G50150.1 Plant protein of unknown function (DUF247)7.8e-15358.01Show/hide
Query:  QSESHHVIIEDEDQKL---EEDPESPETEWVVTIKEKLNQAHHDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYH
        Q+  +HV    E  K+   EE P     EWV++IK+K+ +A   +  +SW KLCIY+VP YL++ + K+ +PQ VS+GPYHHGK  LR MERHKWR++  
Subjt:  QSESHHVIIEDEDQKL---EEDPESPETEWVVTIKEKLNQAHHDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYH

Query:  ILERAKQDIKIYLDAMKELEEKARNCYEGPLSL-SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDR
        I+ R K +I++Y+DAMKELEE+AR CY+GP+ + +SNEF EM+VLDGCFVLELF+G  +GF+++GY RNDP+FA RG MHSIQRDMIMLENQLPLFVLDR
Subjt:  ILERAKQDIKIYLDAMKELEEKARNCYEGPLSL-SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDR

Query:  LLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQL
        LL LQ G   Q G+VAE+A+RFF  L P  E LTKS     E SL +    D LG   GLHCLDVF RSL++SS     +   +   + +    +++QQL
Subjt:  LLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQL

Query:  IHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEV
        IHCV EL+ AG+ F +K+T + WDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH   SN+ITSY++FMDNLI+S +DV+YLH+ GIIEHWLGSD EV
Subjt:  IHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEV

Query:  AELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVIL
        A+LFNRLC+EV++D  D YLSQLS +VNRYY+ +WN+ +ATL+  YF+NPWA  S  AAVIL
Subjt:  AELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVIL

AT3G50170.1 Plant protein of unknown function (DUF247)9.5e-18360.97Show/hide
Query:  VFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEI-------QLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEV
        + NK++L+WYL++LKLR+  ++   ++S      + HG PE+        +Q  KQ  SES   ++E+  ++   D       WV++I++KL QA  D+ 
Subjt:  VFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEI-------QLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLNQAHHDEV

Query:  ESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDG
         + W KLCIY+VPHYL++ + K+  PQ VSLGPYHHGK+RLR MERHKWR+L  +L+R KQ I++Y +AM+ELEEKAR CYEGP+SLS NEF EM+VLDG
Subjt:  ESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNEFVEMMVLDG

Query:  CFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN
        CFVLELFRG  EGF ++GY RNDP+FAMRG MHSIQRDMIMLENQLPLFVLDRLLELQLG   Q G+VA +A++FFDPL P  E LTK   +KL + L  
Subjt:  CFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN

Query:  ATAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLF
          + D LG +  LHCLDVFRRSLL+SSP    +  +KR +   RV DKR+QQL+HCV EL+EAG++F+K+KTDRFWDI F NG ++IP+LLIHDGT+SLF
Subjt:  ATAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLF

Query:  LNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF
         NLIAFEQCH++ SN ITSY++FMDNLI+S EDV+YLHYCGIIEHWLGSD EVA+LFNRLCQEVV+D  DS+LS+LS DVNRYYN +WN  +ATL H YF
Subjt:  LNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYF

Query:  SNPWAIISLVAAVIL
        +NPWA  S  AAVIL
Subjt:  SNPWAIISLVAAVIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTATAAATAACCATAGCCTCTTCCCCACTTTCTCCATAACTCAGAATCGAACAATCTTCCCTGTTCTCCAAGCAACCTTCGCCATGGCCACCGCCACCACTTT
GCTCGCCCTGCTCTCCTTCTTCTTCCTATCCAATTCTGCCTCCGCTCTCACACGCCGGAGCGATGGCGAGGTTAGAGAAATCTACGACCTGTGGCTGGCGAAGCACGGCA
AGGCCTATAACGGAATCGAAGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTGAACTTCATCGATGAACATAATTCAGAGAATCGGACTTATAAGGTTGGATTG
AACATGTTCGCGGATTTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCTCCTGCTCGACGAGTCATGAAGGCCAAGACCGCCAGCCGCCGATACGC
CGTCAACATCCGCGATCGGTTGCCGGAATCTGTCGATTGGAGGGCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAGCTGCTGGGCATTCTCGACCA
TAGCAGCTGTTGAAGGCATAAATCAGATCGTCACCGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAGCTGTGACAAAAAGTACAATTCAGGTTGCAATGGAGGC
CTTATGGACTATGCCTTCCAGTTCATCATTGACAATGGCGGCTTGGACACCGAGGAAGATTATCCTTATGAAGGCTTTGATGGTCAATGCGATCCCACCAGGAAAAATGC
CAAGGTCGTTAGCATTGACGGGTACGAGGATGTCCCTGCTGATGACGAGAAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGCGTCGCCATTGAAGCTAGTGGCT
TAGCTTTGCAACTCTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCGTCGCTGTTGGTTATGGCACAGAGAACGGAGTTGATTATTGG
CTTGTAAGGAACTCATGGGGCACAGGATGGGGTGAGGATGGCTACTTCAAGCTAGAGCGCAATGTAAAGCACACTACCAATGGCAATTCTGTGCACAAAATGGTGGCTGT
GTTCAATAAAGAGTTATTGAGCTGGTACCTGATCACCCTCAAGCTCAGAGAAACGGTAGAATCTGGACTTCCCAGAAACTCACTTTCAGCCAATTCTGTTGATTCTCATG
GAAAACCAGAAATCCAGCTCCAGGAACAGAAACAGATTCAATCAGAATCCCATCATGTCATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCCCGAATCACCTGAG
ACAGAATGGGTTGTCACCATCAAGGAAAAGCTTAACCAAGCTCATCATGATGAAGTAGAAAGTTCATGGGCAAAGCTCTGCATTTACAAGGTCCCTCACTACCTGAAAGA
TGGTGAAGACAAAGCTGTTGTTCCTCAGATTGTCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGGTCGCTTTATCACA
TCCTAGAAAGAGCAAAGCAGGACATAAAGATTTATCTGGACGCCATGAAAGAACTTGAAGAAAAAGCCCGTAATTGTTACGAAGGACCGCTTAGTTTAAGCAGCAATGAA
TTTGTGGAAATGATGGTGCTCGATGGTTGCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATCCAATCTTCGCAAT
GCGTGGCTCAATGCATTCGATCCAGAGGGATATGATAATGCTGGAAAATCAGTTGCCCTTGTTTGTATTGGATCGACTGCTCGAGCTTCAGCTTGGTGAGCACTACCAGA
AAGGACTCGTAGCCGAATTAGCACTCAGATTCTTCGATCCATTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATCTCTCGGAAACGCA
ACCGCCTTTGACCCGCTTGGTTATCAAGACGGACTTCATTGCCTCGATGTTTTTCGACGAAGTCTCCTCCGATCTAGCCCAAAATTAGCACCGAAAGTGTGGATCAAACG
GCGGTCTCATGCGAATCGGGTGGCCGATAAACGGAGGCAGCAATTGATTCACTGTGTGAAAGAGTTGAAAGAGGCAGGGATCAGATTTCAGAAGAAGAAAACCGATCGAT
TTTGGGACATAAACTTCAACAATGGGGTTATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTCGAACAATGTCATCTT
GATTGCAGCAATGACATAACCTCTTATGTGGTTTTCATGGATAATCTAATAGATTCTCATGAAGATGTTGCTTACCTCCATTACTGTGGAATAATAGAGCATTGGCTTGG
TAGTGATGAAGAAGTTGCAGAGCTTTTCAATCGTCTCTGTCAAGAGGTAGTTTATGATATCAATGATAGCTATCTTTCCCAATTGTCTGAGGATGTGAATCGCTACTACA
ACCATAGATGGAATGCTTGGAGAGCAACTTTGAAACACAACTACTTCAGTAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAATTCTAGAGGGCAGAGAAATTATG
CAACATGGTTCTCAATCCAATGATGTACAAAGGACAGCCATATTAATCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTATAAATAACCATAGCCTCTTCCCCACTTTCTCCATAACTCAGAATCGAACAATCTTCCCTGTTCTCCAAGCAACCTTCGCCATGGCCACCGCCACCACTTT
GCTCGCCCTGCTCTCCTTCTTCTTCCTATCCAATTCTGCCTCCGCTCTCACACGCCGGAGCGATGGCGAGGTTAGAGAAATCTACGACCTGTGGCTGGCGAAGCACGGCA
AGGCCTATAACGGAATCGAAGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTGAACTTCATCGATGAACATAATTCAGAGAATCGGACTTATAAGGTTGGATTG
AACATGTTCGCGGATTTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCTCCTGCTCGACGAGTCATGAAGGCCAAGACCGCCAGCCGCCGATACGC
CGTCAACATCCGCGATCGGTTGCCGGAATCTGTCGATTGGAGGGCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAGCTGCTGGGCATTCTCGACCA
TAGCAGCTGTTGAAGGCATAAATCAGATCGTCACCGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAGCTGTGACAAAAAGTACAATTCAGGTTGCAATGGAGGC
CTTATGGACTATGCCTTCCAGTTCATCATTGACAATGGCGGCTTGGACACCGAGGAAGATTATCCTTATGAAGGCTTTGATGGTCAATGCGATCCCACCAGGAAAAATGC
CAAGGTCGTTAGCATTGACGGGTACGAGGATGTCCCTGCTGATGACGAGAAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGCGTCGCCATTGAAGCTAGTGGCT
TAGCTTTGCAACTCTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCGTCGCTGTTGGTTATGGCACAGAGAACGGAGTTGATTATTGG
CTTGTAAGGAACTCATGGGGCACAGGATGGGGTGAGGATGGCTACTTCAAGCTAGAGCGCAATGTAAAGCACACTACCAATGGCAATTCTGTGCACAAAATGGTGGCTGT
GTTCAATAAAGAGTTATTGAGCTGGTACCTGATCACCCTCAAGCTCAGAGAAACGGTAGAATCTGGACTTCCCAGAAACTCACTTTCAGCCAATTCTGTTGATTCTCATG
GAAAACCAGAAATCCAGCTCCAGGAACAGAAACAGATTCAATCAGAATCCCATCATGTCATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCCCGAATCACCTGAG
ACAGAATGGGTTGTCACCATCAAGGAAAAGCTTAACCAAGCTCATCATGATGAAGTAGAAAGTTCATGGGCAAAGCTCTGCATTTACAAGGTCCCTCACTACCTGAAAGA
TGGTGAAGACAAAGCTGTTGTTCCTCAGATTGTCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGGTCGCTTTATCACA
TCCTAGAAAGAGCAAAGCAGGACATAAAGATTTATCTGGACGCCATGAAAGAACTTGAAGAAAAAGCCCGTAATTGTTACGAAGGACCGCTTAGTTTAAGCAGCAATGAA
TTTGTGGAAATGATGGTGCTCGATGGTTGCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATCCAATCTTCGCAAT
GCGTGGCTCAATGCATTCGATCCAGAGGGATATGATAATGCTGGAAAATCAGTTGCCCTTGTTTGTATTGGATCGACTGCTCGAGCTTCAGCTTGGTGAGCACTACCAGA
AAGGACTCGTAGCCGAATTAGCACTCAGATTCTTCGATCCATTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATCTCTCGGAAACGCA
ACCGCCTTTGACCCGCTTGGTTATCAAGACGGACTTCATTGCCTCGATGTTTTTCGACGAAGTCTCCTCCGATCTAGCCCAAAATTAGCACCGAAAGTGTGGATCAAACG
GCGGTCTCATGCGAATCGGGTGGCCGATAAACGGAGGCAGCAATTGATTCACTGTGTGAAAGAGTTGAAAGAGGCAGGGATCAGATTTCAGAAGAAGAAAACCGATCGAT
TTTGGGACATAAACTTCAACAATGGGGTTATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTCGAACAATGTCATCTT
GATTGCAGCAATGACATAACCTCTTATGTGGTTTTCATGGATAATCTAATAGATTCTCATGAAGATGTTGCTTACCTCCATTACTGTGGAATAATAGAGCATTGGCTTGG
TAGTGATGAAGAAGTTGCAGAGCTTTTCAATCGTCTCTGTCAAGAGGTAGTTTATGATATCAATGATAGCTATCTTTCCCAATTGTCTGAGGATGTGAATCGCTACTACA
ACCATAGATGGAATGCTTGGAGAGCAACTTTGAAACACAACTACTTCAGTAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAATTCTAGAGGGCAGAGAAATTATG
CAACATGGTTCTCAATCCAATGATGTACAAAGGACAGCCATATTAATCAAATAA
Protein sequenceShow/hide protein sequence
MEAINNHSLFPTFSITQNRTIFPVLQATFAMATATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNSENRTYKVGL
NMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGG
LMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGTENGVDYW
LVRNSWGTGWGEDGYFKLERNVKHTTNGNSVHKMVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPE
TEWVVTIKEKLNQAHHDEVESSWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDAMKELEEKARNCYEGPLSLSSNE
FVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGEHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA
TAFDPLGYQDGLHCLDVFRRSLLRSSPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHL
DCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFSNPWAIISLVAAVILEGREIM
QHGSQSNDVQRTAILIK