; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G11280 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G11280
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUncharacterised conserved protein UCP031088, alpha/beta hydrolase
Genome locationClcChr04:24837845..24841945
RNA-Seq ExpressionClc04G11280
SyntenyClc04G11280
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR002471 - Peptidase S9, serine active site
IPR016969 - Uncharacterised conserved protein UCP031088, alpha/beta hydrolase, At1g15070
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99015.1 putative catalytic [Cucumis melo var. makuwa]1.7e-23182.12Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RS+NR   L RF GQ + EPSWALRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY+PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILE+RGLGLST       TEQI SETL KQPLV  S Y++SV SS+S   GQTSNIA QLRQ
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLI+IIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFDNYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKV+PQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPLA               +P QN NVPV PIGPLLVIAHPLASRPPY+LAWLK QIS +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT
        SVPAKVL+QLSSVFEEGGL DR+GTF+YK++LRQGN+PILALAGD+DLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPL+ 
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT

Query:  DFLNHHDAI
        DFLN HD +
Subjt:  DFLNHHDAI

XP_004137487.1 uncharacterized protein LOC101216390 [Cucumis sativus]4.2e-23884.09Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RSEN  R LHRF GQ   EPSW LRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILEVRGLGLST       TE+I SETL KQPLVKAS Y++S  S++SSRDGQTSNIA QL Q
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLINIIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFD+YLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKVDPQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPL               KDPAQNFNVPV PIGPLLVIAHPLASRPPYVLAWLK Q+S +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT
        SVPAKVL+QLSSVFE+GGLRDR+GTF+YKD+LRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPL+ 
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT

Query:  DFLNHHDAI
        DFLN HD +
Subjt:  DFLNHHDAI

XP_008462055.1 PREDICTED: uncharacterized protein LOC103500486 isoform X1 [Cucumis melo]1.2e-22482.86Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RS+NR   L RF GQ + EPSWALRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY+PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILE+RGLGLST       TEQI SETL KQPLV  S Y++SV SS+S  DGQTSNIA QLRQ
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLI+IIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFDNYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKV+PQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPL               KDP QN NVPV PIGPLLVIAHPLASRPPY+LAWLK QIS +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL
        SVPAKVL+QLSSVFEEGGL DR+GTF+YK++LRQGN+PILALAGD+DLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL

XP_008462198.1 PREDICTED: uncharacterized protein LOC103500486 isoform X3 [Cucumis melo]3.7e-23482.51Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RS+NR   L RF GQ + EPSWALRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY+PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILE+RGLGLST       TEQI SETL KQPLV  S Y++SV SS+S  DGQTSNIA QLRQ
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLI+IIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFDNYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKV+PQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPL               KDP QN NVPV PIGPLLVIAHPLASRPPY+LAWLK QIS +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT
        SVPAKVL+QLSSVFEEGGL DR+GTF+YK++LRQGN+PILALAGD+DLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPL+ 
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT

Query:  DFLNHHDAI
        DFLN HD +
Subjt:  DFLNHHDAI

XP_038894452.1 uncharacterized protein LOC120083031 [Benincasa hispida]7.9e-25387.43Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATLSLS FDLSS  +S++RRR LHR   + KS+PSWALRRRNV+ VKS RAFYGGASGLN NKEKGLICTADELHYVSVPNSDWKLALWRYLPS+RAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILEVRGLGLSTDRG+MK TEQIRSETL+KQPLVKASTY+SS  S ISSRDGQTSNIA QLRQ
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLIN+IDGAQQL PFQPFNLQGVTSALEE QEQL VYEKYDWDFDNYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA IS CS KKVDPQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPL               +DPAQN NVPVIPIGPLLVIAHPLASRPPYVL+WLKGQISA+DMLHPTLLEKLVMNGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT
        SVPAKVL+QLSSVFEEGGL DRSGTFKYKDYLRQGNVP+LALAGDQDLICPPEAVYETVKEIP QLVSYKVLGK GGPHYAHYDIVGS LASSEVYPLIT
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT

Query:  DFLNHHDAI
        DFLN HD +
Subjt:  DFLNHHDAI

TrEMBL top hitse value%identityAlignment
A0A0A0LVT9 Uncharacterized protein2.0e-23884.09Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RSEN  R LHRF GQ   EPSW LRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILEVRGLGLST       TE+I SETL KQPLVKAS Y++S  S++SSRDGQTSNIA QL Q
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLINIIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFD+YLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKVDPQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPL               KDPAQNFNVPV PIGPLLVIAHPLASRPPYVLAWLK Q+S +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT
        SVPAKVL+QLSSVFE+GGLRDR+GTF+YKD+LRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPL+ 
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT

Query:  DFLNHHDAI
        DFLN HD +
Subjt:  DFLNHHDAI

A0A1S3CG41 uncharacterized protein LOC103500486 isoform X15.8e-22582.86Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RS+NR   L RF GQ + EPSWALRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY+PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILE+RGLGLST       TEQI SETL KQPLV  S Y++SV SS+S  DGQTSNIA QLRQ
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLI+IIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFDNYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKV+PQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPL               KDP QN NVPV PIGPLLVIAHPLASRPPY+LAWLK QIS +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL
        SVPAKVL+QLSSVFEEGGL DR+GTF+YK++LRQGN+PILALAGD+DLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL

A0A1S3CGE4 uncharacterized protein LOC103500486 isoform X31.8e-23482.51Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RS+NR   L RF GQ + EPSWALRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY+PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILE+RGLGLST       TEQI SETL KQPLV  S Y++SV SS+S  DGQTSNIA QLRQ
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLI+IIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFDNYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKV+PQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPL               KDP QN NVPV PIGPLLVIAHPLASRPPY+LAWLK QIS +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT
        SVPAKVL+QLSSVFEEGGL DR+GTF+YK++LRQGN+PILALAGD+DLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPL+ 
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT

Query:  DFLNHHDAI
        DFLN HD +
Subjt:  DFLNHHDAI

A0A1S3CGR6 uncharacterized protein LOC103500486 isoform X24.1e-22382.65Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RS+NR   L RF GQ + EPSWALRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY+PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILE+RGLGLST       TEQI SETL KQPLV  S Y++SV SS+S   GQTSNIA QLRQ
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLI+IIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFDNYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKV+PQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPL               KDP QN NVPV PIGPLLVIAHPLASRPPY+LAWLK QIS +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL
        SVPAKVL+QLSSVFEEGGL DR+GTF+YK++LRQGN+PILALAGD+DLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRL

A0A5D3BKY6 Putative catalytic8.3e-23282.12Show/hide
Query:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS
        MATL L  FDLSS  RS+NR   L RF GQ + EPSWALRRRNVV VKS +AFYGGA GLN NK  GLICTADELHYVSVPNSDWKLALWRY+PSLRAPS
Subjt:  MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPS

Query:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ
        RNHPLLLLSG+GSNALGYDLSPESSFARYMSN GYDTWILE+RGLGLST       TEQI SETL KQPLV  S Y++SV SS+S   GQTSNIA QLRQ
Subjt:  RNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQ

Query:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ
        WNKNLI+IIDGAQQL PFQPF +QGVTSALEE QEQLDVYEKYDWDFDNYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMIS CS KKV+PQ
Subjt:  WNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQ

Query:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG
        LASVVTLASSLDYRPSNSSLRLLLPLA               +P QN NVPV PIGPLLVIAHPLASRPPY+LAWLK QIS +DMLHPTLLEKLV+NGFG
Subjt:  LASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFG

Query:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT
        SVPAKVL+QLSSVFEEGGL DR+GTF+YK++LRQGN+PILALAGD+DLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPL+ 
Subjt:  SVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLIT

Query:  DFLNHHDAI
        DFLN HD +
Subjt:  DFLNHHDAI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15060.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase2.5e-13548.78Show/hide
Query:  KGLICTADELHYVSVPNSDWKLALWRYLPSLRAPSRNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTE-------
        K  +CTADELHYVSVPN+DW+LALWRYLP  +AP+RNHPLLLLSG+G+NA+GYDLSP  SFAR+MS  G++TWILEVRG GLST   ++K  E       
Subjt:  KGLICTADELHYVSVPNSDWKLALWRYLPSLRAPSRNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTE-------

Query:  -QIRS--------ETLSKQP-------------------LVKASTYDSS---------------------------------------------------
         QI S        ET S +                    + +AS +D S                                                   
Subjt:  -QIRS--------ETLSKQP-------------------LVKASTYDSS---------------------------------------------------

Query:  -VRSSISS--RDGQTSNIAAQLRQWNKNLINII-DGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLA
         +RS + S     Q S +  Q+R   + L+N+  DG + +SP      + +T+ +E+ Q+QLD+  KYDWDFD+YLEED+PAA+EY+R QSKP DGKL A
Subjt:  -VRSSISS--RDGQTSNIAAQLRQWNKNLINII-DGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLA

Query:  IGHSMGGILLYAMISCCSSKKVDPQLASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAW
        IGHSMGGILLYAM+S C+ +  +P +A+V TLASS+DY  SNS+L+LL+PLA               +PA+  +VPV+P+G LL  A PL++RPPYVL+W
Subjt:  IGHSMGGILLYAMISCCSSKKVDPQLASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAW

Query:  LKGQISADDMLHPTLLEKLVMNGFGSVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKP
        L   IS+ DM+HP +LEKLV+N F ++PAK+L+QL++ F EGGLRDRSG F YKD+L + +VP+LALAGD+DLICPP AV +TVK  P  LV+YK+LG+P
Subjt:  LKGQISADDMLHPTLLEKLVMNGFGSVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKP

Query:  GGPHYAHYDIVGSRLASSEVYPLITDFLNHHDA
         GPHYAHYD+VG RLA  +VYP IT+FL+HHD+
Subjt:  GGPHYAHYDIVGSRLASSEVYPLITDFLNHHDA

AT1G73750.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase1.9e-12450.45Show/hide
Query:  ICTADELHYVSVPNSDWKLALWRYLPSLRAPSRNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQ
        ICTADELHYV VPNSDW++ALWRYLPS +AP RNHPLLLLSGIG+NA+ YDLSPE SFAR MS  G+DTWILE+RG GLS+                   
Subjt:  ICTADELHYVSVPNSDWKLALWRYLPSLRAPSRNHPLLLLSGIGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQ

Query:  PLVKASTYDSSVRSSISSRDGQ---TSNIAAQLRQWNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQ
                  SV +++   + Q    SN+       ++ L N++DG  ++   Q      ++    + +++ ++   Y+WDFDNYLEED+P+AM+Y+R Q
Subjt:  PLVKASTYDSSVRSSISSRDGQ---TSNIAAQLRQWNKNLINIIDGAQQLSPFQPFNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQ

Query:  SKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQLASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPL
        +K  DGKLLA+GHSMGGILLYA++S C  K +D  LA V TLAS+ DY  S + L+ LLP+               K+PAQ  N+P++PI  +L +AHPL
Subjt:  SKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQLASVVTLASSLDYRPSNSSLRLLLPLASDRISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPL

Query:  ASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFGSVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQ
          RPPY L+WL   ISA  M+ P ++EKLV+N   +VP K+L+QL++  + GGLRDR+GTF YKD++ + NVPILALAGD D+ICPP+AVY+TVK IP  
Subjt:  ASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFGSVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPILALAGDQDLICPPEAVYETVKEIPRQ

Query:  LVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITDFLNHHD
        L +YKV+G PGGPHY H D++  R A +EVYPLIT FL   D
Subjt:  LVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITDFLNHHD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACCCTTTCGCTGTCTTATTTCGATCTCTCTTCGTTCAGCCGGAGCGAAAACCGCCGTCGTCAACTCCACCGATTCTGGGGACAACAGAAATCAGAACCCTCGTG
GGCCTTACGCCGACGGAATGTGGTCGGCGTGAAGTCCTTCAGGGCATTTTACGGCGGGGCGTCTGGATTGAATGTCAATAAGGAGAAGGGTTTGATCTGTACTGCCGACG
AGCTTCATTACGTCTCTGTTCCTAACTCTGATTGGAAACTTGCGCTCTGGCGTTACTTGCCTTCTCTTCGGGCGCCATCAAGGAATCATCCGCTGTTGCTGTTATCAGGG
ATTGGAAGCAATGCTCTTGGATATGACCTTTCTCCAGAGTCCTCATTTGCTCGCTACATGTCCAACCATGGATACGACACATGGATTCTTGAAGTTCGAGGATTGGGACT
TAGCACCGACAGAGGAGAAATGAAGTATACTGAACAGATACGATCTGAAACTCTGTCAAAACAGCCATTAGTGAAGGCTAGTACATATGACAGTTCTGTGCGTTCCAGTA
TTTCTTCCAGAGATGGACAGACTTCAAACATTGCTGCTCAACTTAGGCAATGGAATAAAAATCTTATCAATATAATTGACGGAGCTCAACAACTGAGTCCATTCCAGCCT
TTTAATTTACAAGGTGTTACCTCTGCTTTAGAAGAGATCCAGGAACAACTTGATGTGTATGAGAAGTATGATTGGGACTTTGACAACTACTTGGAAGAAGACATGCCTGC
TGCGATGGAGTACATAAGGAACCAATCCAAACCAAATGATGGCAAGTTACTAGCGATCGGCCACTCGATGGGGGGTATCCTGCTGTATGCTATGATCTCTTGCTGTAGCT
CTAAAAAAGTTGATCCGCAGTTGGCATCAGTTGTTACTCTGGCTTCATCACTTGATTACAGACCTTCAAATTCGTCACTCAGACTTCTTTTACCTTTGGCAAGTGATCGT
ATAAGCTATTGCAAATCCAACTTTAATGTACAGAAAGATCCTGCACAAAATTTTAATGTTCCAGTGATTCCCATTGGGCCATTGCTTGTTATTGCTCATCCTCTCGCATC
ACGTCCTCCTTATGTCTTGGCTTGGTTAAAGGGTCAAATCTCTGCAGACGACATGTTACATCCCACATTGCTTGAGAAGCTTGTGATGAATGGCTTTGGATCTGTGCCTG
CAAAGGTTCTTGTGCAGCTATCGTCTGTTTTTGAAGAGGGTGGCTTACGTGACAGGAGTGGTACATTCAAATACAAGGATTATCTACGCCAAGGCAATGTTCCAATCCTT
GCTCTTGCTGGAGACCAAGACCTTATTTGTCCACCTGAAGCTGTATATGAAACTGTGAAAGAAATTCCTAGGCAGTTGGTTTCCTACAAAGTTCTCGGCAAGCCTGGTGG
TCCTCACTATGCTCACTATGATATTGTGGGAAGTCGTTTGGCATCAAGTGAAGTATATCCATTGATAACCGATTTTCTCAACCACCATGACGCGATTTGA
mRNA sequenceShow/hide mRNA sequence
CGCCATTTTCAAACCATGTCATTTTCTTGCTTTCTAATTACCCACAAGCAATAACTTCTCCGCCATTCTATACTTCAAGCTCCGATCTTCTCCAACGCTCAGGAAATTAT
GGCGACCCTTTCGCTGTCTTATTTCGATCTCTCTTCGTTCAGCCGGAGCGAAAACCGCCGTCGTCAACTCCACCGATTCTGGGGACAACAGAAATCAGAACCCTCGTGGG
CCTTACGCCGACGGAATGTGGTCGGCGTGAAGTCCTTCAGGGCATTTTACGGCGGGGCGTCTGGATTGAATGTCAATAAGGAGAAGGGTTTGATCTGTACTGCCGACGAG
CTTCATTACGTCTCTGTTCCTAACTCTGATTGGAAACTTGCGCTCTGGCGTTACTTGCCTTCTCTTCGGGCGCCATCAAGGAATCATCCGCTGTTGCTGTTATCAGGGAT
TGGAAGCAATGCTCTTGGATATGACCTTTCTCCAGAGTCCTCATTTGCTCGCTACATGTCCAACCATGGATACGACACATGGATTCTTGAAGTTCGAGGATTGGGACTTA
GCACCGACAGAGGAGAAATGAAGTATACTGAACAGATACGATCTGAAACTCTGTCAAAACAGCCATTAGTGAAGGCTAGTACATATGACAGTTCTGTGCGTTCCAGTATT
TCTTCCAGAGATGGACAGACTTCAAACATTGCTGCTCAACTTAGGCAATGGAATAAAAATCTTATCAATATAATTGACGGAGCTCAACAACTGAGTCCATTCCAGCCTTT
TAATTTACAAGGTGTTACCTCTGCTTTAGAAGAGATCCAGGAACAACTTGATGTGTATGAGAAGTATGATTGGGACTTTGACAACTACTTGGAAGAAGACATGCCTGCTG
CGATGGAGTACATAAGGAACCAATCCAAACCAAATGATGGCAAGTTACTAGCGATCGGCCACTCGATGGGGGGTATCCTGCTGTATGCTATGATCTCTTGCTGTAGCTCT
AAAAAAGTTGATCCGCAGTTGGCATCAGTTGTTACTCTGGCTTCATCACTTGATTACAGACCTTCAAATTCGTCACTCAGACTTCTTTTACCTTTGGCAAGTGATCGTAT
AAGCTATTGCAAATCCAACTTTAATGTACAGAAAGATCCTGCACAAAATTTTAATGTTCCAGTGATTCCCATTGGGCCATTGCTTGTTATTGCTCATCCTCTCGCATCAC
GTCCTCCTTATGTCTTGGCTTGGTTAAAGGGTCAAATCTCTGCAGACGACATGTTACATCCCACATTGCTTGAGAAGCTTGTGATGAATGGCTTTGGATCTGTGCCTGCA
AAGGTTCTTGTGCAGCTATCGTCTGTTTTTGAAGAGGGTGGCTTACGTGACAGGAGTGGTACATTCAAATACAAGGATTATCTACGCCAAGGCAATGTTCCAATCCTTGC
TCTTGCTGGAGACCAAGACCTTATTTGTCCACCTGAAGCTGTATATGAAACTGTGAAAGAAATTCCTAGGCAGTTGGTTTCCTACAAAGTTCTCGGCAAGCCTGGTGGTC
CTCACTATGCTCACTATGATATTGTGGGAAGTCGTTTGGCATCAAGTGAAGTATATCCATTGATAACCGATTTTCTCAACCACCATGACGCGATTTGATTTTACCATACC
ATTCAACTCCTCTAGGTATATACATCCCACCAGTCGAAATAAGGAATAGTAGTATATCTTCCACGATAGAGTATACACAGATTTCTCGCTGTATGCACAATTGTAAAGCA
AACATACAATCCTGTGAAGCATACTTCTTGAAGTGAATTATTGGCATAGAAAGTTCTAGGTTATGTAATTTGGAATGGAATCTTTATACCAACTTCTGGAAATATTTCTC
CAGTAAAACTGAAGTCTCCCCATTTAGTTTATTCTTCACTCTGTGGAATCACTGGATAACAACTCAATTCAATAAATCGCAAGTCCATGTCATCCAAGATCCCAAGAGAT
TCAATTACAAAAAGGGTGCAATCATAAAATGTTTTTGCAAATTATAATTTGATTTCAACCACTAACTCAATCACGAGAGGCAGAGCTGTGCCATAATGAAAGCAGATTAC
AACATTTTACATGCTCATAAGTTGGATAAGAGTTTTCTTTCTTTTTTTTCCCTTTTTTTTTTTT
Protein sequenceShow/hide protein sequence
MATLSLSYFDLSSFSRSENRRRQLHRFWGQQKSEPSWALRRRNVVGVKSFRAFYGGASGLNVNKEKGLICTADELHYVSVPNSDWKLALWRYLPSLRAPSRNHPLLLLSG
IGSNALGYDLSPESSFARYMSNHGYDTWILEVRGLGLSTDRGEMKYTEQIRSETLSKQPLVKASTYDSSVRSSISSRDGQTSNIAAQLRQWNKNLINIIDGAQQLSPFQP
FNLQGVTSALEEIQEQLDVYEKYDWDFDNYLEEDMPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISCCSSKKVDPQLASVVTLASSLDYRPSNSSLRLLLPLASDR
ISYCKSNFNVQKDPAQNFNVPVIPIGPLLVIAHPLASRPPYVLAWLKGQISADDMLHPTLLEKLVMNGFGSVPAKVLVQLSSVFEEGGLRDRSGTFKYKDYLRQGNVPIL
ALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITDFLNHHDAI