CuGenDBv2

MC03g1187 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g1187
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Uncharacterised conserved protein UCP031088, alpha/beta hydrolase
Genome location: MC03:17957176..17963806
RNA-Seq Expression: MC03g1187
Synteny: MC03g1187
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domains:
  IPR016969 - Uncharacterised conserved protein UCP031088, alpha/beta hydrolase, At1g15070
  IPR029058 - Alpha/Beta hydrolase fold
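
The Genome location value of this record can be unpacked into a chromosome name and a coordinate pair. The following is a minimal Python sketch, assuming the `chrom:start..end` layout seen in this record and assuming the coordinates are 1-based and inclusive (the record itself does not state the convention). `Location` and `parse_location` are hypothetical names chosen for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(chrom, start, end)


loc = parse_location("MC03:17957176..17963806")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene region spans 6631 bases; with half-open coordinates the span would be one base shorter.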


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137193.1 uncharacterized protein LOC111008720 isoform X1 [Momordica charantia] (e value: 0.0; % identity: 98.77)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
        MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH

Query:  PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
        PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
Subjt:  PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ

Query:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
        NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
Subjt:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS

Query:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG
        VVTLGSSLDFRPSNSSLRLLLPL     +FNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG
Subjt:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG

Query:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
Subjt:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

XP_022137194.1 uncharacterized protein LOC111008720 isoform X2 [Momordica charantia] (e value: 0.0; % identity: 90.59)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
        MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH

Query:  PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
        PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
Subjt:  PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ

Query:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
        NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
Subjt:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS

Query:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG
        VVTLGSSLDFRPSNSSLRLLLPL     +FNVPAFPIGPLL                                        ESVPSKVLVQLSTVFEEGG
Subjt:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG

Query:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
Subjt:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

XP_022994272.1 uncharacterized protein LOC111490055 isoform X2 [Cucurbita maxima] (e value: 6.97e-271; % identity: 78.3)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LSR DL SI RSD    +L+  R++ K +  W L RRN++AV+   AF GG      NKEK SICTADELHYVSVPNSDW+LALWRY PS +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T EK+P A+ G Y SS GSS+SSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHL Q+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASS+VYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

XP_023541739.1 uncharacterized protein LOC111801808 [Cucurbita pepo subsp. pepo] (e value: 1.06e-272; % identity: 78.7)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LS  DL SI RSD    YL+  R++ K +  W L RRN++AV+   AF GG      NKEK SICTADELHYVSVPNSDW+LALWRY PS +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENE-QIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T +K+P A+ G Y SS GSSISSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENE-QIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHLCQ+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASSEVYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

XP_038894452.1 uncharacterized protein LOC120083031 [Benincasa hispida] (e value: 6.20e-273; % identity: 78.46)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAV---RAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LSR DL SI +S     +L+ +RQ  K    W LRRRNVIAV   RAF GGA GL  NKEK  ICTADELHYVSVPNSDW+LALWRY PS RAPS
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAV---RAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQ
        RNHPLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRG GLSTDR +MK+ EQIRS TL K+P  +   Y SS GS ISS+DG+TS IATQLRQ
Subjt:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQ

Query:  WNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQ
        WN+NL+++I+GAQQLGP +PFNLQGVTSALE FQEQL +YEKYDWDFD+YLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSF KVDPQ
Subjt:  WNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQ

Query:  LASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFE
        LASVVTL SSLD+RPSNSSLRLLLPL   + + NVP  PIGPLL IAHPLASRPPY+LSWLKGQISA DMLHPTLLEKLV++G+ SVP+KVL+QLS+VFE
Subjt:  LASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFE

Query:  EGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EGGL DR+G F+YKD+L Q N+P+LALAGDQD+ICPPEAVYETVK+IP + VSYKVLGK GGPHYAHYDIVGS LASSEVYPLIT+FLNRHD
Subjt:  EGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C6J5 uncharacterized protein LOC111008720 isoform X2 (e value: 0.0; % identity: 90.59)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
        MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH

Query:  PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
        PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
Subjt:  PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ

Query:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
        NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
Subjt:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS

Query:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG
        VVTLGSSLDFRPSNSSLRLLLPL     +FNVPAFPIGPLL                                        ESVPSKVLVQLSTVFEEGG
Subjt:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG

Query:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
Subjt:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

A0A6J1C9M1 uncharacterized protein LOC111008720 isoform X1 (e value: 0.0; % identity: 98.77)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
        MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH

Query:  PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
        PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
Subjt:  PLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ

Query:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
        NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
Subjt:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS

Query:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG
        VVTLGSSLDFRPSNSSLRLLLPL     +FNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG
Subjt:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGG

Query:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
Subjt:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

A0A6J1GS92 uncharacterized protein LOC111457026 isoform X2 (e value: 8.91e-267; % identity: 77.28)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LS   L S+ RSD    +L+  R++ K +  W L RRN++AV+   AF GG      +KEK  ICTADELHYVSVPNSDW+LALWRY P  +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENE-QIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T EK+P A+ G Y SS GSSISSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENE-QIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA+IS CSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHL Q+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASSEVYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

A0A6J1JVB9 uncharacterized protein LOC111490055 isoform X2 (e value: 3.37e-271; % identity: 78.3)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LSR DL SI RSD    +L+  R++ K +  W L RRN++AV+   AF GG      NKEK SICTADELHYVSVPNSDW+LALWRY PS +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T EK+P A+ G Y SS GSS+SSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHL Q+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASS+VYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

A0A6J1K0R1 uncharacterized protein LOC111490055 isoform X1 (e value: 4.88e-271; % identity: 78.3)
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LSR DL SI RSD    +L+  R++ K +  W L RRN++AV+   AF GG      NKEK SICTADELHYVSVPNSDW+LALWRY PS +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T EK+P A+ G Y SS GSS+SSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHL Q+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASS+VYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G15060.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase (e value: 4.3e-137; % identity: 48.24)
Query:  RRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVR
        R  ++  RAF   +  L     K S+CTADELHYVSVPN+DWRLALWRY P P+AP+RNHPLLLLSGVG+NA+GYDLSPG SFAR+MS QG++TWILEVR
Subjt:  RRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVR

Query:  GSGLST---DRVEMKENEQIRSGTLEKRPSAETGKY----------------------VSSVG-------------------------------------
        G+GLST   D  +++E+    S  +E    A  GK                       VS VG                                     
Subjt:  GSGLST---DRVEMKENEQIRSGTLEKRPSAETGKY----------------------VSSVG-------------------------------------

Query:  ---------------------SSISSK------DGETSTIATQLRQWNQNLMDIIEGAQQLGPLRPFNLQ-GVTSALEGFQEQLDMYEKYDWDFDHYLEE
                             + I SK        + S +  Q+R   Q L+++ +  Q+       +LQ  +T+ +E FQ+QLD+  KYDWDFDHYLEE
Subjt:  ---------------------SSISSK------DGETSTIATQLRQWNQNLMDIIEGAQQLGPLRPFNLQ-GVTSALEGFQEQLDMYEKYDWDFDHYLEE

Query:  DLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASR
        D+PAA+EY+R QSKP DGKL AIGHSMGGILLYA +SRC+F   +P +A+V TL SS+D+  SNS+L+LL+PLA  +++ +VP  P+G LL  A PL++R
Subjt:  DLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASR

Query:  PPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVS
        PPY+LSWL   IS+ DM+HP +LEKLVL+ + ++P+K+L+QL+T F EGGLRDR+G F YKDHL ++++P+LALAGD+D+ICPP AV +TVK  P   V+
Subjt:  PPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVS

Query:  YKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        YK+LG+P GPHYAHYD+VG RLA  +VYP IT FL+ HD
Subjt:  YKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

AT1G73750.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase (e value: 3.0e-122; % identity: 50.23)
Query:  SSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLE
        + ICTADELHYV VPNSDWR+ALWRY PSP+AP RNHPLLLLSG+G+NA+ YDLSP  SFAR MS  G+DTWILE+RG+GLS+                 
Subjt:  SSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGSNALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLE

Query:  KRPSAETGKYVSSVGSSISSKDGETSTIATQLRQW---NQNLMDIIEGAQQLGPLRPFNLQGVTSALEG-FQEQLDMYEKYDWDFDHYLEEDLPAAMEYI
                    SV +++   + +   ++  L  +   ++ L ++++G  ++       +Q   S   G F+++ ++   Y+WDFD+YLEED+P+AM+Y+
Subjt:  KRPSAETGKYVSSVGSSISSKDGETSTIATQLRQW---NQNLMDIIEGAQQLGPLRPFNLQGVTSALEG-FQEQLDMYEKYDWDFDHYLEEDLPAAMEYI

Query:  RNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLK
        R Q+K  DGKLLA+GHSMGGILLYA +SRC F  +D  LA V TL S+ D+  S + L+ LLP+   + + N+P  PI  +L +AHPL  RPPY LSWL 
Subjt:  RNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLK

Query:  GQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGG
          ISA  M+ P ++EKLVL+   +VP K+L+QL+T  + GGLRDR G F YKDH+ ++N+PILALAGD DIICPP+AVY+TVK IP    +YKV+G PGG
Subjt:  GQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGG

Query:  PHYAHYDIVGSRLASSEVYPLITNFLNRHD
        PHY H D++  R A +EVYPLIT FL + D
Subjt:  PHYAHYDIVGSRLASSEVYPLITNFLNRHD
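
In BLAST-style alignments such as the ones above, the unlabelled middle row marks identical positions with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below estimates percent identity for one alignment block from that middle row. It assumes the `Query:` label and its padding have already been stripped from both rows, and `percent_identity` is a hypothetical helper name. Values computed this way for a single block need not match the per-hit figures listed in this record, which cover the whole alignment.

```python
def percent_identity(query_row: str, midline_row: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    width = len(query_row)
    if width == 0:
        raise ValueError("empty alignment row")
    # Trailing spaces of the midline are often lost when text is copied,
    # so pad it back to the alignment width before counting.
    midline = midline_row.ljust(width)[:width]
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / width


# Toy rows for illustration (not taken from a specific hit above).
query = "MATLPLSR"
midline = "MATL LS "
print(f"{percent_identity(query, midline):.1f}")
```

Gap columns in the query row count toward the alignment width here, which follows the usual BLAST convention of dividing identities by alignment length.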


Sequences
CDS sequence
ATGGCCACTCTTCCATTGTCTCGCTCGGATCTCTGTTCGATCAGCCGGAGCGACGGCCTCTGCTGTTACCTGAACCCAGTCCGGCAACGACGGAAACCAGATGGCTGTTG
GGTCTTACGCCGACGAAATGTGATCGCGGTCAGGGCGTTTGACGGTGGGGCATTCGGTTTGTACGGCAACAAGGAGAAGAGTTCAATCTGTACTGCCGACGAGCTTCATT
ACGTCTCTGTTCCCAACTCCGATTGGAGGCTCGCTCTGTGGCGTTACCCCCCTTCTCCTCGGGCGCCGTCAAGGAATCATCCGCTTTTGCTGTTATCAGGGGTTGGGAGC
AATGCTCTTGGCTATGACCTTTCTCCAGGGTCCTCATTTGCTCGCTACATGTCCAACCAAGGATATGATACTTGGATTCTTGAAGTTCGAGGATCTGGCCTTAGTACTGA
CAGAGTAGAAATGAAGGAAAATGAGCAGATACGCTCTGGAACTCTGGAGAAACGTCCATCAGCCGAGACTGGCAAATATGTTAGTTCTGTGGGTTCCAGCATTTCTTCAA
AAGATGGAGAGACTTCTACAATTGCTACTCAACTTAGGCAATGGAATCAAAATCTTATGGATATAATCGAAGGAGCTCAACAACTGGGTCCATTGCGGCCTTTTAACTTA
CAAGGTGTTACCTCTGCATTAGAAGGCTTCCAGGAACAACTTGATATGTATGAGAAATATGATTGGGACTTCGACCACTACTTGGAAGAAGATTTGCCCGCTGCGATGGA
GTACATAAGGAACCAATCTAAACCCAACGATGGCAAGTTACTAGCAATTGGCCACTCGATGGGGGGTATCTTGCTATATGCAAAGATCTCACGATGTAGCTTTAACAAAG
TTGACCCGCAGTTGGCATCAGTTGTGACTTTGGGTTCATCACTTGACTTCAGACCTTCAAATTCATCACTCAGATTGCTATTACCTTTGGCAAGTGATTCTTTTAATGTT
CCTGCGTTTCCCATTGGGCCATTGCTTGGTATTGCTCATCCCCTCGCATCGCGTCCTCCTTATATCTTGTCCTGGTTAAAGGGTCAAATTTCTGCAGGAGACATGCTACA
TCCTACCTTGCTTGAGAAGCTTGTTCTTGATGGCTATGAATCTGTGCCTTCAAAGGTTCTCGTGCAGCTATCAACTGTTTTTGAAGAAGGGGGCTTGCGCGACAGGAACG
GGATGTTCCAGTACAAGGACCATCTATGCCAAAGCAACATTCCAATCCTTGCACTTGCTGGAGACCAAGACATCATTTGTCCACCTGAAGCTGTTTATGAAACTGTGAAG
CAAATTCCCTGGCGGTATGTTTCCTACAAAGTTCTTGGGAAGCCTGGTGGTCCTCACTATGCTCACTATGATATCGTGGGAAGTCGTTTGGCATCAAGCGAAGTATATCC
ATTGATAACCAATTTTCTCAACCGTCATGACTAG
mRNA sequence
CTCGCCGCGAATTTATGGCCTATTCCCAATGCGTGACGCAACCTCCGCCTTCACCAAGTTTTGTTTCTTTCCTCTGATTCTTCTTTTTTCCTTAAAATCTTGTACGAATC
CTGGTTTCCCACATTATAATCGAGCCTACTTTTCAGTTTTCTTTCCTACCAATTACCCACAAGCAATAACCTCTCCACGCTATATAACTTCAACCCAAGATCTTCTTCCA
ACCCAGAGATTATGGCCACTCTTCCATTGTCTCGCTCGGATCTCTGTTCGATCAGCCGGAGCGACGGCCTCTGCTGTTACCTGAACCCAGTCCGGCAACGACGGAAACCA
GATGGCTGTTGGGTCTTACGCCGACGAAATGTGATCGCGGTCAGGGCGTTTGACGGTGGGGCATTCGGTTTGTACGGCAACAAGGAGAAGAGTTCAATCTGTACTGCCGA
CGAGCTTCATTACGTCTCTGTTCCCAACTCCGATTGGAGGCTCGCTCTGTGGCGTTACCCCCCTTCTCCTCGGGCGCCGTCAAGGAATCATCCGCTTTTGCTGTTATCAG
GGGTTGGGAGCAATGCTCTTGGCTATGACCTTTCTCCAGGGTCCTCATTTGCTCGCTACATGTCCAACCAAGGATATGATACTTGGATTCTTGAAGTTCGAGGATCTGGC
CTTAGTACTGACAGAGTAGAAATGAAGGAAAATGAGCAGATACGCTCTGGAACTCTGGAGAAACGTCCATCAGCCGAGACTGGCAAATATGTTAGTTCTGTGGGTTCCAG
CATTTCTTCAAAAGATGGAGAGACTTCTACAATTGCTACTCAACTTAGGCAATGGAATCAAAATCTTATGGATATAATCGAAGGAGCTCAACAACTGGGTCCATTGCGGC
CTTTTAACTTACAAGGTGTTACCTCTGCATTAGAAGGCTTCCAGGAACAACTTGATATGTATGAGAAATATGATTGGGACTTCGACCACTACTTGGAAGAAGATTTGCCC
GCTGCGATGGAGTACATAAGGAACCAATCTAAACCCAACGATGGCAAGTTACTAGCAATTGGCCACTCGATGGGGGGTATCTTGCTATATGCAAAGATCTCACGATGTAG
CTTTAACAAAGTTGACCCGCAGTTGGCATCAGTTGTGACTTTGGGTTCATCACTTGACTTCAGACCTTCAAATTCATCACTCAGATTGCTATTACCTTTGGCAAGTGATT
CTTTTAATGTTCCTGCGTTTCCCATTGGGCCATTGCTTGGTATTGCTCATCCCCTCGCATCGCGTCCTCCTTATATCTTGTCCTGGTTAAAGGGTCAAATTTCTGCAGGA
GACATGCTACATCCTACCTTGCTTGAGAAGCTTGTTCTTGATGGCTATGAATCTGTGCCTTCAAAGGTTCTCGTGCAGCTATCAACTGTTTTTGAAGAAGGGGGCTTGCG
CGACAGGAACGGGATGTTCCAGTACAAGGACCATCTATGCCAAAGCAACATTCCAATCCTTGCACTTGCTGGAGACCAAGACATCATTTGTCCACCTGAAGCTGTTTATG
AAACTGTGAAGCAAATTCCCTGGCGGTATGTTTCCTACAAAGTTCTTGGGAAGCCTGGTGGTCCTCACTATGCTCACTATGATATCGTGGGAAGTCGTTTGGCATCAAGC
GAAGTATATCCATTGATAACCAATTTTCTCAACCGTCATGACTAGGGTAGATTTCACCATACCATCCAACTCTAGATATATACATTTCATCAGTCAAAAGAACAGTTGTA
TATCTTCCTTATTGATAGAATATCCACGAAGATCTATTTCCTAAGATCTGCTGTATACAAAATTGTAAAGCAAACACACAATCCTGTGGAGGGTATATTACTGGCTTCGA
AAATTGTGGGCTGTGTAATTTTCCTCAGATAATAATAGAAATTGTCCAAGCAATAATGTTTTTATCAACTTTTCAAAATATTGCTTCAGTAAAACTGAAGTCTTATATTT
CTCCATTTGTTTAATTCGTAAATACAACTTGATTCAATAAAAGGCAGGGAGGTCAGCTAAAAGACCCAAGAGATTCGATTTCAAGCCATAGGAACGATGTAATCGTAAAA
TGATTTTGCAAATCAGCATAAGATTTCGACTATTGACTCAATCACAAAAGGCAGAGCCACACCATAATGTAAACAGATTACATCAATTTACATCTTGAAAAGCTTGGTAA
GAACCTGGTTTTTTTAGTTTTGCCCGACAAAATCTCGTCGGAAGAACATAAGAAAGGCATCGAACCTTGAGGTCTTAAAGCTATGCAATCAAGTCAACCCAAAAGCACCA
TTACTTAAAATCTCTGCAGAACTGAGGAATAAGAAAATGGCGCCCCATCATATCTCTGATAAGTAGAAAGGGAAGCTTCACACAAATGAATGAATGCTTTGCTCATGTAA
AAAGGCAATGAAATCTATACACCACATAGGGGAGAAAACAAGCAGGCAGAGTCGAAGAGGTGCTAGACGTGGTGGGGCAAACCGTTCAGCCTCGGCCCACCACTGCTGTC
TTACAGACCGGGCACATATTCTTCTGTGCAAGCCACTGTTTTATACAGTGTATGTGATAGCTGTGCCCACATTCCAGCTTACCCATTTCATCGTCTACTTCATAGTCCTC
CTGCTTGTACAAAATGATTTTAGGTGTCAAGAATGAGGTCGGAAAGAACAATTACATATATGAGTTCTAATCAAAACCACTAAGCTGGACTCGAGTAACTACAACTTTAC
CTGGCAGATACTGCACTTCCGATCTACTTGAGATAATAAATGGGATGCTAGCTCATTTACGATAAAAGGCTTCATCTTCCTAATACATTTGCCGATCTCGTCTTCTTTCA
GCCCGGTACTCACATGACCAATCCTCTCACCGAGTTCAAGCAGCTCCTGTTTCCATCTCGAGATAACAAAGATGGGTTAACATCAGATTGTTGCAGGAAAGAAGAGGTAG
AAGGAATTGGTGAAAGTTATGAACCTCATATGACATGTTATCAACATCGAGCCGCAATTCCCTAAATTGATCGTGAGTATCAAACCTTCCACCCATCAGCAAACTGCTCT
GGAACATCATAATCTGCTATAAATAATATATCAAGTGAGCATACCCGTATTATATGATCATTATGAAACAAGGGAGGGAGGGAGTGACAATACAGAAACATGATAAAATT
TACACTATATGATTCAAAATATGGTGTATATTGCAACATTAATCTATGCTTTCTTCCATGATTCCTATCATTAACTATAAACTTGCTGTTGGAATCAATGGATGAGATCA
AAGACATTCCAAATAGGAAACGTAATCTAGCAAAACAGATTAGCTTGTAAGCAAGTTCAAATTGTTTCTTTGAGCCGATGCCATCTGCCAAACCATCTCATAAAAGCAGA
ATATCTAATTTTTCAAATATCGGAACCTTACCAAGACATCCAAATGTGATCTGCCAAACCATTTCAAGCAAATAAAGAACAGAAAAATAAAACACTGAAGCTTATATAGT
TATATTTATGGCATTGGGTACAAATATGAATGATGGCTCACCTCAGCCAGCCCGTCCGGGGATGGATGCCGAACATGGCGATAATACCGACTCCGAGACACTTCCAATGA
CCGTGCTGTTGGAAGATCAGAATCAGAATCCAAAAACAAGAGGGTCTCGGGGTTCACTGTTCGCCTTCCTAAGCAAGAACGCTGGCGATGGTAGAAGCAGAGGAAGTTAA
AACTATAACACATTAAACAAGAACAAGAGTTTCTCATTTTTATAGCCCCAAAGGAGGACTTCATTCGCAGGACCTTCAAATTTAATACATACCCATTTGCATTAAATATT
CAAATTTACAAAGAAGAATGAAAAGGGAGCAAGTTCCAAGAATGGAAGTTGAACAGGTTCAAACTGATTTAGAACCATATTGCAGGCAAGAAAACAAACCAAGCCGAGGA
AGTTCAGTAAGAGCCCATATCTGAAGAACTAGTAATAAGATGACTAAAGCAGAACAAAATAAACATCTCAAGTTTCTTTACCAACGAATGCTTAACCCACAACCATAGAA
GAGAGATGTTCAAGAATGTTAAGAAGAAATAGAAAAGCGAGTTACCTCCCTCTGGCTCATCTTCTCCACATCAATTTTTCCCCTCCCAGAAGCATGCCTTCTCGCGACAA
CGCAATCCACAGAAGCAGCAGCATCAGGAGAGAATCCAATTCCAGGGCCGCACCAAACATCTTGAGCGTCTAAACAGCTAGCAGAATTGAAATTCGAATTTGGGTGAAAA
TAAGTTCCATCACCAATTCCTTGCTGGGTTTTGTTCTTGGAGCTCTTCTGCTTTTTTTTCCTCGTCTTCTTCTTCTGCCAATCCGCGGAAGTTCGAATTACAGCCGGGAC
CGATACCTGCTGAGAAGCGGAGGCAGTGCATCCGAGTCCCCTGAAAGTTGCAGAAGAGAAGTTGCTCTTCTTCCTATTCGTGACACTTGAAGAGGGTAAAGATTCAGTGG
CGCCGCTATTGGAAAAGGTCGAGAGAAGGAGAGAGGAAATGGTGGATTTGCAGCGAGTGGATTGAATTATCGAAGGGATTGATGGATTTGGATCTGATTCTGAAATGGGT
TGGCTCAAATGGCCTCTGGGTCTTCTCAATTTGATGTGTTCACCCACTGTAGTAGTCTGTGTAACAACAGGCATGACTCTTAAACAAGAATTAGAATGTAACAAAGAGAA
GACGAGGTGGAAAAGGGAAGTTAAGAGAAGAAATGGAAATGAATGAAAGAACAAATACAACTGCAATTTAAGCCGTTTGCAAGAGAAAGGGGAAATGGATTATGGAATGA
AGTGGGACTGAATTTGAGTTGAAAGAGAGAAAAGCATAAAACTTTAATAAACTTTAATTTAATCAAAGCTTTTTGTTCCCTTCCCAGATTTGAATGCAGAGGTATGAAGA
AGGGGAAAGAAAAAAAGAGTGATTATGAAGCGAATTTCACTCTCTCATTTCTTTTCCTTTTTTTTTTCTCCTCCTTTCCTGTGCTGTTTGTTAGGTGGGCTGTGTTTGTC
TTCTTGTTGGAGTTTGTGGGTATGTGCCATCATAGAGAGCAGAGTCTCACAGACTCAGGAATCTGTGAACAACCTTTTTTGC
Protein sequence
MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGS
NALGYDLSPGSSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQNLMDIIEGAQQLGPLRPFNL
QGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPLASDSFNV
PAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLVQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVK
QIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
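
The protein sequence can be cross-checked against the CDS by reading the CDS three bases at a time. The following is a minimal Python sketch, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper written for illustration, and the short strings passed to it are the first and last codons of the CDS listed in this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCACTCTTCCATTG"))  # first six codons of the CDS
print(translate("CGTCATGACTAG"))        # last four codons, ending in a stop
```

Joining the CDS lines and passing the whole string to `translate` should reproduce the listed protein if the standard code is the right one; whitespace between lines is removed inside the function.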