; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021663 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021663
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUncharacterised conserved protein UCP031088, alpha/beta hydrolase
Genome locationscaffold348:159715..162665
RNA-Seq ExpressionMS021663
SyntenyMS021663
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR016969 - Uncharacterised conserved protein UCP031088, alpha/beta hydrolase, At1g15070
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137193.1 uncharacterized protein LOC111008720 isoform X1 [Momordica charantia]4.5e-28298.36Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
        MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH

Query:  PLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
        PLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
Subjt:  PLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ

Query:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
        NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
Subjt:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS

Query:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGG
        VVTLGSSLDFRPSNSSLRLLLPL     +FNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVL+QLSTVFEEGG
Subjt:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGG

Query:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
Subjt:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

XP_022137194.1 uncharacterized protein LOC111008720 isoform X2 [Momordica charantia]1.9e-25190.18Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
        MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH

Query:  PLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
        PLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
Subjt:  PLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ

Query:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
        NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
Subjt:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS

Query:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGG
        VVTLGSSLDFRPSNSSLRLLLPL     +FNVPAFPIGPLL                                        ESVPSKVL+QLSTVFEEGG
Subjt:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGG

Query:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
Subjt:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

XP_022994270.1 uncharacterized protein LOC111490055 isoform X1 [Cucurbita maxima]1.1e-21678.5Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LSR DL SI RSD    +L+  R++ K +  W L RRN++AV+   AF G      GNKEK SICTADELHYVSVPNSDW+LALWRY PS +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T EK+P A+ G Y SS GSS+SSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHL Q+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASS+VYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

XP_023541739.1 uncharacterized protein LOC111801808 [Cucurbita pepo subsp. pepo]7.9e-21878.7Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LS  DL SI RSD    YL+  R++ K +  W L RRN++AV+   AF G      GNKEK SICTADELHYVSVPNSDW+LALWRY PS +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENE-QIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP+SSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T +K+P A+ G Y SS GSSISSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENE-QIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHLCQ+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASSEVYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

XP_038894452.1 uncharacterized protein LOC120083031 [Benincasa hispida]1.2e-21878.86Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIA---VRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LSR DL SI +S     +L+ +RQ  K    W LRRRNVIA   VRAF GGA GL  NKEK  ICTADELHYVSVPNSDW+LALWRY PS RAPS
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIA---VRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQ
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRG GLSTDR +MK+ EQIRS TL K+P  +   Y SS GS ISS+DG+TS IATQLRQ
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQ

Query:  WNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQ
        WN+NL+++I+GAQQLGP +PFNLQGVTSALE FQEQL +YEKYDWDFD+YLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSF KVDPQ
Subjt:  WNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQ

Query:  LASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFE
        LASVVTL SSLD+RPSNSSLRLLLPL   + + NVP  PIGPLL IAHPLASRPPY+LSWLKGQISA DMLHPTLLEKLV++G+ SVP+KVLMQLS+VFE
Subjt:  LASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFE

Query:  EGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EGGL DR+G F+YKD+L Q N+P+LALAGDQD+ICPPEAVYETVK+IP + VSYKVLGK GGPHYAHYDIVGS LASSEVYPLIT+FLNRHD
Subjt:  EGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

TrEMBL top hitse value%identityAlignment
A0A6J1C6J5 uncharacterized protein LOC111008720 isoform X29.0e-25290.18Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
        MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH

Query:  PLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
        PLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
Subjt:  PLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ

Query:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
        NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
Subjt:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS

Query:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGG
        VVTLGSSLDFRPSNSSLRLLLPL     +FNVPAFPIGPLL                                        ESVPSKVL+QLSTVFEEGG
Subjt:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGG

Query:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
Subjt:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

A0A6J1C9M1 uncharacterized protein LOC111008720 isoform X12.2e-28298.36Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
        MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNH

Query:  PLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
        PLLLLSGVGSNALGYDLSP SSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ
Subjt:  PLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQ

Query:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
        NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS
Subjt:  NLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLAS

Query:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGG
        VVTLGSSLDFRPSNSSLRLLLPL     +FNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVL+QLSTVFEEGG
Subjt:  VVTLGSSLDFRPSNSSLRLLLPLAS--DSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGG

Query:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
Subjt:  LRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

A0A6J1GS92 uncharacterized protein LOC111457026 isoform X22.2e-21377.28Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LS   L S+ RSD    +L+  R++ K +  W L RRN++AV+   AF G      G+KEK  ICTADELHYVSVPNSDW+LALWRY P  +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENE-QIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP+SSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T EK+P A+ G Y SS GSSISSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENE-QIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA+IS CSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHL Q+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASSEVYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

A0A6J1JVB9 uncharacterized protein LOC111490055 isoform X25.5e-21778.5Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LSR DL SI RSD    +L+  R++ K +  W L RRN++AV+   AF G      GNKEK SICTADELHYVSVPNSDW+LALWRY PS +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T EK+P A+ G Y SS GSS+SSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHL Q+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASS+VYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

A0A6J1K0R1 uncharacterized protein LOC111490055 isoform X15.5e-21778.5Show/hide
Query:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS
        MATL LSR DL SI RSD    +L+  R++ K +  W L RRN++AV+   AF G      GNKEK SICTADELHYVSVPNSDW+LALWRY PS +A S
Subjt:  MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVR---AFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLST RVE K+   QIRS T EK+P A+ G Y SS GSS+SSK G+ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKEN-EQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLR

Query:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP
         WN+NL++IIEGAQQLGPL+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSFNKVDP
Subjt:  QWNQNLMDIIEGAQQLGPLRPFNLQGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDP

Query:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF
        QLASVVTL SSLD+RPSNSSLRLLLPL   + +FNVP FPIGPLL IAHPLASRPPY+L WLK QIS  DML PTLLEKLVL+G+ESVP+KVL+QLS+VF
Subjt:  QLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVF

Query:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        EEGGLRDRNG FQY DHL Q+N+PILA+AGDQD ICPPEAVYETVK IP  +VSY+VLGKPGGPHY+HYD+VGSRLASS+VYPLIT+FLNRHD
Subjt:  EEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15060.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase6.2e-13648.05Show/hide
Query:  RRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVR
        R  ++  RAF   +  L     K S+CTADELHYVSVPN+DWRLALWRY P P+AP+RNHPLLLLSGVG+NA+GYDLSP  SFAR+MS QG++TWILEVR
Subjt:  RRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVR

Query:  GSGLST---DRVEMKENEQIRSGTLEKRPSAETGKY----------------------VSSVG-------------------------------------
        G+GLST   D  +++E+    S  +E    A  GK                       VS VG                                     
Subjt:  GSGLST---DRVEMKENEQIRSGTLEKRPSAETGKY----------------------VSSVG-------------------------------------

Query:  ---------------------SSISSK------DGETSTIATQLRQWNQNLMDIIEGAQQLGPLRPFNLQ-GVTSALEGFQEQLDMYEKYDWDFDHYLEE
                             + I SK        + S +  Q+R   Q L+++ +  Q+       +LQ  +T+ +E FQ+QLD+  KYDWDFDHYLEE
Subjt:  ---------------------SSISSK------DGETSTIATQLRQWNQNLMDIIEGAQQLGPLRPFNLQ-GVTSALEGFQEQLDMYEKYDWDFDHYLEE

Query:  DLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASR
        D+PAA+EY+R QSKP DGKL AIGHSMGGILLYA +SRC+F   +P +A+V TL SS+D+  SNS+L+LL+PLA  +++ +VP  P+G LL  A PL++R
Subjt:  DLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPLA--SDSFNVPAFPIGPLLGIAHPLASR

Query:  PPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVS
        PPY+LSWL   IS+ DM+HP +LEKLVL+ + ++P+K+L+QL+T F EGGLRDR+G F YKDHL ++++P+LALAGD+D+ICPP AV +TVK  P   V+
Subjt:  PPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVS

Query:  YKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD
        YK+LG+P GPHYAHYD+VG RLA  +VYP IT FL+ HD
Subjt:  YKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD

AT1G73750.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase2.7e-12350.47Show/hide
Query:  SSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLE
        + ICTADELHYV VPNSDWR+ALWRY PSP+AP RNHPLLLLSG+G+NA+ YDLSPE SFAR MS  G+DTWILE+RG+GLS+                 
Subjt:  SSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLE

Query:  KRPSAETGKYVSSVGSSISSKDGETSTIATQLRQW---NQNLMDIIEGAQQLGPLRPFNLQGVTSALEG-FQEQLDMYEKYDWDFDHYLEEDLPAAMEYI
                    SV +++   + +   ++  L  +   ++ L ++++G  ++       +Q   S   G F+++ ++   Y+WDFD+YLEED+P+AM+Y+
Subjt:  KRPSAETGKYVSSVGSSISSKDGETSTIATQLRQW---NQNLMDIIEGAQQLGPLRPFNLQGVTSALEG-FQEQLDMYEKYDWDFDHYLEEDLPAAMEYI

Query:  RNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLK
        R Q+K  DGKLLA+GHSMGGILLYA +SRC F  +D  LA V TL S+ D+  S + L+ LLP+   + + N+P  PI  +L +AHPL  RPPY LSWL 
Subjt:  RNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPL--ASDSFNVPAFPIGPLLGIAHPLASRPPYILSWLK

Query:  GQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGG
          ISA  M+ P ++EKLVL+   +VP K+L+QL+T  + GGLRDR G F YKDH+ ++N+PILALAGD DIICPP+AVY+TVK IP    +YKV+G PGG
Subjt:  GQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVKQIPWRYVSYKVLGKPGG

Query:  PHYAHYDIVGSRLASSEVYPLITNFLNRHD
        PHY H D++  R A +EVYPLIT FL + D
Subjt:  PHYAHYDIVGSRLASSEVYPLITNFLNRHD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACTCTTCCATTGTCTCGCTCGGATCTCTGTTCGATCAGCCGGAGCGACGGCCTCTGCTGTTACCTGAACCCAGTCCGGCAGCGACGGAAACCAGATGGCTGCTG
GGTCTTACGCCGACGAAATGTGATCGCGGTCAGGGCGTTTGACGGTGGGGCATTCGGTTTGTACGGCAACAAGGAGAAGAGTTCAATCTGTACTGCCGACGAGCTTCATT
ACGTCTCTGTTCCCAACTCCGATTGGAGGCTCGCTCTGTGGCGTTACCCCCCTTCTCCTCGGGCGCCGTCAAGGAATCATCCGCTTTTGCTGTTATCAGGGGTTGGGAGC
AATGCTCTTGGCTATGACCTTTCTCCAGAGTCCTCATTTGCTCGCTACATGTCCAACCAAGGATATGATACTTGGATTCTTGAAGTTCGAGGATCTGGCCTTAGTACTGA
CAGAGTAGAAATGAAGGAAAATGAGCAGATACGCTCTGGAACTCTGGAGAAACGTCCATCAGCCGAGACTGGCAAATATGTTAGTTCTGTGGGTTCCAGCATTTCTTCAA
AAGATGGAGAGACTTCTACAATTGCTACTCAACTTAGGCAATGGAATCAAAATCTTATGGATATAATCGAAGGAGCTCAACAACTGGGTCCATTGCGGCCTTTTAACTTA
CAAGGTGTTACCTCTGCATTAGAAGGCTTCCAGGAACAACTTGATATGTATGAGAAATATGATTGGGACTTCGACCACTACTTGGAAGAAGATTTGCCCGCTGCGATGGA
GTACATAAGGAACCAATCTAAACCCAACGATGGCAAGTTACTAGCAATTGGCCACTCGATGGGGGGTATCTTGCTATATGCAAAGATCTCACGATGCAGCTTTAACAAAG
TTGACCCGCAGTTGGCATCAGTTGTGACTTTGGGTTCATCACTTGACTTCAGACCTTCAAATTCATCACTCAGATTGCTATTACCTTTGGCAAGTGATTCTTTTAATGTT
CCTGCGTTTCCCATTGGGCCATTGCTTGGTATTGCTCATCCCCTCGCATCGCGTCCTCCTTATATCTTGTCCTGGTTAAAGGGTCAAATTTCTGCAGGAGACATGCTACA
TCCTACCTTGCTTGAGAAGCTTGTTCTTGATGGCTATGAATCTGTGCCTTCAAAGGTTCTCATGCAGCTATCAACTGTTTTTGAAGAAGGGGGCTTGCGCGACAGGAACG
GGATGTTCCAGTACAAGGACCATCTATGCCAAAGCAACATTCCAATCCTTGCACTTGCTGGAGACCAAGACATCATTTGTCCACCTGAAGCTGTTTATGAAACTGTGAAA
CAAATTCCCTGGCGGTATGTTTCCTACAAAGTTCTTGGGAAGCCTGGTGGTCCTCACTATGCTCACTATGATATCGTGGGAAGTCGTTTGGCATCAAGCGAAGTATATCC
ATTGATAACCAATTTTCTCAACCGTCATGAC
mRNA sequenceShow/hide mRNA sequence
ATGGCCACTCTTCCATTGTCTCGCTCGGATCTCTGTTCGATCAGCCGGAGCGACGGCCTCTGCTGTTACCTGAACCCAGTCCGGCAGCGACGGAAACCAGATGGCTGCTG
GGTCTTACGCCGACGAAATGTGATCGCGGTCAGGGCGTTTGACGGTGGGGCATTCGGTTTGTACGGCAACAAGGAGAAGAGTTCAATCTGTACTGCCGACGAGCTTCATT
ACGTCTCTGTTCCCAACTCCGATTGGAGGCTCGCTCTGTGGCGTTACCCCCCTTCTCCTCGGGCGCCGTCAAGGAATCATCCGCTTTTGCTGTTATCAGGGGTTGGGAGC
AATGCTCTTGGCTATGACCTTTCTCCAGAGTCCTCATTTGCTCGCTACATGTCCAACCAAGGATATGATACTTGGATTCTTGAAGTTCGAGGATCTGGCCTTAGTACTGA
CAGAGTAGAAATGAAGGAAAATGAGCAGATACGCTCTGGAACTCTGGAGAAACGTCCATCAGCCGAGACTGGCAAATATGTTAGTTCTGTGGGTTCCAGCATTTCTTCAA
AAGATGGAGAGACTTCTACAATTGCTACTCAACTTAGGCAATGGAATCAAAATCTTATGGATATAATCGAAGGAGCTCAACAACTGGGTCCATTGCGGCCTTTTAACTTA
CAAGGTGTTACCTCTGCATTAGAAGGCTTCCAGGAACAACTTGATATGTATGAGAAATATGATTGGGACTTCGACCACTACTTGGAAGAAGATTTGCCCGCTGCGATGGA
GTACATAAGGAACCAATCTAAACCCAACGATGGCAAGTTACTAGCAATTGGCCACTCGATGGGGGGTATCTTGCTATATGCAAAGATCTCACGATGCAGCTTTAACAAAG
TTGACCCGCAGTTGGCATCAGTTGTGACTTTGGGTTCATCACTTGACTTCAGACCTTCAAATTCATCACTCAGATTGCTATTACCTTTGGCAAGTGATTCTTTTAATGTT
CCTGCGTTTCCCATTGGGCCATTGCTTGGTATTGCTCATCCCCTCGCATCGCGTCCTCCTTATATCTTGTCCTGGTTAAAGGGTCAAATTTCTGCAGGAGACATGCTACA
TCCTACCTTGCTTGAGAAGCTTGTTCTTGATGGCTATGAATCTGTGCCTTCAAAGGTTCTCATGCAGCTATCAACTGTTTTTGAAGAAGGGGGCTTGCGCGACAGGAACG
GGATGTTCCAGTACAAGGACCATCTATGCCAAAGCAACATTCCAATCCTTGCACTTGCTGGAGACCAAGACATCATTTGTCCACCTGAAGCTGTTTATGAAACTGTGAAA
CAAATTCCCTGGCGGTATGTTTCCTACAAAGTTCTTGGGAAGCCTGGTGGTCCTCACTATGCTCACTATGATATCGTGGGAAGTCGTTTGGCATCAAGCGAAGTATATCC
ATTGATAACCAATTTTCTCAACCGTCATGAC
Protein sequenceShow/hide protein sequence
MATLPLSRSDLCSISRSDGLCCYLNPVRQRRKPDGCWVLRRRNVIAVRAFDGGAFGLYGNKEKSSICTADELHYVSVPNSDWRLALWRYPPSPRAPSRNHPLLLLSGVGS
NALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTDRVEMKENEQIRSGTLEKRPSAETGKYVSSVGSSISSKDGETSTIATQLRQWNQNLMDIIEGAQQLGPLRPFNL
QGVTSALEGFQEQLDMYEKYDWDFDHYLEEDLPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAKISRCSFNKVDPQLASVVTLGSSLDFRPSNSSLRLLLPLASDSFNV
PAFPIGPLLGIAHPLASRPPYILSWLKGQISAGDMLHPTLLEKLVLDGYESVPSKVLMQLSTVFEEGGLRDRNGMFQYKDHLCQSNIPILALAGDQDIICPPEAVYETVK
QIPWRYVSYKVLGKPGGPHYAHYDIVGSRLASSEVYPLITNFLNRHD