; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0026786 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0026786
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionZinc finger family protein, putative isoform 1
Genome locationchr08:11693599..11697795
RNA-Seq ExpressionIVF0026786
SyntenyIVF0026786
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus]3.45e-31492.13Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYE+       YIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL

Query:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
        NN EFGKVKQ               GNGPVRSPSPAPTPQPHN HHPPTHHHHHHHTPL  AISPAPATEKGAPEYGSPAPER+AASP+RSYTA+PPGCQ
Subjt:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL
        YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN PANPS+APSPSGADRCHMITQWGFTL
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL

Query:  FLILARHM
        FLILA HM
Subjt:  FLILARHM

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus]0.092.13Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYE+       YIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL

Query:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
        NN EFGKVKQ               GNGPVRSPSPAPTPQPHN HHPPTHHHHHHHTPL  AISPAPATEKGAPEYGSPAPER+AASP+RSYTA+PPGCQ
Subjt:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL
        YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN PANPS+APSPSGADRCHMITQWGFTL
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL

Query:  FLILARHM
        FLILA HM
Subjt:  FLILARHM

XP_008455751.1 PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo]0.095.47Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYE+       YIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL

Query:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
        NNAEFGKVKQ               GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
Subjt:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL
        YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL

Query:  FLILARHM
        FLILARHM
Subjt:  FLILARHM

XP_031740216.1 uncharacterized protein LOC101216010 isoform X2 [Cucumis sativus]1.38e-30991.94Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYE+       YIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL

Query:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
        NN EFGKVKQ               GNGPVRSPSPAPTPQPHN HHPPTHHHHHHHTPL  AISPAPATEKGAPEYGSPAPER+AASP+RSYTA+PPGCQ
Subjt:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSP
        YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN PANPS+APSP
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSP

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida]1.46e-29284.88Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSG VADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDLGLNPSYRGHDIVATFNVER VSLLEDN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        +QLRTDIFEEF IPSIKV+ILSLE L GSNRTKVVFS+DPDTD+SEISSTYLSLIRS I SLVTNQFL ITKS FGEA+SFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYE+       Y+KLWNAEGSTVTAPTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL

Query:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
        NN EFGKVKQ               GNGP RSPSPAP PQPHN+  PPTHHHHHH T L  AISPAPATEKGAPEYGSPAPERS ASP+RSYTA+PPGCQ
Subjt:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSP--QHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDP----ANPSVAPSPS--GADRCHM
        Y  KRKSGRKEGKQSHLTPLASPN+SPDHSAASPSP  QH++NPPAAP+ PAPALTPLPNVIYAHVQPPSKS+SN P     NPS APSPS  GADRC M
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSP--QHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDP----ANPSVAPSPS--GADRCHM

Query:  ITQWGFTLFLILARHM
        ITQWGFTLFLILA HM
Subjt:  ITQWGFTLFLILARHM

TrEMBL top hitse value%identityAlignment
A0A0A0KYS3 Uncharacterized protein2.2e-25892.13Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDL LNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFS+DPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
        LQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYE+       YIKLWNAEGSTVT PTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL

Query:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
        NN EFGKVKQ               GNGPVRSPSPAPTPQPHN HHPPTHHHHHHHTPL  AISPAPATEKGAPEYGSPAPER+AASP+RSYTA+PPGCQ
Subjt:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL
        YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSN PANPS+APSPSGADRCHMITQWGFTL
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL

Query:  FLILARHM
        FLILA HM
Subjt:  FLILARHM

A0A1S3C173 uncharacterized protein LOC1034958525.3e-26895.47Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
        DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFL

Query:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
        LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYE+       YIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL
Subjt:  LQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGL

Query:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
        NNAEFGKVKQ               GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ
Subjt:  NNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQ

Query:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL
        YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL
Subjt:  YRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTL

Query:  FLILARHM
        FLILARHM
Subjt:  FLILARHM

A0A5A7SNH7 Zinc finger family protein, putative isoform 14.1e-22094.42Show/hide
Query:  RGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA
        +GHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA
Subjt:  RGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEA

Query:  YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTP
        YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYE+       YIKLWNAEGSTVTAPTIVQTSVLLEVGNTP
Subjt:  YSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTP

Query:  SMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGS
        SMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQ               GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGS
Subjt:  SMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGS

Query:  PAPERSAASPQRSYTAEPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANP
        PAPERSAASPQRSYTAEPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANP
Subjt:  PAPERSAASPQRSYTAEPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANP

Query:  SVAPSPSGADRCHMITQWGFTLFLILARHM
        SVAPSPSGADRCHMITQWGFTLFLILARHM
Subjt:  SVAPSPSGADRCHMITQWGFTLFLILARHM

A0A6J1EB56 uncharacterized protein LOC111432513 isoform X31.8e-20276.6Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGE P PSA+ S PS     GRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HY+DQKDLGLNPSYRGHDIVATF VER VSLL+DN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF
        ++LRTDIFEEFPIPSIKV+ILSL  LSGSNRTKVVF IDPDTDD EI STYLSLIRS   S+VTNQ FL ITKS FGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLG
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL AGLRLAPYE+       YIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSM+RLKQLAQTIS SNSSNLG
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLG

Query:  LNNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGC
        LNN EFGKVKQ               G GP+RSPSPAPTPQPHN HHPP+HHHHHHH PL   ISPAPA E GAPEYG  AP +SAASP+RSY A+PPGC
Subjt:  LNNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGC

Query:  QYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPAN--------PSVAPSPSGADRCH
        Q  YKRKSGRKEGKQ HL+PLASP+ISP HSAASPS QH        VSP  A TPLP+VIYAHVQPPSKSDSN P          PS +PSPS A    
Subjt:  QYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPAN--------PSVAPSPSGADRCH

Query:  MITQWGFTLFLILARHM
        MIT+WGFTL LI+A +M
Subjt:  MITQWGFTLFLILARHM

A0A6J1EEJ8 uncharacterized protein LOC111432513 isoform X41.0e-20276.89Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGKNDGE P PSA+ S PS     GRCC GCVSIRRLIGFRCIFILLLSVALFVSAV WLPPF+HY+DQKDLGLNPSYRGHDIVATF VER VSLL+DN 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF
        ++LRTDIFEEFPIPSIKV+ILSL  LSGSNRTKVVF IDPDTDD EI STYLSLIRS   S+VTNQ FL ITKS FGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLG
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL AGLRLAPYE+       YIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSM+RLKQLAQTIS SNSSNLG
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLG

Query:  LNNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGC
        LNN EFGKVKQ               G GP+RSPSPAPTPQPHN HHPP+HHHHHHH PL   ISPAPA E GAPEYG  AP +SAASP+RSY A+PPGC
Subjt:  LNNAEFGKVKQF--------------GNGPVRSPSPAPTPQPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGC

Query:  QYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDP----ANPSVAPSPSGADRCH--MI
        Q  YKRKSGRKEGKQ HL+PLASP+ISP HSAASPS QH        VSP  A TPLP+VIYAHVQPPSKSDSN P     +PS+ PSPS +   H  MI
Subjt:  QYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDP----ANPSVAPSPSGADRCH--MI

Query:  TQWGFTLFLILARHM
        T+WGFTL LI+A +M
Subjt:  TQWGFTLFLILARHM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)1.6e-3034.38Show/hide
Query:  ADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLG---LNPSYRGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIP-SIKV
        + GR C    S  RL+G RC+ +L+LS A+ +SA+ WL P    ++ K  G   LN S     + A+F +++ VS +  +  ++  DI     +  + KV
Subjt:  ADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLG---LNPSYRGHDIVATFNVERSVSLLEDNFDQLRTDIFEEFPIP-SIKV

Query:  NILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIH
         +LSL     SN T V F++ P   D EIS   LSL+RS    L   +  L +T S FG+  SF+VLKFPGGIT+ P + A +     +LF+ T+  SI 
Subjt:  NILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIH

Query:  QIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQ-----FG
         +Q     L    +  L L PYE        + +L N +GST++ P   Q  V   +      +RL    Q I  S + NLGL+ A FG+VK      + 
Subjt:  QIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQ-----FG

Query:  NGPVRSP----SPAPTP
        +G V       +PAPTP
Subjt:  NGPVRSP----SPAPTP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein8.0e-9145.26Show/hide
Query:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF
        MGK + +  L  A              C  C  I   +GF+C+F+LLLSVALF+SA+  L PF    D++D  L+P +RGH IVA+F++ RS S L +N 
Subjt:  MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNF

Query:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF
         QL+ DIF+E    SIKV IL++EP    N TKVVF IDPDT   EI    LS I+ +  S++ NQ  L +TKS FGE + FEVLKFPGGIT+IPPQSAF
Subjt:  DQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQ-FLSITKSTFGEAYSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLG
         LQK +I+FNFTLN+SIHQIQ++F+ L SQLK GL LAPYE        Y+ L N+EGSTV+ PT V +SVLL VG + S  RLKQL  TI+GS S NLG
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLG

Query:  LNNAEFGKVKQF---------GNGPVRSPSPAPTP-QPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPE---YGSPAPERSAASPQRSYTAEP---P
        LNN  FGKVKQ           +   +SPSP+P+P   H+HHH   HHHHHHH            + K APE     SPAP RS    +R+ +A P   P
Subjt:  LNNAEFGKVKQF---------GNGPVRSPSPAPTP-QPHNHHHPPTHHHHHHHTPLISAISPAPATEKGAPE---YGSPAPERSAASPQRSYTAEP---P

Query:  GCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPA-----PALTPLPNVIYAHVQPPSKSDSNDP-ANPSVAPSP---SGAD
        G +  +K K       Q   TP  +P        ++ +P HQ++ P AP+S A     P   PLP+V++AH   P  ++  +P AN    P P   S A 
Subjt:  GCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPA-----PALTPLPNVIYAHVQPPSKSDSNDP-ANPSVAPSP---SGAD

Query:  RCHMITQWGFTLFLILA
               W   L LI+A
Subjt:  RCHMITQWGFTLFLILA

AT3G56590.1 hydroxyproline-rich glycoprotein family protein5.7e-8944.93Show/hide
Query:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLL
        MGKN   EQ LP    A  +R +G      CC  C  I      RC+ IL  S A+F+SA+ WLPPF+ +AD  DL L+P ++ H IVA+F+V + +S +
Subjt:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLL

Query:  EDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP
        EDN  QL  DI +E   P  KV +L+LE L   NRT V+F+IDP+ ++S+I +   SLI++   +LV  Q    +T+S FGE + FEVLKFPGGIT+IPP
Subjt:  EDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNS
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQLK G+ LA YE        YI L N+ GSTV  PTIV +SVLL  G   S  RLKQLAQTI+ S+S
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNS

Query:  SNLGLNNAEFGKVKQ------FGNGPVRSPSPAPTPQPHNHHHPPTH-HHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQYR
         NLGLN+  FGKVKQ        + P  S +P+P+PQP  H +P  H HHHHHH  L    S +P T+  AP   + AP + +  P R+     P C Y 
Subjt:  SNLGLNNAEFGKVKQ------FGNGPVRSPSPAPTPQPHNHHHPPTH-HHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQYR

Query:  YKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPS
         +R  G          P  +P+ S  H    P+P    NP        P  +PLP+V++AH+ PPSKS          +PSP+
Subjt:  YKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSDSNDPANPSVAPSPS

AT3G56590.2 hydroxyproline-rich glycoprotein family protein6.8e-9045.4Show/hide
Query:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLL
        MGKN   EQ LP    A  +R +G      CC  C  I      RC+ IL  S A+F+SA+ WLPPF+ +AD  DL L+P ++ H IVA+F+V + +S +
Subjt:  MGKND-GEQPLP---SAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLL

Query:  EDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP
        EDN  QL  DI +E   P  KV +L+LE L   NRT V+F+IDP+ ++S+I +   SLI++   +LV  Q    +T+S FGE + FEVLKFPGGIT+IPP
Subjt:  EDNFDQLRTDIFEEFPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQF-LSITKSTFGEAYSFEVLKFPGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNS
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQLK G+ LA YE        YI L N+ GSTV  PTIV +SVLL  G   S  RLKQLAQTI+ S+S
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNS

Query:  SNLGLNNAEFGKVKQ------FGNGPVRSPSPAPTPQPHNHHHPPTH-HHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQYR
         NLGLN+  FGKVKQ        + P  S +P+P+PQP  H +P  H HHHHHH  L    S +P T+  AP   + AP + +  P R+     P C Y 
Subjt:  SNLGLNNAEFGKVKQ------FGNGPVRSPSPAPTPQPHNHHHPPTH-HHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQYR

Query:  YKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSD-SNDPA---NPSVAPSPSGA
         +R  G          P  +P+ S  H    P+P    NP        P  +PLP+V++AH+ PPSKS   ++P    +PS AP+PS A
Subjt:  YKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALTPLPNVIYAHVQPPSKSD-SNDPA---NPSVAPSPSGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGACTCCAGGCCCTCCGGTCTGGTTGCCGATGGTCGATGCTGTCGTGGGTGTGTTTCGATTCGAAGACT
CATTGGCTTCAGATGCATCTTCATTCTCCTATTATCCGTTGCCTTGTTCGTTTCTGCTGTTGTTTGGCTGCCCCCTTTTATCCATTATGCAGATCAAAAGGATCTGGGTC
TCAATCCGTCGTATCGAGGTCATGATATTGTAGCAACATTTAATGTTGAGAGATCGGTGTCTTTGCTGGAAGACAATTTTGATCAACTCCGAACCGACATTTTTGAAGAG
TTTCCTATACCTTCTATCAAAGTGAATATACTGTCTCTAGAACCATTGTCTGGATCAAACCGTACAAAAGTTGTATTCAGCATCGATCCAGATACTGATGATTCGGAAAT
CTCGTCAACGTATCTAAGTTTAATCAGGTCGATCATTACAAGTCTAGTAACAAATCAGTTCCTCAGCATTACCAAATCCACATTCGGGGAGGCCTATTCGTTTGAAGTGC
TGAAATTCCCTGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAAAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAA
GTACATTTCAGCGAACTGACCAGCCAACTGAAGGCGGGATTACGACTAGCTCCATATGAGTTGTGTTGTGCTAAATGCAGATTTTATATTAAACTTTGGAATGCGGAAGG
TTCAACTGTGACAGCCCCTACAATTGTCCAGACTTCTGTACTTCTTGAAGTTGGAAATACTCCGTCGATGCGACGGCTGAAGCAGCTGGCTCAAACAATCTCGGGTTCTA
ATTCTAGCAATCTCGGCCTGAATAATGCGGAGTTTGGTAAGGTGAAACAGTTCGGGAACGGCCCCGTAAGGTCGCCTTCTCCTGCTCCTACACCCCAGCCCCATAACCAC
CATCATCCTCCAACTCACCACCACCACCACCATCACACCCCTCTAATCTCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCACCAGAATATGGTTCGCCTGCCCC
TGAAAGAAGTGCAGCATCACCTCAGAGAAGTTATACGGCTGAGCCGCCCGGTTGTCAATATAGGTACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCATTTAA
CCCCGCTTGCGTCACCCAATATATCTCCTGATCATTCTGCTGCATCGCCATCACCACAACATCAAATAAACCCACCAGCAGCTCCCGTCTCACCAGCTCCGGCATTAACT
CCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCTAATGACCCCGCAAATCCATCAGTTGCACCATCTCCATCTGGCGCAGATCGTTGTCA
TATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCCATATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGACTCCAGGCCCTCCGGTCTGGTTGCCGATGGTCGATGCTGTCGTGGGTGTGTTTCGATTCGAAGACT
CATTGGCTTCAGATGCATCTTCATTCTCCTATTATCCGTTGCCTTGTTCGTTTCTGCTGTTGTTTGGCTGCCCCCTTTTATCCATTATGCAGATCAAAAGGATCTGGGTC
TCAATCCGTCGTATCGAGGTCATGATATTGTAGCAACATTTAATGTTGAGAGATCGGTGTCTTTGCTGGAAGACAATTTTGATCAACTCCGAACCGACATTTTTGAAGAG
TTTCCTATACCTTCTATCAAAGTGAATATACTGTCTCTAGAACCATTGTCTGGATCAAACCGTACAAAAGTTGTATTCAGCATCGATCCAGATACTGATGATTCGGAAAT
CTCGTCAACGTATCTAAGTTTAATCAGGTCGATCATTACAAGTCTAGTAACAAATCAGTTCCTCAGCATTACCAAATCCACATTCGGGGAGGCCTATTCGTTTGAAGTGC
TGAAATTCCCTGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAAAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAA
GTACATTTCAGCGAACTGACCAGCCAACTGAAGGCGGGATTACGACTAGCTCCATATGAGTTGTGTTGTGCTAAATGCAGATTTTATATTAAACTTTGGAATGCGGAAGG
TTCAACTGTGACAGCCCCTACAATTGTCCAGACTTCTGTACTTCTTGAAGTTGGAAATACTCCGTCGATGCGACGGCTGAAGCAGCTGGCTCAAACAATCTCGGGTTCTA
ATTCTAGCAATCTCGGCCTGAATAATGCGGAGTTTGGTAAGGTGAAACAGTTCGGGAACGGCCCCGTAAGGTCGCCTTCTCCTGCTCCTACACCCCAGCCCCATAACCAC
CATCATCCTCCAACTCACCACCACCACCACCATCACACCCCTCTAATCTCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCACCAGAATATGGTTCGCCTGCCCC
TGAAAGAAGTGCAGCATCACCTCAGAGAAGTTATACGGCTGAGCCGCCCGGTTGTCAATATAGGTACAAGAGGAAGTCTGGTAGGAAAGAAGGAAAGCAATCTCATTTAA
CCCCGCTTGCGTCACCCAATATATCTCCTGATCATTCTGCTGCATCGCCATCACCACAACATCAAATAAACCCACCAGCAGCTCCCGTCTCACCAGCTCCGGCATTAACT
CCATTGCCAAACGTCATTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCTAATGACCCCGCAAATCCATCAGTTGCACCATCTCCATCTGGCGCAGATCGTTGTCA
TATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCCATATGTAA
Protein sequenceShow/hide protein sequence
MGKNDGEQPLPSAIDSRPSGLVADGRCCRGCVSIRRLIGFRCIFILLLSVALFVSAVVWLPPFIHYADQKDLGLNPSYRGHDIVATFNVERSVSLLEDNFDQLRTDIFEE
FPIPSIKVNILSLEPLSGSNRTKVVFSIDPDTDDSEISSTYLSLIRSIITSLVTNQFLSITKSTFGEAYSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQ
VHFSELTSQLKAGLRLAPYELCCAKCRFYIKLWNAEGSTVTAPTIVQTSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNAEFGKVKQFGNGPVRSPSPAPTPQPHNH
HHPPTHHHHHHHTPLISAISPAPATEKGAPEYGSPAPERSAASPQRSYTAEPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPDHSAASPSPQHQINPPAAPVSPAPALT
PLPNVIYAHVQPPSKSDSNDPANPSVAPSPSGADRCHMITQWGFTLFLILARHM