CuGenDBv2

HG10017546 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10017546
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Zinc finger family protein, putative isoform 1
Genome location: Chr03:15464545..15468928
RNA-Seq Expression: HG10017546
Synteny: HG10017546
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domains: NA
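
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the `Chr:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the page does not state this). The function names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr03:15464545..15468928' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr03:15464545..15468928")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene model spans 4384 bp on Chr03. This is the genomic extent and need not equal the CDS length.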


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025811.1 Zinc finger family protein, putative isoform 1 [Cucumis melo var. makuwa] | 7.3e-204 | 88.86
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF
        GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
        SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSMRRLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA
        LAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPERSA
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA

Query:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL
        ASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+ P     NP 
Subjt:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL

Query:  VAPSPSPSGADRRRMITQWGFTLFLILARHM
        VA  PSPSGADR  MITQWGFTLFLILARHM
Subjt:  VAPSPSPSGADRRRMITQWGFTLFLILARHM

KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus] | 9.2e-207 | 90.02
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF
        GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
        SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA
        LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSDGNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATEKGAPEY SPAPER+A
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA

Query:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL
        ASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+HP     NP 
Subjt:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL

Query:  VAPSPSPSGADRRRMITQWGFTLFLILARHM
        +A  PSPSGADR  MITQWGFTLFLILA HM
Subjt:  VAPSPSPSGADRRRMITQWGFTLFLILARHM

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus] | 9.2e-207 | 90.02
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF
        GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
        SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA
        LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSDGNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATEKGAPEY SPAPER+A
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA

Query:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL
        ASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+HP     NP 
Subjt:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL

Query:  VAPSPSPSGADRRRMITQWGFTLFLILARHM
        +A  PSPSGADR  MITQWGFTLFLILA HM
Subjt:  VAPSPSPSGADRRRMITQWGFTLFLILARHM

XP_008455751.1 PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo] | 7.3e-204 | 88.86
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF
        GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
        SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSMRRLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA
        LAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPERSA
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA

Query:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL
        ASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+ P     NP 
Subjt:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL

Query:  VAPSPSPSGADRRRMITQWGFTLFLILARHM
        VA  PSPSGADR  MITQWGFTLFLILARHM
Subjt:  VAPSPSPSGADRRRMITQWGFTLFLILARHM

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida] | 5.4e-207 | 90.72
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF
        GHDIVATFNVERPVSLLEDNIEQL+TDIFEEF IPSIKVDILSLE L GSNRTKVVFSLDPD D+ EISSTYLSLIRSTI SLVTNQFLRITKSMFGEAF
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
        SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA
        LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN  +PPT HHHHHHT LTPAISPAPATEKGAPEY SPAPERS 
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA

Query:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL
        ASPKRSY AKPPGCQY  KRKSGRKEGKQSHLTPLASP +SPDHSAASPSP PQH+VNPPAAP+  APALTPLPNV+YAHVQPPSKS+S+HPEKSTTNP 
Subjt:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL

Query:  VAPSPSPSGADRRRMITQWGFTLFLILARHM
         APSPSPSGADR  MITQWGFTLFLILA HM
Subjt:  VAPSPSPSGADRRRMITQWGFTLFLILARHM

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KYS3 Uncharacterized protein | 4.4e-207 | 90.02
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF
        GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
        SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SVLLEVGNTPSMRRLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA
        LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSDGNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATEKGAPEY SPAPER+A
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA

Query:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL
        ASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+HP     NP 
Subjt:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL

Query:  VAPSPSPSGADRRRMITQWGFTLFLILARHM
        +A  PSPSGADR  MITQWGFTLFLILA HM
Subjt:  VAPSPSPSGADRRRMITQWGFTLFLILARHM

A0A1S3C173 uncharacterized protein LOC103495852 | 3.5e-204 | 88.86
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF
        GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
        SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSMRRLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA
        LAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPERSA
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA

Query:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL
        ASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+ P     NP 
Subjt:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL

Query:  VAPSPSPSGADRRRMITQWGFTLFLILARHM
        VA  PSPSGADR  MITQWGFTLFLILARHM
Subjt:  VAPSPSPSGADRRRMITQWGFTLFLILARHM

A0A5A7SNH7 Zinc finger family protein, putative isoform 1 | 3.5e-204 | 88.86
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF
        GHDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGEA+
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
        SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSMRRLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA
        LAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPERSA
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSA

Query:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL
        ASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+ P     NP 
Subjt:  ASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPL

Query:  VAPSPSPSGADRRRMITQWGFTLFLILARHM
        VA  PSPSGADR  MITQWGFTLFLILARHM
Subjt:  VAPSPSPSGADRRRMITQWGFTLFLILARHM

A0A6J1HSR1 uncharacterized protein LOC111466276 isoform X3 | 5.3e-184 | 82.87
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEA
        GHDIVATF VERPVSLLEDNIE+L+TDIFEEFPIPSIKVDILSL  LSGSNRTKVVF +DPD DD EI STYLSLIRST ASLVTNQ FLRITKSMFGEA
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEA

Query:  FSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLK
        FSFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSM+RLK
Subjt:  FSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLK

Query:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERS
        QLAQTIS SNSSNLGLNNTEFGKVKQVRLSSILK+SLNG DG GP RSPSPAP PQSHN+ HPP+HHHHHHH+PLTP ISPAPA E GAPEY  PAP +S
Subjt:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERS

Query:  AASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNP
        AASPKRSY AKPPGCQ  YKRKSGRKEGKQ +L+PLASP ISP HSAA  SPS QH V+P         A TPLP+V+YAHVQPPSKS+S+HPEKSTT+P
Subjt:  AASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNP

Query:  LVAPSPSPSGADRRRMITQWGFTLFLILARHM
         + PSPSPS A    MIT+W FTL LI+A +M
Subjt:  LVAPSPSPSGADRRRMITQWGFTLFLILARHM

A0A6J1HW39 uncharacterized protein LOC111467196 | 2.1e-185 | 82.19
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEA
        GHDI+ATFNVERPVSLL+DN+EQLQTDIFEEFPIPSIKVD+L L+ LSGSN T VVFSLD D DD EIS TYLSLIRST ASLVTNQ FL +TKSMFGEA
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEA

Query:  FSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLK
        FSFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFS+HQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLK
Subjt:  FSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLK

Query:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPA------PQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSS
        QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+ LNGS+GN P RSPSPAPA      P +HNY HPPT HHHHHHTP+TPAISPAP TEKGAPEY S
Subjt:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPA------PQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSS

Query:  PAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPE
        PAPER+AASPKRS  A+PPGCQYRYKRKS RKEGKQ           SP HSA  PSPSP+H+V      VS APAL PLPNVVY HVQPPSKS+S+H E
Subjt:  PAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPE

Query:  KSTTNPLVAPSPSPSGADRRRMITQWGFTLFLILARHM
         S  NP  APSPSPSGADR R ITQWGFTLFLILA HM
Subjt:  KSTTNPLVAPSPSPSGADRRRMITQWGFTLFLILARHM

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2) | 4.5e-26 | 34.78
Query:  IVATFNVERPVSLLEDNIEQLQTDIFEEFPIP-SIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAFS
        + A+F +++PVS +  +  +++ DI     +  + KV +LSL     SN T V F++ P   D EIS   LSL+RS+   L   +  L++T S FG+  S
Subjt:  IVATFNVERPVSLLEDNIEQLQTDIFEEFPIP-SIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAFS

Query:  FEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQL
        F+VLKFP GIT+ P + A +     +LF+ T+  SI  +Q     L    E  L L PYE ++ +L N +GST++ P   Q  V   +      +RL   
Subjt:  FEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQL

Query:  AQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAP
         Q I  S + NLGL+   FG+VK +  S+ L       DG  P      APAP
Subjt:  AQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein | 1.7e-81 | 48.98
Query:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEA
        GH IVA+F++ R  S L +N  QLQ DIF+E    SIKV IL++EP    N TKVVF +DPD    EI    LS I+    S++ NQ  L++TKS+FGE 
Subjt:  GHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEA

Query:  FSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLK
        F FEVLKFP GIT+IPPQSAF LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LY+ L N+EGSTV+ PT V SSVLL VG + S  RLK
Subjt:  FSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLK

Query:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQS-HNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYS---SPA
        QL  TI+GS S NLGLNNT FGKVKQVRLSS L    N SD +   +SPSP+P+P S H++ H   HHHHHHH            + K APE S   SPA
Subjt:  QLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQS-HNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYS---SPA

Query:  PERSAASPKRSYAAKP---PGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRA-----PALTPLPNVVYAH-VQPPSK
        P RS    KR+ +A P   PG +  +K K       Q   TP  +P           + +P HQ++ P AP+S A     P   PLP+VV+AH  QPP  
Subjt:  PERSAASPKRSYAAKP---PGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRA-----PALTPLPNVVYAH-VQPPSK

Query:  SDSSHPEKSTTNPLVAPSP-SPSGADRRRMITQWGFTLFLILA
           + P +   N +  P P S S A        W   L LI+A
Subjt:  SDSSHPEKSTTNPLVAPSP-SPSGADRRRMITQWGFTLFLILA

AT3G56590.1 hydroxyproline-rich glycoprotein family protein | 2.8e-84 | 48.79
Query:  HDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAF
        H IVA+F+V +P+S +EDN+ QL+ DI +E   P  KV +L+LE L   NRT V+F++DP+ ++ +I +   SLI++   +LV  Q   R+T+S+FGE F
Subjt:  HDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
         FEVLKFP GIT+IPPQ  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTIV SSVLL  G   S  RLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER
        LAQTI+ S+S NLGLN+T FGKVKQVRLSSIL +S        PA S  PSP+P P++H YPH   HHHHHHH  L P  S +P T+  AP   + AP +
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER

Query:  SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTN
         +  P R+     P C Y  +R  G          P  +P  S  H  A     P+H     A PVS     +PLP+VV+AH+ PPSKS          +
Subjt:  SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTN

Query:  PLVAPSPSPSGA
        P  AP+P  S +
Subjt:  PLVAPSPSPSGA

AT3G56590.2 hydroxyproline-rich glycoprotein family protein | 2.1e-84 | 49.26
Query:  HDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAF
        H IVA+F+V +P+S +EDN+ QL+ DI +E   P  KV +L+LE L   NRT V+F++DP+ ++ +I +   SLI++   +LV  Q   R+T+S+FGE F
Subjt:  HDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
         FEVLKFP GIT+IPPQ  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTIV SSVLL  G   S  RLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER
        LAQTI+ S+S NLGLN+T FGKVKQVRLSSIL +S        PA S  PSP+P P++H YPH   HHHHHHH  L P  S +P T+  AP   + AP +
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER

Query:  SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTN
         +  P R+     P C Y  +R  G          P  +P  S  H  A     P+H     A PVS     +PLP+VV+AH+ PPSKS          +
Subjt:  SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTN

Query:  PLVAPSPS
        P  AP+PS
Subjt:  PLVAPSPS
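
In the alignments above, the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch. The following is a minimal sketch of counting identities for a single alignment block under that reading. The function name `identity_from_midline` is hypothetical, and the strings are the final Query/midline pair of the first GenBank hit (KAA0025811.1). The % identity values listed with each hit cover the whole alignment, so a single block is not expected to reproduce them.

```python
def identity_from_midline(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned length) for one alignment block.

    A position counts as identical when the midline repeats the query residue;
    '+' (conservative substitution) and ' ' (mismatch) are not counted.
    """
    # Trailing spaces of the midline may have been trimmed; pad it back out.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


query = "VAPSPSPSGADRRRMITQWGFTLFLILARHM"
midline = "VA  PSPSGADR  MITQWGFTLFLILARHM"

identical, length = identity_from_midline(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%)")
```

Summing the two counts over every block of a hit, rather than averaging per-block percentages, gives the overall identity for that hit.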


Sequences
CDS sequence
ATGGCCCTTGACCCACTTCGCCGGCATTGTTCCCGATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGGCTACAGGCCATCTGGCCAGGCTGCCGATGGC
CGATGCTGTTGTGGGTGTGTTTCGATTCCAAGACTCATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGTTGCCCCC
TTTTCTCCATTATGCAGATCAAAAGGATCTGGGTCTCAATTCGTCGTATCGAGGTGGGACTCTTCATCGATTCTGATATCTTTGGGGTTTTAGGGGTTCGTCGATTGCTG
TTTCTTTTGTGGGGTCATGATATAGTAGCAACATTCAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCAAACCGACATTTTTGAAGAGTTCCC
TATACCTTCTATCAAAGTGGATATACTATCTCTAGAACCATTATCTGGATCCAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTGGAAATCTCGT
CAACTTATCTAAGTTTAATCAGGTCAACCATTGCAAGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAATCCATGTTTGGGGAGGCTTTTTCGTTTGAAGTACTGAAA
TTCCCCAGAGGAATAACTATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACGTTGAACTTCTCTATTCATCAGATTCAAGTACA
TTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAGCTGTGGAATGCAGAAGGTTCGACTGTGACTGCCCCTACGA
TTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAATCTCGGTCTGAAT
AATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAATACTCCCTCAATGGGAGTGACGGGAACGGCCCCGCAAGGTCACCTTCTCCTGCTCCTGC
ACCCCAGTCCCATAACTACCCTCATCCCCCGACTCACCACCATCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCACCAG
AATACAGTTCGCCTGCCCCTGAAAGAAGCGCAGCATCACCTAAGAGAAGTTACGCGGCAAAGCCGCCTGGTTGTCAGTATAGATACAAGAGGAAGTCTGGTAGGAAAGAA
GGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCATTATATCTCCTGATCATTCTGCTGCATCGCCATCGCCATCGCCACAACATCAAGTAAACCCACCAGCAGCACC
CGTCTCTCGAGCTCCGGCATTAACTCCATTGCCAAATGTCGTTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAGCCACCCGGAAAAATCCACGACAAATCCAT
TAGTTGCGCCATCTCCATCTCCATCTGGTGCTGATCGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCCATATGTAA
mRNA sequence
ATGGCCCTTGACCCACTTCGCCGGCATTGTTCCCGATGGGGAAAAACGACGGAGAACAGCCACTGCCGTCCGCCATCGGCTACAGGCCATCTGGCCAGGCTGCCGATGGC
CGATGCTGTTGTGGGTGTGTTTCGATTCCAAGACTCATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGTTGCCCCC
TTTTCTCCATTATGCAGATCAAAAGGATCTGGGTCTCAATTCGTCGTATCGAGGTGGGACTCTTCATCGATTCTGATATCTTTGGGGTTTTAGGGGTTCGTCGATTGCTG
TTTCTTTTGTGGGGTCATGATATAGTAGCAACATTCAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCAAACCGACATTTTTGAAGAGTTCCC
TATACCTTCTATCAAAGTGGATATACTATCTCTAGAACCATTATCTGGATCCAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTGGAAATCTCGT
CAACTTATCTAAGTTTAATCAGGTCAACCATTGCAAGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAATCCATGTTTGGGGAGGCTTTTTCGTTTGAAGTACTGAAA
TTCCCCAGAGGAATAACTATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACGTTGAACTTCTCTATTCATCAGATTCAAGTACA
TTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAGCTGTGGAATGCAGAAGGTTCGACTGTGACTGCCCCTACGA
TTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAATCTCGGTCTGAAT
AATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAATACTCCCTCAATGGGAGTGACGGGAACGGCCCCGCAAGGTCACCTTCTCCTGCTCCTGC
ACCCCAGTCCCATAACTACCCTCATCCCCCGACTCACCACCATCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAGGGTGCACCAG
AATACAGTTCGCCTGCCCCTGAAAGAAGCGCAGCATCACCTAAGAGAAGTTACGCGGCAAAGCCGCCTGGTTGTCAGTATAGATACAAGAGGAAGTCTGGTAGGAAAGAA
GGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCATTATATCTCCTGATCATTCTGCTGCATCGCCATCGCCATCGCCACAACATCAAGTAAACCCACCAGCAGCACC
CGTCTCTCGAGCTCCGGCATTAACTCCATTGCCAAATGTCGTTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAGCCACCCGGAAAAATCCACGACAAATCCAT
TAGTTGCGCCATCTCCATCTCCATCTGGTGCTGATCGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTCTCGCACGCCATATGTAA
Protein sequence
MALDPLRRHCSRWGKTTENSHCRPPSATGHLARLPMADAVVGVFRFQDSLASDASSFCYCPLPCSFLLFFGCPLFSIMQIKRIWVSIRRIEVGLFIDSDIFGVLGVRRLL
FLLWGHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLK
FPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN
NTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKE
GKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTNPLVAPSPSPSGADRRRMITQWGFTLFLILARHM
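
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and reading frame 1. The `translate` helper is hypothetical and written here for illustration; it is not part of CuGenDB. Only the first 24 nucleotides of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
print(translate("ATGGCCCTTGACCCACTTCGCCGG"))  # MALDPLRR
```

The output matches the first eight residues of the protein sequence (MALDPLRR). Passing the full CDS, with line breaks joined, allows the same comparison over the whole record.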