CuGenDBv2

Lsi01G017560 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G017560
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Zinc finger family protein, putative isoform 1
Genome location: chr01:16339827..16344309
RNA-Seq Expression: Lsi01G017560
Synteny: Lsi01G017560
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004175 - endopeptidase activity (molecular function)
  GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domains: NA
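
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the CuGenDB record: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


# Location copied from the record above.
chrom, start, end, span = parse_location("chr01:16339827..16344309")
print(chrom, start, end, span)  # chr01 16339827 16344309 4483
```

Under that assumption the gene spans 4483 bp on chr01, introns and untranslated regions included.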


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025811.1 Zinc finger family protein, putative isoform 1 [Cucumis melo var. makuwa] | 1.5e-191 | 88.48
Query:  EESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGE
        ++ HDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGE
Subjt:  EESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGE

Query:  AFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRL
        A+SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSMRRL
Subjt:  AFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRL

Query:  KQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER
        KQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPER
Subjt:  KQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER

Query:  SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTN
        SAASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+ P     N
Subjt:  SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTN

Query:  PLVAPSPS
        P VAPSPS
Subjt:  PLVAPSPS

KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus] | 5.1e-195 | 87.89
Query:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+A++K          HDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPD DD EISSTYLSLIRS I SLVT
Subjt:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV
        NQFL ITKS FGEA+SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SV
Subjt:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV

Query:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE
        LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSDGNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATE
Subjt:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE

Query:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS
        KGAPEY SPAPER+AASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPS
Subjt:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS

Query:  KSDSSHPEKSTTNPLVAPSPS
        KSDS+HP     NP +APSPS
Subjt:  KSDSSHPEKSTTNPLVAPSPS

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus] | 5.1e-195 | 87.89
Query:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+A++K          HDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPD DD EISSTYLSLIRS I SLVT
Subjt:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV
        NQFL ITKS FGEA+SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SV
Subjt:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV

Query:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE
        LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSDGNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATE
Subjt:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE

Query:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS
        KGAPEY SPAPER+AASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPS
Subjt:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS

Query:  KSDSSHPEKSTTNPLVAPSPS
        KSDS+HP     NP +APSPS
Subjt:  KSDSSHPEKSTTNPLVAPSPS

XP_031740216.1 uncharacterized protein LOC101216010 isoform X2 [Cucumis sativus] | 8.7e-195 | 87.5
Query:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+A++K          HDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPD DD EISSTYLSLIRS I SLVT
Subjt:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV
        NQFL ITKS FGEA+SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SV
Subjt:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV

Query:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE
        LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSDGNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATE
Subjt:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE

Query:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS
        KGAPEY SPAPER+AASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPS
Subjt:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS

Query:  KSDSSHPEKSTTNPLVAPSPSPCE
        KSDS+HP     NP +A  PSPCE
Subjt:  KSDSSHPEKSTTNPLVAPSPSPCE

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida] | 1.6e-193 | 88.15
Query:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+A++K          HDIVATFNVERPVSLLEDNIEQL+TDIFEEF IPSIKVDILSLE L GSNRTKVVFSLDPD D+ EISSTYLSLIRSTI SLVT
Subjt:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV
        NQFLRITKSMFGEAFSFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGSTVTAPTIVQSSV
Subjt:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV

Query:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE
        LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN  +PPT HHHHHHT LTPAISPAPATE
Subjt:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE

Query:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS
        KGAPEY SPAPERS ASPKRSY AKPPGCQY  KRKSGRKEGKQSHLTPLASP +SPDHSAASPSP PQH+VNPPAAP+  APALTPLPNV+YAHVQPPS
Subjt:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS

Query:  KSDSSHPEKSTTNPLVAPSPSP
        KS+S+HPEKSTTNP  APSPSP
Subjt:  KSDSSHPEKSTTNPLVAPSPSP

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KYS3 Uncharacterized protein | 2.5e-195 | 87.89
Query:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+A++K          HDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFSLDPD DD EISSTYLSLIRS I SLVT
Subjt:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV
        NQFL ITKS FGEA+SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SV
Subjt:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV

Query:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE
        LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILK+SLNGSDGNGP RSPSPAP PQ HN  HPPTHHHHHHHTPLTPAISPAPATE
Subjt:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE

Query:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS
        KGAPEY SPAPER+AASPKRSY AKPPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPS
Subjt:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS

Query:  KSDSSHPEKSTTNPLVAPSPS
        KSDS+HP     NP +APSPS
Subjt:  KSDSSHPEKSTTNPLVAPSPS

A0A1S3C173 uncharacterized protein LOC103495852 | 1.7e-191 | 86.46
Query:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+A++K          HDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVT
Subjt:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV
        NQFL ITKS FGEA+SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SV
Subjt:  NQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSV

Query:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE
        LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATE
Subjt:  LLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATE

Query:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS
        KGAPEY SPAPERSAASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPS
Subjt:  KGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPS

Query:  KSDSSHPEKSTTNPLVAPSPS
        KSDS+ P     NP VAPSPS
Subjt:  KSDSSHPEKSTTNPLVAPSPS

A0A5A7SNH7 Zinc finger family protein, putative isoform 1 | 7.5e-192 | 88.48
Query:  EESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGE
        ++ HDIVATFNVER VSLLEDN +QL+TDIFEEFPIPSIKV+ILSLEPLSGSNRTKVVFS+DPD DD EISSTYLSLIRS I SLVTNQFL ITKS FGE
Subjt:  EESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQFLRITKSMFGE

Query:  AFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRL
        A+SFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQ+SVLLEVGNTPSMRRL
Subjt:  AFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRL

Query:  KQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER
        KQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILK+SLNGS+GNGP RSPSPAP PQ HN+ HPPTHHHHHHHTPL  AISPAPATEKGAPEY SPAPER
Subjt:  KQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAPER

Query:  SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTN
        SAASP+RSY A+PPGCQYRYKRKSGRKEGKQSHLTPLASP ISPDHSAA  SPSPQHQ+NPPAAPVS APALTPLPNV+YAHVQPPSKSDS+ P     N
Subjt:  SAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKSTTN

Query:  PLVAPSPS
        P VAPSPS
Subjt:  PLVAPSPS

A0A6J1HPX6 uncharacterized protein LOC111466276 isoform X1 | 3.4e-176 | 81.65
Query:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+A++K          HDIVATF VERPVSLLEDNIE+L+TDIFEEFPIPSIKVDILSL  LSGSNRTKVVF +DPD DD EI STYLSLIRST ASLVT
Subjt:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQ-FLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSS
        NQ FLRITKSMFGEAFSFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQSS
Subjt:  NQ-FLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSS

Query:  VLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPAT
        VLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEFGKVKQVRLSSILK+SLNG DG GP RSPSPAP PQSHN+ HPP+HHHHHHH+PLTP ISPAPA 
Subjt:  VLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPAT

Query:  EKGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPP
        E GAPEY  PAP +SAASPKRSY AKPPGCQ  YKRKSGRKEGKQ +L+PLASP ISP HSAA  SPS QH V+P         A TPLP+V+YAHVQPP
Subjt:  EKGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPP

Query:  SKSDSSHPEKSTTNPLVAPSPSPCE
        SKS+S+HPEKSTT+P + PSPSP +
Subjt:  SKSDSSHPEKSTTNPLVAPSPSPCE

A0A6J1HSR1 uncharacterized protein LOC111466276 isoform X3 | 5.7e-176 | 82.03
Query:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT
        H+A++K          HDIVATF VERPVSLLEDNIE+L+TDIFEEFPIPSIKVDILSL  LSGSNRTKVVF +DPD DD EI STYLSLIRST ASLVT
Subjt:  HHANKK------KEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVT

Query:  NQ-FLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSS
        NQ FLRITKSMFGEAFSFEVLKFP GITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVTAPTIVQSS
Subjt:  NQ-FLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSS

Query:  VLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPAT
        VLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEFGKVKQVRLSSILK+SLNG DG GP RSPSPAP PQSHN+ HPP+HHHHHHH+PLTP ISPAPA 
Subjt:  VLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPAT

Query:  EKGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPP
        E GAPEY  PAP +SAASPKRSY AKPPGCQ  YKRKSGRKEGKQ +L+PLASP ISP HSAA  SPS QH V+P         A TPLP+V+YAHVQPP
Subjt:  EKGAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPP

Query:  SKSDSSHPEKSTTNPLVAPSPSP
        SKS+S+HPEKSTT+P + PSPSP
Subjt:  SKSDSSHPEKSTTNPLVAPSPSP

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2) | 3.7e-26 | 34.78
Query:  IVATFNVERPVSLLEDNIEQLQTDIFEEFPIP-SIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAFS
        + A+F +++PVS +  +  +++ DI     +  + KV +LSL     SN T V F++ P   D EIS   LSL+RS+   L   +  L++T S FG+  S
Subjt:  IVATFNVERPVSLLEDNIEQLQTDIFEEFPIP-SIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGEAFS

Query:  FEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQL
        F+VLKFP GIT+ P + A +     +LF+ T+  SI  +Q     L    E  L L PYE ++ +L N +GST++ P   Q  V   +      +RL   
Subjt:  FEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQL

Query:  AQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAP
         Q I  S + NLGL+   FG+VK +  S+ L       DG  P      APAP
Subjt:  AQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein | 2.2e-79 | 49.76
Query:  HDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEAF
        H IVA+F++ R  S L +N  QLQ DIF+E    SIKV IL++EP    N TKVVF +DPD    EI    LS I+    S++ NQ  L++TKS+FGE F
Subjt:  HDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQ-FLRITKSMFGEAF

Query:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ
         FEVLKFP GIT+IPPQSAF LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LY+ L N+EGSTV+ PT V SSVLL VG + S  RLKQ
Subjt:  SFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQS-HNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYS---SPAP
        L  TI+GS S NLGLNNT FGKVKQVRLSS L    N SD +   +SPSP+P+P S H++ H   HHHHHHH            + K APE S   SPAP
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQS-HNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYS---SPAP

Query:  ERSAASPKRSYAAKP---PGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRA-----PALTPLPNVVYAH-VQPPSKS
         RS    KR+ +A P   PG +  +K K       Q   TP  +P           + +P HQ++ P AP+S A     P   PLP+VV+AH  QPP   
Subjt:  ERSAASPKRSYAAKP---PGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRA-----PALTPLPNVVYAH-VQPPSKS

Query:  DSSHPEKSTTNPLVAPSP
          + P +   N +  P P
Subjt:  DSSHPEKSTTNPLVAPSP

AT3G56590.1 hydroxyproline-rich glycoprotein family protein | 6.7e-84 | 48.67
Query:  ESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGE
        + H IVA+F+V +P+S +EDN+ QL+ DI +E   P  KV +L+LE L   NRT V+F++DP+ ++ +I +   SLI++   +LV  Q   R+T+S+FGE
Subjt:  ESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGE

Query:  AFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRL
         F FEVLKFP GIT+IPPQ  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTIV SSVLL  G   S  RL
Subjt:  AFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRL

Query:  KQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAP
        KQLAQTI+ S+S NLGLN+T FGKVKQVRLSSIL +S        PA S  PSP+P P++H YPH   HHHHHHH  L P  S +P T+  AP   + AP
Subjt:  KQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAP

Query:  ERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKST
         + +  P R+     P C Y  +R  G          P  +P  S  H  A     P+H     A PVS     +PLP+VV+AH+ PPSKS         
Subjt:  ERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKST

Query:  TNPLVAPSPSPCE
         +P  +P+P+PC+
Subjt:  TNPLVAPSPSPCE

AT3G56590.2 hydroxyproline-rich glycoprotein family protein | 5.1e-84 | 49.02
Query:  ESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGE
        + H IVA+F+V +P+S +EDN+ QL+ DI +E   P  KV +L+LE L   NRT V+F++DP+ ++ +I +   SLI++   +LV  Q   R+T+S+FGE
Subjt:  ESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDLEISSTYLSLIRSTIASLVTNQF-LRITKSMFGE

Query:  AFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRL
         F FEVLKFP GIT+IPPQ  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GSTV  PTIV SSVLL  G   S  RL
Subjt:  AFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMRRL

Query:  KQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAP
        KQLAQTI+ S+S NLGLN+T FGKVKQVRLSSIL +S        PA S  PSP+P P++H YPH   HHHHHHH  L P  S +P T+  AP   + AP
Subjt:  KQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARS--PSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEKGAPEYSSPAP

Query:  ERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKST
         + +  P R+     P C Y  +R  G          P  +P  S  H  A     P+H     A PVS     +PLP+VV+AH+ PPSKS         
Subjt:  ERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKST

Query:  TNPLVAPSPS
         +P  AP+PS
Subjt:  TNPLVAPSPS
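
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below shows one way the % identity column could be recomputed from that middle line. It is an illustration under that reading of the BLAST-style layout, not the database's own code, and the function name `midline_stats` is hypothetical. The middle line must be passed without its leading indentation so that it has one character per alignment column.

```python
def midline_stats(midline: str) -> dict:
    """Summarise a BLAST-style alignment middle line.

    Assumptions: letters mark identical residues, '+' marks conservative
    substitutions, spaces mark mismatches or gaps, and the string has
    exactly one character per alignment column.
    """
    length = len(midline)
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "percent_identity": round(100 * identical / length, 2),
    }


# Final block of the first GenBank hit (query PLVAPSPS).
print(midline_stats("P VAPSPS")["percent_identity"])    # 87.5

# First ten columns of the same hit (query EESHDIVATF).
print(midline_stats("++ HDIVATF")["percent_identity"])  # 70.0
```

Concatenating the middle lines of all blocks of one hit and passing the result to the same function gives the identity over the whole alignment.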


Sequences
CDS sequence
ATGAGGAGGCGGAGATTCGAAACTCCGACCTCAAGGATAACAAGTGGCACAATGCTGTCGTATGCCTCCTCCCTTCAAAAGTTTTGCCTTAGAGGTGAGAAGCCTCACCA
TGCAAATAAGAAAAAGGAAGAAAGTCATGATATAGTAGCAACATTCAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCAAACCGACATTTTTG
AAGAGTTCCCTATACCTTCTATCAAAGTGGATATACTATCTCTAGAACCATTATCTGGATCCAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTG
GAAATCTCGTCAACTTATCTAAGTTTAATCAGGTCAACCATTGCAAGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAATCCATGTTTGGGGAGGCTTTTTCGTTTGA
AGTACTGAAATTCCCCAGAGGAATAACTATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACGTTGAACTTCTCTATTCATCAGA
TTCAAGTACATTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAGCTGTGGAATGCAGAAGGTTCGACTGTGACT
GCCCCTACGATTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAATCT
CGGTCTGAATAATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAATACTCCCTCAATGGGAGTGACGGGAACGGCCCCGCAAGGTCACCTTCTC
CTGCTCCTGCACCCCAGTCCCATAACTACCCTCATCCCCCGACTCACCACCATCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAG
GGTGCACCAGAATACAGTTCGCCTGCCCCTGAAAGAAGCGCAGCATCACCTAAGAGAAGTTACGCGGCAAAGCCGCCTGGTTGTCAGTATAGATACAAGAGGAAGTCTGG
TAGGAAAGAAGGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCATTATATCTCCTGATCATTCTGCTGCATCGCCATCGCCATCGCCACAACATCAAGTAAACCCAC
CAGCAGCACCCGTCTCTCGAGCTCCGGCATTAACTCCATTGCCAAATGTCGTTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAGCCACCCGGAAAAATCCACG
ACAAATCCATTAGTTGCGCCATCTCCATCTCCATGTGAGTAA
mRNA sequence
ATGAGGAGGCGGAGATTCGAAACTCCGACCTCAAGGATAACAAGTGGCACAATGCTGTCGTATGCCTCCTCCCTTCAAAAGTTTTGCCTTAGAGGTGAGAAGCCTCACCA
TGCAAATAAGAAAAAGGAAGAAAGTCATGATATAGTAGCAACATTCAATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCAAACCGACATTTTTG
AAGAGTTCCCTATACCTTCTATCAAAGTGGATATACTATCTCTAGAACCATTATCTGGATCCAACCGTACAAAAGTTGTGTTCAGCCTCGATCCAGATGGCGATGACTTG
GAAATCTCGTCAACTTATCTAAGTTTAATCAGGTCAACCATTGCAAGTCTAGTAACGAATCAGTTCCTCCGCATTACTAAATCCATGTTTGGGGAGGCTTTTTCGTTTGA
AGTACTGAAATTCCCCAGAGGAATAACTATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACGTTGAACTTCTCTATTCATCAGA
TTCAAGTACATTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAGCTGTGGAATGCAGAAGGTTCGACTGTGACT
GCCCCTACGATTGTCCAGTCGTCTGTACTTCTTGAAGTTGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAAACAATCTCAGGTTCTAATTCTAGCAATCT
CGGTCTGAATAATACGGAGTTTGGAAAGGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAATACTCCCTCAATGGGAGTGACGGGAACGGCCCCGCAAGGTCACCTTCTC
CTGCTCCTGCACCCCAGTCCCATAACTACCCTCATCCCCCGACTCACCACCATCACCACCATCACACCCCTCTAACCCCTGCAATTTCACCTGCCCCTGCAACCGAGAAG
GGTGCACCAGAATACAGTTCGCCTGCCCCTGAAAGAAGCGCAGCATCACCTAAGAGAAGTTACGCGGCAAAGCCGCCTGGTTGTCAGTATAGATACAAGAGGAAGTCTGG
TAGGAAAGAAGGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCATTATATCTCCTGATCATTCTGCTGCATCGCCATCGCCATCGCCACAACATCAAGTAAACCCAC
CAGCAGCACCCGTCTCTCGAGCTCCGGCATTAACTCCATTGCCAAATGTCGTTTACGCTCATGTTCAACCACCTTCCAAAAGCGACTCCAGCCACCCGGAAAAATCCACG
ACAAATCCATTAGTTGCGCCATCTCCATCTCCATGTGAGTAACACACTGATTCCGGTAGGAAACTGGATAGTGTTTTAACTACAAAAGTCCCATATATTCCATTGAACAA
TCCCAAATTTTCCATTGTGAAAACTTTTTCAACTGACTCAACTGATTTTGCAGCTGGTGCTGATCGTCGTCGTATGATCACTCAATGGGGATTCACACTGTTTCTTATTC
TCGCACGCCATATGTAACATTCAAAAAGAAGACTACCGGTTTTCTGATGAACATGTATCGACGGCAACAAGAGATGCCAAGTTTTTTATAAAATGATAAAAGTGCAGGAG
TTTCTTTAAAAGTGTGAACAGAGTAGAGAAAAGCAAAGCCAAAGAGGCATTGCTTGTTGTAAGAAGGTTTCTAAGTGTGTAAATATCATCTGATTAAGAAACTTGTTGCA
GATGCAGTTTCAGGTCAAAGTCCACAGAGGTGGCAGGCCTTCAGAAACTTGCATATTTTCCCACTGTTTTGTGAATTATGATCTTCTTCTCCTTAAAATGTAAGGAGATG
GAGAAGGCAAAAAAGAAGAGAAAAAAAAGAAAAGAAAAGGAAAACAGAGCAAATGCATAACTTTTTTCTTTTACCATTGTGTTTTTTTTCTTCTGAAATCTATTAGGCTT
TTTGTACCCAAAATGGCCTTCTCTTGTATTGCTCAAATTTGAAGCTTGTATGTATATATAAACACATAACACAGCTTAAGCCCCCACTTGAATGCAACAACTGTAGTAGT
TTTTGGTTT
Protein sequence
MRRRRFETPTSRITSGTMLSYASSLQKFCLRGEKPHHANKKKEESHDIVATFNVERPVSLLEDNIEQLQTDIFEEFPIPSIKVDILSLEPLSGSNRTKVVFSLDPDGDDL
EISSTYLSLIRSTIASLVTNQFLRITKSMFGEAFSFEVLKFPRGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTVT
APTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKYSLNGSDGNGPARSPSPAPAPQSHNYPHPPTHHHHHHHTPLTPAISPAPATEK
GAPEYSSPAPERSAASPKRSYAAKPPGCQYRYKRKSGRKEGKQSHLTPLASPIISPDHSAASPSPSPQHQVNPPAAPVSRAPALTPLPNVVYAHVQPPSKSDSSHPEKST
TNPLVAPSPSPCE
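
The protein sequence is the translation of the CDS under the standard genetic code. This can be spot-checked with the short sketch below, which translates the first 30 and last 12 nucleotides of the CDS listed above. The helper name `translate` and the compact codon-table construction are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 12 nt of the CDS sequence above.
print(translate("ATGAGGAGGCGGAGATTCGAAACTCCGACC"))  # MRRRRFETPT
print(translate("CCATGTGAGTAA"))                    # PCE
```

Both fragments agree with the start (MRRRRFETPT) and end (PCE) of the protein sequence shown above.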