; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012785 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012785
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionZinc finger family protein, putative isoform 1
Genome locationscaffold1:18753964..18758946
RNA-Seq ExpressionSpg012785
SyntenySpg012785
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025811.1 Zinc finger family protein, putative isoform 1 [Cucumis melo var. makuwa]2.8e-19885.32Show/hide
Query:  MVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKST
        ++K  HDIVATF+VER+VSLLEDN +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKST
Subjt:  MVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKST

Query:  FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSM
        FGEA+SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQ+SVLLEVGN PSM
Subjt:  FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSM

Query:  RRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPA
        RRLKQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHN+H PPTHHHHHHHTPL  AISPAPATEKGAP+YGSPA
Subjt:  RRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPA

Query:  PE-SAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTT
        PE SAASP++S+ A+PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP HSAASPSP+HQ+ PPA PVSP PALTPLPNVIYAHVQPP KS+ N P     
Subjt:  PE-SAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTT

Query:  NPSAMPSPSPSPSGADHCHMITRWGFALFLILAFHM
           A PS +PSPSGAD CHMIT+WGF LFLILA HM
Subjt:  NPSAMPSPSPSPSGADHCHMITRWGFALFLILAFHM

KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus]3.0e-20087.01Show/hide
Query:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF
        HDIVATF+VER+VSLLEDN +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+
Subjt:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF

Query:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ
        SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGST+T PTIVQ+SVLLEVGN PSMRRLKQ
Subjt:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA
        LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGS+GNGPVRSPSPAPTPQPHN H PPTHHHHHHHTPLTPAISPAPATEKGAP+YGSPAPE +A
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA

Query:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM
        ASPK+S+ AKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISP HSAASPSP+HQ+ PPA PVSP PALTPLPNVIYAHVQPP KS+ NHP        A 
Subjt:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM

Query:  PSPSPSPSGADHCHMITRWGFALFLILAFHM
        PS +PSPSGAD CHMIT+WGF LFLILA HM
Subjt:  PSPSPSPSGADHCHMITRWGFALFLILAFHM

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus]3.0e-20087.01Show/hide
Query:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF
        HDIVATF+VER+VSLLEDN +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+
Subjt:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF

Query:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ
        SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGST+T PTIVQ+SVLLEVGN PSMRRLKQ
Subjt:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA
        LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGS+GNGPVRSPSPAPTPQPHN H PPTHHHHHHHTPLTPAISPAPATEKGAP+YGSPAPE +A
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA

Query:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM
        ASPK+S+ AKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISP HSAASPSP+HQ+ PPA PVSP PALTPLPNVIYAHVQPP KS+ NHP        A 
Subjt:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM

Query:  PSPSPSPSGADHCHMITRWGFALFLILAFHM
        PS +PSPSGAD CHMIT+WGF LFLILA HM
Subjt:  PSPSPSPSGADHCHMITRWGFALFLILAFHM

XP_008455751.1 PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo]1.4e-19786.08Show/hide
Query:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF
        HDIVATF+VER+VSLLEDN +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+
Subjt:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF

Query:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ
        SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQ+SVLLEVGN PSMRRLKQ
Subjt:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA
        LAQTISGSNSSNLGLNN EFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHN+H PPTHHHHHHHTPL  AISPAPATEKGAP+YGSPAPE SA
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA

Query:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM
        ASP++S+ A+PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP HSAASPSP+HQ+ PPA PVSP PALTPLPNVIYAHVQPP KS+ N P        A 
Subjt:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM

Query:  PSPSPSPSGADHCHMITRWGFALFLILAFHM
        PS +PSPSGAD CHMIT+WGF LFLILA HM
Subjt:  PSPSPSPSGADHCHMITRWGFALFLILAFHM

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida]1.1e-19483.52Show/hide
Query:  WVPRGLPVESFGLIGEKMVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSI
        W+P  L       +G       HDIVATF+VER VSLLEDNIEQLRTDI+EEF IPSIKVDILSLESL GSNRTKVVF +DPD D+SEI ST LSLIRS 
Subjt:  WVPRGLPVESFGLIGEKMVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSI

Query:  FASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAP
          S+VTNQ FLRITKS FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGST+TAP
Subjt:  FASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAP

Query:  TIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAI
        TIVQSSVLLEVGN PSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGP RSPSPAP PQPHN   PPT HHHHHHT LTPAI
Subjt:  TIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAI

Query:  SPAPATEKGAPKYGSPAPE-SAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPS--PRHQVKPPATPVSPTPALTPLPNVIY
        SPAPATEKGAP+YGSPAPE S ASPK+S+ AKPPGCQY  KRKSGRKEGKQSHLTPLASPN+SP HSAASPS  P+H+V PPA P+ P PALTPLPNVIY
Subjt:  SPAPATEKGAPKYGSPAPE-SAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPS--PRHQVKPPATPVSPTPALTPLPNVIY

Query:  AHVQPPLKSNPNHPEKSTTNPSAMPSPSPSPSGADHCHMITRWGFALFLILAFHM
        AHVQPP KSN NHPEKSTTNPS   +PSPSPSGAD C MIT+WGF LFLILA HM
Subjt:  AHVQPPLKSNPNHPEKSTTNPSAMPSPSPSPSGADHCHMITRWGFALFLILAFHM

TrEMBL top hitse value%identityAlignment
A0A0A0KYS3 Uncharacterized protein1.5e-20087.01Show/hide
Query:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF
        HDIVATF+VER+VSLLEDN +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+
Subjt:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF

Query:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ
        SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGST+T PTIVQ+SVLLEVGN PSMRRLKQ
Subjt:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA
        LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGS+GNGPVRSPSPAPTPQPHN H PPTHHHHHHHTPLTPAISPAPATEKGAP+YGSPAPE +A
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA

Query:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM
        ASPK+S+ AKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISP HSAASPSP+HQ+ PPA PVSP PALTPLPNVIYAHVQPP KS+ NHP        A 
Subjt:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM

Query:  PSPSPSPSGADHCHMITRWGFALFLILAFHM
        PS +PSPSGAD CHMIT+WGF LFLILA HM
Subjt:  PSPSPSPSGADHCHMITRWGFALFLILAFHM

A0A1S3C173 uncharacterized protein LOC1034958526.8e-19886.08Show/hide
Query:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF
        HDIVATF+VER+VSLLEDN +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+
Subjt:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF

Query:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ
        SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQ+SVLLEVGN PSMRRLKQ
Subjt:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA
        LAQTISGSNSSNLGLNN EFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHN+H PPTHHHHHHHTPL  AISPAPATEKGAP+YGSPAPE SA
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAPE-SA

Query:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM
        ASP++S+ A+PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP HSAASPSP+HQ+ PPA PVSP PALTPLPNVIYAHVQPP KS+ N P        A 
Subjt:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAM

Query:  PSPSPSPSGADHCHMITRWGFALFLILAFHM
        PS +PSPSGAD CHMIT+WGF LFLILA HM
Subjt:  PSPSPSPSGADHCHMITRWGFALFLILAFHM

A0A5A7SNH7 Zinc finger family protein, putative isoform 11.4e-19885.32Show/hide
Query:  MVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKST
        ++K  HDIVATF+VER+VSLLEDN +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKST
Subjt:  MVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKST

Query:  FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSM
        FGEA+SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQ+SVLLEVGN PSM
Subjt:  FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSM

Query:  RRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPA
        RRLKQLAQTISGSNSSNLGLNN EFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHN+H PPTHHHHHHHTPL  AISPAPATEKGAP+YGSPA
Subjt:  RRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPA

Query:  PE-SAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTT
        PE SAASP++S+ A+PPGCQYRYKRKSGRKEGKQSHLTPLASPNISP HSAASPSP+HQ+ PPA PVSP PALTPLPNVIYAHVQPP KS+ N P     
Subjt:  PE-SAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTT

Query:  NPSAMPSPSPSPSGADHCHMITRWGFALFLILAFHM
           A PS +PSPSGAD CHMIT+WGF LFLILA HM
Subjt:  NPSAMPSPSPSPSGADHCHMITRWGFALFLILAFHM

A0A6J1EB56 uncharacterized protein LOC111432513 isoform X33.3e-19280.75Show/hide
Query:  WVPRGLPVESFGLIGEKMVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSI
        W+P  L       +G       HDIVATF VER VSLL+DNIE+LRTDI+EEF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EIPST LSLIRS 
Subjt:  WVPRGLPVESFGLIGEKMVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSI

Query:  FASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAP
         AS+VTNQSFLRITKS FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAP
Subjt:  FASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAP

Query:  TIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAI
        TIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKVKQVRLSSILKHSLNG +G GP+RSPSPAPTPQPHN+H PP+HHHHHHH PLTP I
Subjt:  TIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAI

Query:  SPAPATEKGAPKYGSPAPESAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHV
        SPAPA E GAP+YG  AP+SAASPK+S+EAKPPGCQ  YKRKSGRKEGKQ HL+PLASP+ISPVHSAASPS +H        VSPT A TPLP+VIYAHV
Subjt:  SPAPATEKGAPKYGSPAPESAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHV

Query:  QPPLKSNPNHPEKSTTNPSAMPSPSPSPSGADHCHMITRWGFALFLILAFHM
        QPP KS+ NHPEKSTT+PS +PSPSPSPS A H  MITRWGF L LI+AF+M
Subjt:  QPPLKSNPNHPEKSTTNPSAMPSPSPSPSGADHCHMITRWGFALFLILAFHM

A0A6J1EH92 uncharacterized protein LOC111432513 isoform X11.8e-19080.04Show/hide
Query:  WVPRGLPVESFGLIGEKMVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSI
        W+P  L       +G       HDIVATF VER VSLL+DNIE+LRTDI+EEF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EIPST LSLIRS 
Subjt:  WVPRGLPVESFGLIGEKMVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSI

Query:  FASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAP
         AS+VTNQSFLRITKS FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAP
Subjt:  FASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAP

Query:  TIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAI
        TIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKVKQVRLSSILKHSLNG +G GP+RSPSPAPTPQPHN+H PP+HHHHHHH PLTP I
Subjt:  TIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAI

Query:  SPAPATEKGAPKYGSPAPESAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHV
        SPAPA E GAP+YG  AP+SAASPK+S+EAKPPGCQ  YKRKSGRKEGKQ HL+PLASP+ISPVHSAASPS +H        VSPT A TPLP+VIYAHV
Subjt:  SPAPATEKGAPKYGSPAPESAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHV

Query:  QPPLKSNPNHPEKSTTNPSAM----PSPSPSPSGADHCHMITRWGFALFLILAFHM
        QPP KS+ NHPEKSTT+PS +    PSPSPSPS A H  MITRWGF L LI+AF+M
Subjt:  QPPLKSNPNHPEKSTTNPSAM----PSPSPSPSGADHCHMITRWGFALFLILAFHM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)5.9e-2935.52Show/hide
Query:  VKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEF-LIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKST
        VK +  + A+F +++ VS +  +  ++  DI     L  + KV +LSL     SN T V F V P   D EI   SLSL+RS F  +   +S L++T S 
Subjt:  VKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEF-LIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKST

Query:  FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSM
        FG+  SF+VLKFPGGIT+ P + A +     +LF+ T+  SI  +Q     L    E  L L PYE ++ +L N +GST++ P   Q  V   +      
Subjt:  FGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSM

Query:  RRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTP
        +RL    Q I  S + NLGL+   FG+VK +  S+ L   +  S+        +PAPTP
Subjt:  RRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein1.9e-8346.83Show/hide
Query:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF
        H IVA+F + R+ S L +N  QL+ DI++E    SIKV IL++E     N TKVVFG+DPD    EI   SLS I+ +F S++ NQS L++TKS FGE F
Subjt:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF

Query:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ
         FEVLKFPGGIT+IPPQSAF LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LY+ L N+EGST++ PT V SSVLL VG   S  RLKQ
Subjt:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQP----------HNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPK
        L  TI+GS S NLGLNNT FGKVKQVRLSS L +S + S      +SPSP+P+P            H++H    +HHHHHH  L+P ++P          
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQP----------HNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPK

Query:  YGSPAPESAASPKKSHEAKP---PGCQYRYKRKSGRKEGKQSHLTPLASPNI-SPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNP
          SPAP    S K++  A P   PG +  +K K       Q   TP  +P+  +P H   SP+P    K    P+S      PLP+V++AH   P  + P
Subjt:  YGSPAPESAASPKKSHEAKP---PGCQYRYKRKSGRKEGKQSHLTPLASPNI-SPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNP

Query:  NHPEKSTTNPSAMPSPSPSPSGADHCHMITRWGFALFLILAF
          P     N  A P P  S S  +    +  W   L LI+A+
Subjt:  NHPEKSTTNPSAMPSPSPSPSGADHCHMITRWGFALFLILAF

AT3G56590.1 hydroxyproline-rich glycoprotein family protein2.0e-8549.02Show/hide
Query:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF
        H IVA+FDV + +S +EDN+ QL  DI +E   P  KV +L+LE L   NRT V+F +DP+ ++S+IP+   SLI++ F ++V  Q   R+T+S FGE F
Subjt:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF

Query:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ
         FEVLKFPGGIT+IPPQ  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GST+  PTIV SSVLL  G   S  RLKQ
Subjt:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTH-HHHHHHTPLTPAISPAPATEKGAPKYGSPAPESA
        LAQTI+ S+S NLGLN+T FGKVKQVRLSSIL HS        P  S +P+P+PQP  +  P  H HHHHHH  L P  S +P T+  AP   +P   S 
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTH-HHHHHHTPLTPAISPAPATEKGAPKYGSPAPESA

Query:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVH-SAASPS-PRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPS
          P+       P C Y  +R  G          P  +P+ S  H  A +P+ PRH     A PVS     +PLP+V++AH+ PP KS+P        +PS
Subjt:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVH-SAASPS-PRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPS

Query:  AMPSPSPSPS
          P+P  S S
Subjt:  AMPSPSPSPS

AT3G56590.2 hydroxyproline-rich glycoprotein family protein6.9e-8649.27Show/hide
Query:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF
        H IVA+FDV + +S +EDN+ QL  DI +E   P  KV +L+LE L   NRT V+F +DP+ ++S+IP+   SLI++ F ++V  Q   R+T+S FGE F
Subjt:  HDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAF

Query:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ
         FEVLKFPGGIT+IPPQ  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GST+  PTIV SSVLL  G   S  RLKQ
Subjt:  SFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNPPSMRRLKQ

Query:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTH-HHHHHHTPLTPAISPAPATEKGAPKYGSPAPESA
        LAQTI+ S+S NLGLN+T FGKVKQVRLSSIL HS        P  S +P+P+PQP  +  P  H HHHHHH  L P  S +P T+  AP   +P   S 
Subjt:  LAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTH-HHHHHHTPLTPAISPAPATEKGAPKYGSPAPESA

Query:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVH-SAASPS-PRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPS
          P+       P C Y  +R  G          P  +P+ S  H  A +P+ PRH     A PVS     +PLP+V++AH+ PP KS+P    +S     
Subjt:  ASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVH-SAASPS-PRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPS

Query:  AMPSPSPSPSGA
          PSP+P+PS A
Subjt:  AMPSPSPSPSGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTTGACCCACTTCGCCGGCATTGTTCCCGATGGGGAAAAACGACGGAGAACAGCCACCGCCGTCCGCCGTCGGCTCGCGGCCGTCCGGCCAGGCTGCCGATGGC
CGATGCTGTTGTGGGTGTCTTTCGATTCGAAGGCTCATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCC
TTTTCTCCATTATGCAGATCAAAAGGATCTGGGTCTTAATCCCTCGTATCGAGGTGGGATTCTCCATCGATTCTGGTATCTTTTGGGGTTTGGGGGGTTCGTCGATTGGT
GTTTCTTTTGTGGGTTTTCTGTGGCGTTCTTCTCGGTGGGTTCCTAGAGGCTTGCCTGTAGAATCGTTTGGTTTGATTGGAGAGAAGATGGTGAAGGGAAGTCATGATAT
AGTAGCAACATTCGATGTTGAGAGAGCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCGGACCGACATTTATGAAGAGTTCCTTATACCTTCTATCAAAGTGGATA
TACTATCTCTAGAATCGTTATCAGGATCCAACCGAACAAAAGTTGTGTTCGGTGTTGATCCAGATGCTGATGATTCAGAAATCCCATCAACTTCTCTAAGTTTAATAAGG
TCGATCTTTGCAAGTATAGTAACGAATCAGTCATTCCTCCGCATTACTAAATCCACGTTTGGGGAGGCCTTTTCGTTTGAAGTACTGAAATTCCCCGGAGGAATAACGAT
AATCCCACCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACATTAAACTTCTCTATTCATCAGATTCAAGTACATTTCAGTGAACTGACCAGCC
AACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAACTATGGAATGCGGAAGGTTCGACCATGACTGCCCCTACGATTGTCCAGTCATCTGTTCTT
CTGGAAGTCGGAAATCCTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAGACAATCTCAGGTTCTAATTCTAGCAACCTTGGCCTGAATAATACTGAGTTTGGAAAAGT
GAAGCAAGTTCGCCTTTCGTCGATTCTTAAACACTCCCTCAATGGCAGTGAAGGGAATGGCCCTGTTAGGTCTCCTTCTCCTGCCCCGACACCGCAGCCCCATAACTACC
ATCGCCCCCCAACTCACCACCATCACCACCATCACACCCCTCTAACACCTGCAATTTCACCTGCCCCTGCTACCGAGAAGGGCGCACCGAAATATGGTTCGCCTGCCCCC
GAAAGTGCGGCATCACCTAAGAAAAGTCATGAAGCAAAGCCACCTGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGAAAGAGGGAAAGCAATCTCATTTAACCCC
GCTTGCTTCACCCAATATATCTCCTGTTCATTCTGCTGCATCACCATCGCCACGACATCAAGTTAAACCACCAGCAACACCCGTCTCTCCAACTCCGGCATTAACTCCGT
TGCCAAATGTCATTTATGCTCATGTTCAACCACCTTTGAAAAGCAACCCCAATCACCCCGAAAAATCCACGACGAATCCATCAGCCATGCCGTCGCCATCTCCATCTCCA
TCTGGTGCAGATCATTGCCATATGATTACTCGATGGGGATTCGCACTGTTTCTAATTCTCGCATTCCACATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCTTGACCCACTTCGCCGGCATTGTTCCCGATGGGGAAAAACGACGGAGAACAGCCACCGCCGTCCGCCGTCGGCTCGCGGCCGTCCGGCCAGGCTGCCGATGGC
CGATGCTGTTGTGGGTGTCTTTCGATTCGAAGGCTCATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCC
TTTTCTCCATTATGCAGATCAAAAGGATCTGGGTCTTAATCCCTCGTATCGAGGTGGGATTCTCCATCGATTCTGGTATCTTTTGGGGTTTGGGGGGTTCGTCGATTGGT
GTTTCTTTTGTGGGTTTTCTGTGGCGTTCTTCTCGGTGGGTTCCTAGAGGCTTGCCTGTAGAATCGTTTGGTTTGATTGGAGAGAAGATGGTGAAGGGAAGTCATGATAT
AGTAGCAACATTCGATGTTGAGAGAGCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCGGACCGACATTTATGAAGAGTTCCTTATACCTTCTATCAAAGTGGATA
TACTATCTCTAGAATCGTTATCAGGATCCAACCGAACAAAAGTTGTGTTCGGTGTTGATCCAGATGCTGATGATTCAGAAATCCCATCAACTTCTCTAAGTTTAATAAGG
TCGATCTTTGCAAGTATAGTAACGAATCAGTCATTCCTCCGCATTACTAAATCCACGTTTGGGGAGGCCTTTTCGTTTGAAGTACTGAAATTCCCCGGAGGAATAACGAT
AATCCCACCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACATTAAACTTCTCTATTCATCAGATTCAAGTACATTTCAGTGAACTGACCAGCC
AACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAACTATGGAATGCGGAAGGTTCGACCATGACTGCCCCTACGATTGTCCAGTCATCTGTTCTT
CTGGAAGTCGGAAATCCTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAGACAATCTCAGGTTCTAATTCTAGCAACCTTGGCCTGAATAATACTGAGTTTGGAAAAGT
GAAGCAAGTTCGCCTTTCGTCGATTCTTAAACACTCCCTCAATGGCAGTGAAGGGAATGGCCCTGTTAGGTCTCCTTCTCCTGCCCCGACACCGCAGCCCCATAACTACC
ATCGCCCCCCAACTCACCACCATCACCACCATCACACCCCTCTAACACCTGCAATTTCACCTGCCCCTGCTACCGAGAAGGGCGCACCGAAATATGGTTCGCCTGCCCCC
GAAAGTGCGGCATCACCTAAGAAAAGTCATGAAGCAAAGCCACCTGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGAAAGAGGGAAAGCAATCTCATTTAACCCC
GCTTGCTTCACCCAATATATCTCCTGTTCATTCTGCTGCATCACCATCGCCACGACATCAAGTTAAACCACCAGCAACACCCGTCTCTCCAACTCCGGCATTAACTCCGT
TGCCAAATGTCATTTATGCTCATGTTCAACCACCTTTGAAAAGCAACCCCAATCACCCCGAAAAATCCACGACGAATCCATCAGCCATGCCGTCGCCATCTCCATCTCCA
TCTGGTGCAGATCATTGCCATATGATTACTCGATGGGGATTCGCACTGTTTCTAATTCTCGCATTCCACATGTAA
Protein sequenceShow/hide protein sequence
MALDPLRRHCSRWGKTTENSHRRPPSARGRPARLPMADAVVGVFRFEGSLASDASSFCYCPLPCSFLLFFGCPLFSIMQIKRIWVLIPRIEVGFSIDSGIFWGLGGSSIG
VSFVGFLWRSSRWVPRGLPVESFGLIGEKMVKGSHDIVATFDVERAVSLLEDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIR
SIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVL
LEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSEGNGPVRSPSPAPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATEKGAPKYGSPAP
ESAASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPRHQVKPPATPVSPTPALTPLPNVIYAHVQPPLKSNPNHPEKSTTNPSAMPSPSPSP
SGADHCHMITRWGFALFLILAFHM