; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003755 (gene) of Chayote v1 genome

Gene IDSed0003755
OrganismSechium edule (Chayote v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG06:11551766..11554164
RNA-Seq ExpressionSed0003755
SyntenySed0003755
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]2.2e-3632.75Show/hide
Query:  EDEDNEAKKRACRVEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFK
        EDED      A   EI ++ +++ D D+      K+L+ K  + EAF+ L+ +IW+   +VE+E +G+N F  HF + + +  +   GPW F   L+V +
Subjt:  EDEDNEAKKRACRVEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFK

Query:  DSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEF
           G     +L FN   FW+ +  +  MC+  R  K +   IGE  ++   E+  C G+ +R+KV++D++KPLKR + IKLG   +    ++ YERLP+F
Subjt:  DSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEF

Query:  CLGCGRVGHGTKECREIEIRGAG----DLVYGGWLRAESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDKEGSQES
        C  CGR+GH  +EC + + + A        +G W+RA   +    KS   + T S+++R +  EG       EL+GD   S  S
Subjt:  CLGCGRVGHGTKECREIEIRGAG----DLVYGGWLRAESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDKEGSQES

TXG48431.1 hypothetical protein EZV62_027725 [Acer yangbiense]3.1e-3534.05Show/hide
Query:  KNSAYC---KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKL
        K  A C   K+LS +S + EAFR L+ +IW +   VEIE +  N++   F S D++  +  GGPWSFD  L+V ++  GK     L FN V FW+H+  +
Subjt:  KNSAYC---KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKL

Query:  SSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECR---EIEIRGA
          + +T    K +G++IGE R+VD+  ++ C G+ +R++V ++V KPL+R + + +    ++    I YERLP FC  CG VGH   EC    E  +R  
Subjt:  SSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECR---EIEIRGA

Query:  GDLVYGGWLRAESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDKEGSQESWPLRARFGRGRGRSGFDSRGG
          +VYG WLRA +  V   +       +    R +G     P          E +Q SW          G SG  + GG
Subjt:  GDLVYGGWLRAESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDKEGSQESWPLRARFGRGRGRSGFDSRGG

TXG63243.1 hypothetical protein EZV62_010237 [Acer yangbiense]6.3e-3637.44Show/hide
Query:  KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRW
        K+LS K  + EAF +++ KIW +   VEIESL  N+F  HF  L++   +  G PW+FDN +LV ++  GK   + + FN   FW+ + +L  +C T   
Subjt:  KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRW

Query:  AKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECREIEIRGAGD----LVYGGW
           +G MIGE ++VDI     C G+ +RI+V+ID+ KPL+R +++ +     +    + YERLP  C  CGR+GH T EC E+     G+     ++G W
Subjt:  AKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECREIEIRGAGD----LVYGGW

Query:  LRA
        ++A
Subjt:  LRA

TXG69259.1 hypothetical protein EZV62_004194 [Acer yangbiense]3.1e-3533.11Show/hide
Query:  EDEDNEAKKRACRVEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFK
        +DED E        +I ++  EE   D+ +    K+LS K  + EAF  ++  +W+   KVEIES+G+N+F  HF + +++  + + GPW FD  L+V +
Subjt:  EDEDNEAKKRACRVEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFK

Query:  DSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEF
           G     +L FN   FW+ +  +  MC+  R AK +   IGE  ++   E+  C G+ LR+KV+ID+S+PLKR + + L    +     + YER+PEF
Subjt:  DSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEF

Query:  CLGCGRVGHGTKECREIEIR----GAGDLVYGGWLRAESSNVGANK-STKTSRTESTTDR------EKGYEG-MGPKFQPELKGDKEGSQESWPLR
        C  CGRVGHG  EC +IE +       +  +G W+RA +++   +K S++ S   S  D+      E G  G +  +  P     KE     W  R
Subjt:  CLGCGRVGHGTKECREIEIR----GAGDLVYGGWLRAESSNVGANK-STKTSRTESTTDR------EKGYEG-MGPKFQPELKGDKEGSQESWPLR

VVA32948.1 PREDICTED: DUF4283 domain-containing [Prunus dulcis]3.1e-3534.08Show/hide
Query:  VEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDF
        +++  ND+++    LK S   K+L++ + + EAF++ M KIW    +V ++ +G+NLF   FA+  ++L +   GPW+FD  L++ +   G   P ++  
Subjt:  VEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDF

Query:  NLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKE
            FWI V  +   C+++   + IGN  G   DV    + +C GR LR++V +DVSKPL+RG  + L S  +  +    YERLPEFC  CGR+GH  KE
Subjt:  NLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKE

Query:  CREIE--IRGAGDLVYGGWLRA----ESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDK
        C  ++   + A +  YG WL+A     S   G  K T  S +E+  D  K  E      Q    G+K
Subjt:  CREIE--IRGAGDLVYGGWLRA----ESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDK

TrEMBL top hitse value%identityAlignment
A0A5C7GUN1 Uncharacterized protein1.5e-3534.05Show/hide
Query:  KNSAYC---KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKL
        K  A C   K+LS +S + EAFR L+ +IW +   VEIE +  N++   F S D++  +  GGPWSFD  L+V ++  GK     L FN V FW+H+  +
Subjt:  KNSAYC---KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKL

Query:  SSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECR---EIEIRGA
          + +T    K +G++IGE R+VD+  ++ C G+ +R++V ++V KPL+R + + +    ++    I YERLP FC  CG VGH   EC    E  +R  
Subjt:  SSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECR---EIEIRGA

Query:  GDLVYGGWLRAESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDKEGSQESWPLRARFGRGRGRSGFDSRGG
          +VYG WLRA +  V   +       +    R +G     P          E +Q SW          G SG  + GG
Subjt:  GDLVYGGWLRAESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDKEGSQESWPLRARFGRGRGRSGFDSRGG

A0A5C7GZQ4 CCHC-type domain-containing protein2.0e-3537.13Show/hide
Query:  KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRW
        KLLS K  + EAF  ++P+IW    + EIE L  N+F   F    ++ S+  GGPWSFD  LLV ++  GK    E+ F+ V FWI + K+  +C+TS  
Subjt:  KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRW

Query:  AKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECREIEIRGAGD---LVYGGWL
         + +G+MIGE +++D   +  C G+ +R++V +DV+KPL+R + + +     +    + YERLP+ C  CGR+GH  ++C  +      +   L++G WL
Subjt:  AKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECREIEIRGAGD---LVYGGWL

Query:  RA
        RA
Subjt:  RA

A0A5C7I124 CCHC-type domain-containing protein3.1e-3637.44Show/hide
Query:  KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRW
        K+LS K  + EAF +++ KIW +   VEIESL  N+F  HF  L++   +  G PW+FDN +LV ++  GK   + + FN   FW+ + +L  +C T   
Subjt:  KLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRW

Query:  AKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECREIEIRGAGD----LVYGGW
           +G MIGE ++VDI     C G+ +RI+V+ID+ KPL+R +++ +     +    + YERLP  C  CGR+GH T EC E+     G+     ++G W
Subjt:  AKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKECREIEIRGAGD----LVYGGW

Query:  LRA
        ++A
Subjt:  LRA

A0A5C7IJL3 CCHC-type domain-containing protein1.5e-3533.11Show/hide
Query:  EDEDNEAKKRACRVEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFK
        +DED E        +I ++  EE   D+ +    K+LS K  + EAF  ++  +W+   KVEIES+G+N+F  HF + +++  + + GPW FD  L+V +
Subjt:  EDEDNEAKKRACRVEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFK

Query:  DSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEF
           G     +L FN   FW+ +  +  MC+  R AK +   IGE  ++   E+  C G+ LR+KV+ID+S+PLKR + + L    +     + YER+PEF
Subjt:  DSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEF

Query:  CLGCGRVGHGTKECREIEIR----GAGDLVYGGWLRAESSNVGANK-STKTSRTESTTDR------EKGYEG-MGPKFQPELKGDKEGSQESWPLR
        C  CGRVGHG  EC +IE +       +  +G W+RA +++   +K S++ S   S  D+      E G  G +  +  P     KE     W  R
Subjt:  CLGCGRVGHGTKECREIEIR----GAGDLVYGGWLRAESSNVGANK-STKTSRTESTTDR------EKGYEG-MGPKFQPELKGDKEGSQESWPLR

A0A5E4G034 PREDICTED: DUF4283 domain-containing1.5e-3534.08Show/hide
Query:  VEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDF
        +++  ND+++    LK S   K+L++ + + EAF++ M KIW    +V ++ +G+NLF   FA+  ++L +   GPW+FD  L++ +   G   P ++  
Subjt:  VEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDF

Query:  NLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKE
            FWI V  +   C+++   + IGN  G   DV    + +C GR LR++V +DVSKPL+RG  + L S  +  +    YERLPEFC  CGR+GH  KE
Subjt:  NLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKE

Query:  CREIE--IRGAGDLVYGGWLRA----ESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDK
        C  ++   + A +  YG WL+A     S   G  K T  S +E+  D  K  E      Q    G+K
Subjt:  CREIE--IRGAGDLVYGGWLRA----ESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding5.6e-1426.34Show/hide
Query:  VEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDF
        + IG+  +E  +G  K     K+L S+   +   RKL  ++W     + +  L +  F   F   +E ++   GGPW      L+ +D   +  P   D 
Subjt:  VEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDF

Query:  NLVCFWIHVKKLS----SMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGH
             W+ +  +       CL    A+ +G  +     VD++  N  +GR  R+ ++++++KPLK  V I       DR+F +AYE L + C  CG  GH
Subjt:  NLVCFWIHVKKLS----SMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGH

Query:  GTKEC
            C
Subjt:  GTKEC

AT3G42140.1 zinc ion binding;nucleic acid binding3.0e-0723.57Show/hide
Query:  FASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLK
        F S +   SI   GPWSF++ + V +      L  + +F  + FWI ++ +    LT+R   +IG  +G + + ++       GR + +           
Subjt:  FASLDEKLSITEGGPWSFDNGLLVFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLK

Query:  RGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKEC
                           YE+L  FC  CG + H   EC
Subjt:  RGVYIKLGSMADDRWFSIAYERLPEFCLGCGRVGHGTKEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATATGCATTACATCTGTAAGATTGAGAATGCTAGAGAGGATGAGGATAATGAGGCGAAGAAACGGGCTTGTCGGGTGGAGATAGGCGATAATGACGTTGAAGAAGC
GGATGGGGATCTAAAGAACTCAGCTTACTGTAAGTTACTTTCTTCAAAATCCTCCAGCGCTGAGGCTTTTAGAAAGCTAATGCCTAAAATTTGGGATCTGGGGTATAAAG
TGGAGATAGAATCTCTAGGGAAGAATTTATTTCAATGTCACTTTGCTAGTCTAGATGAGAAGTTGTCAATCACTGAGGGGGGACCTTGGAGCTTCGATAATGGTCTGTTA
GTGTTCAAGGATTCTTTCGGGAAGGCTCTCCCAGAAGAGTTGGATTTCAATTTAGTTTGTTTCTGGATTCATGTCAAAAAATTGTCGTCAATGTGTCTAACTAGCAGATG
GGCAAAAGCTATAGGGAATATGATAGGGGAGTATAGGGATGTGGACATTGATGAAAATAATCGATGTAGAGGACGAACTTTAAGGATTAAAGTGAAAATCGATGTGTCGA
AACCGTTAAAAAGAGGAGTGTATATCAAATTGGGATCAATGGCAGACGATCGGTGGTTCAGTATAGCTTATGAGAGATTGCCAGAATTTTGTTTGGGATGTGGAAGAGTT
GGCCATGGAACAAAAGAATGTAGAGAAATTGAAATCAGAGGAGCAGGAGACTTGGTGTATGGGGGTTGGTTGAGAGCAGAATCGAGTAATGTGGGGGCAAACAAATCAAC
CAAAACTAGTAGGACGGAAAGTACAACAGATAGAGAAAAAGGCTATGAAGGGATGGGTCCAAAATTTCAGCCAGAACTGAAAGGAGATAAAGAAGGGAGCCAAGAATCAT
GGCCTTTAAGGGCTAGATTTGGAAGGGGAAGAGGGCGATCAGGGTTCGATTCGAGAGGTGGATGGAGAATGAGACCAGAGGTTGAAGAAGAAGATTGGAGGAAAAAAAAC
CCGAAGTCGCAAGAGAGAAAACGGAAGGAAGAAATCCGGAGAGTGAAAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATATGCATTACATCTGTAAGATTGAGAATGCTAGAGAGGATGAGGATAATGAGGCGAAGAAACGGGCTTGTCGGGTGGAGATAGGCGATAATGACGTTGAAGAAGC
GGATGGGGATCTAAAGAACTCAGCTTACTGTAAGTTACTTTCTTCAAAATCCTCCAGCGCTGAGGCTTTTAGAAAGCTAATGCCTAAAATTTGGGATCTGGGGTATAAAG
TGGAGATAGAATCTCTAGGGAAGAATTTATTTCAATGTCACTTTGCTAGTCTAGATGAGAAGTTGTCAATCACTGAGGGGGGACCTTGGAGCTTCGATAATGGTCTGTTA
GTGTTCAAGGATTCTTTCGGGAAGGCTCTCCCAGAAGAGTTGGATTTCAATTTAGTTTGTTTCTGGATTCATGTCAAAAAATTGTCGTCAATGTGTCTAACTAGCAGATG
GGCAAAAGCTATAGGGAATATGATAGGGGAGTATAGGGATGTGGACATTGATGAAAATAATCGATGTAGAGGACGAACTTTAAGGATTAAAGTGAAAATCGATGTGTCGA
AACCGTTAAAAAGAGGAGTGTATATCAAATTGGGATCAATGGCAGACGATCGGTGGTTCAGTATAGCTTATGAGAGATTGCCAGAATTTTGTTTGGGATGTGGAAGAGTT
GGCCATGGAACAAAAGAATGTAGAGAAATTGAAATCAGAGGAGCAGGAGACTTGGTGTATGGGGGTTGGTTGAGAGCAGAATCGAGTAATGTGGGGGCAAACAAATCAAC
CAAAACTAGTAGGACGGAAAGTACAACAGATAGAGAAAAAGGCTATGAAGGGATGGGTCCAAAATTTCAGCCAGAACTGAAAGGAGATAAAGAAGGGAGCCAAGAATCAT
GGCCTTTAAGGGCTAGATTTGGAAGGGGAAGAGGGCGATCAGGGTTCGATTCGAGAGGTGGATGGAGAATGAGACCAGAGGTTGAAGAAGAAGATTGGAGGAAAAAAAAC
CCGAAGTCGCAAGAGAGAAAACGGAAGGAAGAAATCCGGAGAGTGAAAAGCTGA
Protein sequenceShow/hide protein sequence
MHMHYICKIENAREDEDNEAKKRACRVEIGDNDVEEADGDLKNSAYCKLLSSKSSSAEAFRKLMPKIWDLGYKVEIESLGKNLFQCHFASLDEKLSITEGGPWSFDNGLL
VFKDSFGKALPEELDFNLVCFWIHVKKLSSMCLTSRWAKAIGNMIGEYRDVDIDENNRCRGRTLRIKVKIDVSKPLKRGVYIKLGSMADDRWFSIAYERLPEFCLGCGRV
GHGTKECREIEIRGAGDLVYGGWLRAESSNVGANKSTKTSRTESTTDREKGYEGMGPKFQPELKGDKEGSQESWPLRARFGRGRGRSGFDSRGGWRMRPEVEEEDWRKKN
PKSQERKRKEEIRRVKS