; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004632 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004632
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr6:5599136..5601011
RNA-Seq ExpressionLag0004632
SyntenyLag0004632
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]2.3e-4034.68Show/hide
Query:  ETKVQKQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNF
        E ++ +  E L +  +E   +  I +D ++  D+D+   +  K+LT K +N E FK L+ +IWN+ G++ ++ +G N F+  F N   + ++   GPW F
Subjt:  ETKVQKQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNF

Query:  DNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKI
           L++ E+PKG   I   +F  A FW++IH++P++C+ ++    L   +G V EI  +  +  W   +R++V++D+T+PL R L +++G   E   + +
Subjt:  DNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKI

Query:  TYERLPDFCYKCGRIGHTLKEC--EEIINLEEEG---LFGDWMRAAPI
         YERLPDFC+ CGRIGH+++EC  E+      +G    FG WMRA PI
Subjt:  TYERLPDFCYKCGRIGHTLKEC--EEIINLEEEG---LFGDWMRAAPI

TXG72599.1 hypothetical protein EZV62_001178 [Acer yangbiense]4.3e-3937.33Show/hide
Query:  KQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLL
        ++L K    A+E   ++ + ++ +    +D+   +  K+LT K IN E FK L+ +IW+  G++ ++ +G N+F+  F N  +++R+ + GPW+F N L+
Subjt:  KQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLL

Query:  LFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERL
          E+  G   + N EF  A FWI+IH++P++C+ ++ A  L   +G V EI L E    W N LR++V+ID+++PL R L +++G+  E + + + YERL
Subjt:  LFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERL

Query:  PDFCYKCGRIGHTLKEC
        P+FCY CGRIGH +KEC
Subjt:  PDFCYKCGRIGHTLKEC

XP_015380691.1 uncharacterized protein LOC107174364 [Citrus sinensis]4.3e-3939.71Show/hide
Query:  KILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKW
        K+L  + +N E FKS + ++W     ++++++G N F+  F    +K+R++ GGPW+FD  LL+  EPKG   +    F + +FWI+I N+P+ C+ K+ 
Subjt:  KILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKW

Query:  ADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGE-EVWMKITYERLPDFCYKCGRIGHTLKECEEIINLEEEGL-FGDWMRA
           LG ++GAVEEI+ DE         RI+V I++T PL + L ++   +GE ++ M + YERLPDFCY CG IGH  KEC +    ++E L +G WM+A
Subjt:  ADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGE-EVWMKITYERLPDFCYKCGRIGHTLKECEEIINLEEEGL-FGDWMRA

Query:  APIG
          IG
Subjt:  APIG

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]5.7e-3930.52Show/hide
Query:  TANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGG
        + N++ + ++I+  +     D+++  +  K+ T K I+ E  +S+M  +W      R + +G NI++  F++  EK R++  GPW F+  LL+   P   
Subjt:  TANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGG

Query:  SFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCG
        +   +  F + +FWI+IHN+P  C+  + A+ LG+ +G VEEI+ D  D      +R++VKIDV++PL RG+ ++  S G+++W  + YE+LPDFCY+CG
Subjt:  SFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCG

Query:  RIGHTLKECEE---IINLEEEGLFGDWMRAAPIGMGTPNQGNQGNQEQRREYSWGRGRGRRGGNSRFHHQQQRDWFDRRKNDPKKEGKNQTHQPTLAMEV
        +IGH+ +ECE+   ++       +GDW+RA  +         +       E  W  GR  RG         + DW  R +N    +G   +H+  +   V
Subjt:  RIGHTLKECEE---IINLEEEGLFGDWMRAAPIGMGTPNQGNQGNQEQRREYSWGRGRGRRGGNSRFHHQQQRDWFDRRKNDPKKEGKNQTHQPTLAMEV

Query:  DEPTSIET
        D   + E+
Subjt:  DEPTSIET

XP_042942846.1 uncharacterized protein LOC122277028 [Carya illinoinensis]3.3e-3934.85Show/hide
Query:  METKVQKQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWN
        ME +++   EKL +TA E   ++ +E + LE      + ++  ++LT++  N E FK  M R+W     I+ + + + + L  F   R+K R++  GPW 
Subjt:  METKVQKQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWN

Query:  FDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRL-GSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWM
        F+  L+L +E +G           A FW+RIH+LPMI  C ++  +L G  +G V E+D+D +D  W   +R+++ +D+T+PL R   + +G    E W+
Subjt:  FDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRL-GSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWM

Query:  KITYERLPDFCYKCGRIGHTLKECE----EIINLEEEGL-FGDWMRAAPIGMGTPNQGNQGNQE
        + TYERLPDFCY+CGR+GH+ KEC+       + +EE L +G W+RAA    G  N    G ++
Subjt:  KITYERLPDFCYKCGRIGHTLKECE----EIINLEEEGL-FGDWMRAAPIGMGTPNQGNQGNQE

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein1.1e-4034.68Show/hide
Query:  ETKVQKQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNF
        E ++ +  E L +  +E   +  I +D ++  D+D+   +  K+LT K +N E FK L+ +IWN+ G++ ++ +G N F+  F N   + ++   GPW F
Subjt:  ETKVQKQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNF

Query:  DNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKI
           L++ E+PKG   I   +F  A FW++IH++P++C+ ++    L   +G V EI  +  +  W   +R++V++D+T+PL R L +++G   E   + +
Subjt:  DNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKI

Query:  TYERLPDFCYKCGRIGHTLKEC--EEIINLEEEG---LFGDWMRAAPI
         YERLPDFC+ CGRIGH+++EC  E+      +G    FG WMRA PI
Subjt:  TYERLPDFCYKCGRIGHTLKEC--EEIINLEEEG---LFGDWMRAAPI

A0A5C7HA98 CCHC-type domain-containing protein8.0e-3936.18Show/hide
Query:  KILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKW
        KIL+  L+N + F+SL+ RIW   G + ++ +  N++   F+ + ++ ++   GPWNFD+ L++ EEP G   I   +F    FW++I NLPM+C+ K+ 
Subjt:  KILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKW

Query:  ADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGHTLK-----ECEEIINLEEEGLFGDW
        A+ LG ++G V E+D           +R++V +D+T+PL R L V V    EE  M I YERLP FC++CG +GHT+      +C   IN  ++ L+G W
Subjt:  ADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGHTLK-----ECEEIINLEEEGLFGDW

Query:  MRAA----PIGMGTPNQGNQGNQEQRREYSWGRGRGRRGGNSRFHH
        MRA     P+G    N        +R  Y    GRGR    + +HH
Subjt:  MRAA----PIGMGTPNQGNQGNQEQRREYSWGRGRGRRGGNSRFHH

A0A5C7IBW4 CCHC-type domain-containing protein3.6e-3935.25Show/hide
Query:  KKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIP
        +K    +++D L +    L  ++  K+L+ K +N + F+ +M +IW     + ++ +  NIF   F+N  +K +I+ GGPW+F++ L++ EEP+G   I 
Subjt:  KKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIP

Query:  NTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGH
        + +F    FWI+IHN+PM+C+ K     LG++VG V E+D  +        LR++V +++ +PL R L + V   G E  M + YERLPDFC++CG IGH
Subjt:  NTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGH

Query:  TLKECEEII-----NLEEEGLFGDWMRAAPIGMGTPNQGNQGNQEQRREYSWGRGRGRRGG
        ++K+C E        ++E+ +FG WMR  P     PN         R+  SWGR    RGG
Subjt:  TLKECEEII-----NLEEEGLFGDWMRAAPIGMGTPNQGNQGNQEQRREYSWGRGRGRRGG

A0A5C7IU01 CCHC-type domain-containing protein2.1e-3937.33Show/hide
Query:  KQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLL
        ++L K    A+E   ++ + ++ +    +D+   +  K+LT K IN E FK L+ +IW+  G++ ++ +G N+F+  F N  +++R+ + GPW+F N L+
Subjt:  KQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLL

Query:  LFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERL
          E+  G   + N EF  A FWI+IH++P++C+ ++ A  L   +G V EI L E    W N LR++V+ID+++PL R L +++G+  E + + + YERL
Subjt:  LFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERL

Query:  PDFCYKCGRIGHTLKEC
        P+FCY CGRIGH +KEC
Subjt:  PDFCYKCGRIGHTLKEC

A0A6J1D765 uncharacterized protein LOC1110179022.7e-3930.52Show/hide
Query:  TANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGG
        + N++ + ++I+  +     D+++  +  K+ T K I+ E  +S+M  +W      R + +G NI++  F++  EK R++  GPW F+  LL+   P   
Subjt:  TANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLINPEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGG

Query:  SFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCG
        +   +  F + +FWI+IHN+P  C+  + A+ LG+ +G VEEI+ D  D      +R++VKIDV++PL RG+ ++  S G+++W  + YE+LPDFCY+CG
Subjt:  SFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCG

Query:  RIGHTLKECEE---IINLEEEGLFGDWMRAAPIGMGTPNQGNQGNQEQRREYSWGRGRGRRGGNSRFHHQQQRDWFDRRKNDPKKEGKNQTHQPTLAMEV
        +IGH+ +ECE+   ++       +GDW+RA  +         +       E  W  GR  RG         + DW  R +N    +G   +H+  +   V
Subjt:  RIGHTLKECEE---IINLEEEGLFGDWMRAAPIGMGTPNQGNQGNQEQRREYSWGRGRGRRGGNSRFHHQQQRDWFDRRKNDPKKEGKNQTHQPTLAMEV

Query:  DEPTSIET
        D   + E+
Subjt:  DEPTSIET

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding3.0e-0622.86Show/hide
Query:  FRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLT
        F++      I++ GPW+F++ + + +  +      + EFK   FWI+I  +P+  L  +    +G  +G   E +L  +                     
Subjt:  FRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEEDRRWENSLRIQVKIDVTEPLT

Query:  RGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGHTLKEC
            V V        +K  YE+L +FC  CG + H   EC
Subjt:  RGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGHTLKEC

AT5G36228.1 nucleic acid binding;zinc ion binding2.2e-1222.49Show/hide
Query:  WNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEF-KYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEE
        W    ++  + +    F   FR+  +    ++  PW F+   +  +  +     P  +F  +   W+ I  +P+  + ++  + + S +G V  +D +EE
Subjt:  WNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEF-KYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEE

Query:  DRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGHTLKECEEIINLEE
               +R++V++D TEPL     V+  S+ E   +   YE+L   C  C R+ H +  C  +++ EE
Subjt:  DRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGHTLKECEEIINLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCAAGAGGGATCCTTTGGTGATCAGCACGGGGGACAAATGGAGGAAGACCCACCACACCAGGTGGAGACGCCAACCATGGGAGACACCTGCAATCAGGAAGACAA
CAATCAGTCACAGCAAAAGGAAAAAGACAAAATCGACGACAGAACATATGACAAGGAGGAAAATATGGAAACGAAGGTGCAGAAACAATTAGAAAAACTCAAGATCACGG
CAAATGAGAAGAAAAAACTCATATCTATTGAAGATGATGAATTAGAAAAAGCAGATGATGACCTCCAAGATGCAATTTTCTGCAAAATACTCACACAAAAACTCATCAAC
CCAGAGGTCTTTAAGTCGCTGATGCTAAGAATATGGAATAAGGAAGGACGGATCAGGATGAAGACCATGGGAAGGAATATCTTCCTCTGCACTTTCAGAAATACGAGAGA
AAAGGAAAGAATAGTCAAAGGGGGTCCTTGGAACTTCGACAATGGATTACTCCTTTTTGAGGAACCAAAAGGGGGGAGTTTTATACCAAATACAGAATTCAAGTACGCAA
GTTTCTGGATACGAATACACAACTTACCTATGATTTGTTTATGCAAGAAATGGGCGGATCGACTGGGGAGCTTAGTTGGTGCCGTAGAAGAGATAGACCTGGATGAGGAA
GATAGAAGATGGGAAAACTCCCTGCGAATTCAAGTTAAGATTGATGTAACAGAACCACTCACCAGAGGCCTGCTGGTTCAAGTTGGGTCCAAAGGAGAGGAAGTGTGGAT
GAAAATCACTTACGAACGCCTCCCGGACTTCTGCTACAAATGTGGGAGAATTGGTCACACACTAAAAGAGTGTGAAGAGATAATCAACCTGGAAGAAGAGGGCCTTTTTG
GAGACTGGATGAGAGCAGCACCAATAGGGATGGGAACACCAAACCAAGGCAACCAAGGCAACCAAGAACAGAGAAGGGAATATAGTTGGGGCAGAGGCAGGGGAAGAAGA
GGGGGTAACTCAAGATTCCATCATCAACAACAAAGGGACTGGTTTGATCGCAGGAAAAATGACCCAAAGAAAGAAGGAAAAAACCAAACTCACCAGCCAACCCTAGCAAT
GGAGGTGGATGAACCAACAAGTATAGAAACAGGGCCAAAGCAGACTCAACGAGGAGAGGGAAAAACAAACATCTCACAAAGCTCACCCTCGCCGGAAAACGGGGAGGAAG
ATGATCGGAATAGTGAGGATGTCAGGCAACACCAGTGGACGAACAAAACGGGAACAGAAATCGAGGAAGCTAGGGAGGAGTGCCAGAACAACAAAGTGGTGTCAGAAGAA
GCAATAGCCGTTGGAATAGGAAACAGTGAAAATACCCAAAAGGAAGAAGTGTGTAAGAAGGAAAAAGGGAAGAATAGGAATATTCTAAGCGGTGACAGAGGGAAATTCGA
AACAAAAAACAAGCTATCGAGGAAATGGAAGAGGCTCGCGCGGGCTGACCCAAAAAGTCAAAATGGAGTCGAGAACAATGAAATTAACAAGAGGAAAATGGAAACTACCT
GTGAGGAGGAGTCAAGGGAGGAGACAAAGAGGATGCGAACAACGTACTTCGAAGTAGTCGAAGGGATATCGGCGGAGACTGGTCACCAGCCCCGCCGGACGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGCAAGAGGGATCCTTTGGTGATCAGCACGGGGGACAAATGGAGGAAGACCCACCACACCAGGTGGAGACGCCAACCATGGGAGACACCTGCAATCAGGAAGACAA
CAATCAGTCACAGCAAAAGGAAAAAGACAAAATCGACGACAGAACATATGACAAGGAGGAAAATATGGAAACGAAGGTGCAGAAACAATTAGAAAAACTCAAGATCACGG
CAAATGAGAAGAAAAAACTCATATCTATTGAAGATGATGAATTAGAAAAAGCAGATGATGACCTCCAAGATGCAATTTTCTGCAAAATACTCACACAAAAACTCATCAAC
CCAGAGGTCTTTAAGTCGCTGATGCTAAGAATATGGAATAAGGAAGGACGGATCAGGATGAAGACCATGGGAAGGAATATCTTCCTCTGCACTTTCAGAAATACGAGAGA
AAAGGAAAGAATAGTCAAAGGGGGTCCTTGGAACTTCGACAATGGATTACTCCTTTTTGAGGAACCAAAAGGGGGGAGTTTTATACCAAATACAGAATTCAAGTACGCAA
GTTTCTGGATACGAATACACAACTTACCTATGATTTGTTTATGCAAGAAATGGGCGGATCGACTGGGGAGCTTAGTTGGTGCCGTAGAAGAGATAGACCTGGATGAGGAA
GATAGAAGATGGGAAAACTCCCTGCGAATTCAAGTTAAGATTGATGTAACAGAACCACTCACCAGAGGCCTGCTGGTTCAAGTTGGGTCCAAAGGAGAGGAAGTGTGGAT
GAAAATCACTTACGAACGCCTCCCGGACTTCTGCTACAAATGTGGGAGAATTGGTCACACACTAAAAGAGTGTGAAGAGATAATCAACCTGGAAGAAGAGGGCCTTTTTG
GAGACTGGATGAGAGCAGCACCAATAGGGATGGGAACACCAAACCAAGGCAACCAAGGCAACCAAGAACAGAGAAGGGAATATAGTTGGGGCAGAGGCAGGGGAAGAAGA
GGGGGTAACTCAAGATTCCATCATCAACAACAAAGGGACTGGTTTGATCGCAGGAAAAATGACCCAAAGAAAGAAGGAAAAAACCAAACTCACCAGCCAACCCTAGCAAT
GGAGGTGGATGAACCAACAAGTATAGAAACAGGGCCAAAGCAGACTCAACGAGGAGAGGGAAAAACAAACATCTCACAAAGCTCACCCTCGCCGGAAAACGGGGAGGAAG
ATGATCGGAATAGTGAGGATGTCAGGCAACACCAGTGGACGAACAAAACGGGAACAGAAATCGAGGAAGCTAGGGAGGAGTGCCAGAACAACAAAGTGGTGTCAGAAGAA
GCAATAGCCGTTGGAATAGGAAACAGTGAAAATACCCAAAAGGAAGAAGTGTGTAAGAAGGAAAAAGGGAAGAATAGGAATATTCTAAGCGGTGACAGAGGGAAATTCGA
AACAAAAAACAAGCTATCGAGGAAATGGAAGAGGCTCGCGCGGGCTGACCCAAAAAGTCAAAATGGAGTCGAGAACAATGAAATTAACAAGAGGAAAATGGAAACTACCT
GTGAGGAGGAGTCAAGGGAGGAGACAAAGAGGATGCGAACAACGTACTTCGAAGTAGTCGAAGGGATATCGGCGGAGACTGGTCACCAGCCCCGCCGGACGCAATGA
Protein sequenceShow/hide protein sequence
MLQEGSFGDQHGGQMEEDPPHQVETPTMGDTCNQEDNNQSQQKEKDKIDDRTYDKEENMETKVQKQLEKLKITANEKKKLISIEDDELEKADDDLQDAIFCKILTQKLIN
PEVFKSLMLRIWNKEGRIRMKTMGRNIFLCTFRNTREKERIVKGGPWNFDNGLLLFEEPKGGSFIPNTEFKYASFWIRIHNLPMICLCKKWADRLGSLVGAVEEIDLDEE
DRRWENSLRIQVKIDVTEPLTRGLLVQVGSKGEEVWMKITYERLPDFCYKCGRIGHTLKECEEIINLEEEGLFGDWMRAAPIGMGTPNQGNQGNQEQRREYSWGRGRGRR
GGNSRFHHQQQRDWFDRRKNDPKKEGKNQTHQPTLAMEVDEPTSIETGPKQTQRGEGKTNISQSSPSPENGEEDDRNSEDVRQHQWTNKTGTEIEEAREECQNNKVVSEE
AIAVGIGNSENTQKEEVCKKEKGKNRNILSGDRGKFETKNKLSRKWKRLARADPKSQNGVENNEINKRKMETTCEEESREETKRMRTTYFEVVEGISAETGHQPRRTQ