CuGenDBv2

Lag0024921 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024921
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr10:7022175..7023308
RNA-Seq Expression: Lag0024921
Synteny: Lag0024921
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
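
The genome location above can be turned into a span length with a few lines of Python. This is only a sketch: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if m is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr10:7022175..7023308")

# Assumption: 1-based inclusive coordinates, hence the +1.
span = end - start + 1
print(chrom, span, span // 3)
```

Under that assumption the span is 1134 bp, or 378 codons. That is the size expected for an uninterrupted coding region of 377 residues plus a stop codon.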


Homology
GenBank top hits (e value, % identity, alignment)
TXG69259.1 hypothetical protein EZV62_004194 [Acer yangbiense] (e value: 8.7e-39; % identity: 33.11)
Query:  EKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKLIVLGTPEANQ
        E  + HQIS +  EE V    +DHCLVG++LS + ++  A  + +   W    +   ++VG N ++F F++  DR  V   GPW FDK LIVL  PE   
Subjt:  EKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKLIVLGTPEANQ

Query:  RTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGR
          A   F K D W+++ ++PI   NR SA  +   +GE +E+ ++   +  G  +R+KVR+DI+ PL R   LS  K      + ++YER+PEFCF CGR
Subjt:  RTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGR

Query:  IGHVVRECIEIKNKGE-MNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKSDSDEVLVKETALVANIGEGINLADERTEKEGNNTRKEVEE
        +GH + EC +I+ K E M   +  FG+W++     K   +  S        +D    S + + K   L   +G   +   E         R  VE+
Subjt:  IGHVVRECIEIKNKGE-MNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKSDSDEVLVKETALVANIGEGINLADERTEKEGNNTRKEVEE

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] (e value: 2.7e-40; % identity: 33.33)
Query:  MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTR-KQFSADTVGRNTYIFKFEDMGDREWVMHNGP
        M   +L++EWK   L  +E +  + ++    +     L+  L+ +LLS R IS   +KN +  AW+   K FS D +G N ++F F    DR  ++  GP
Subjt:  MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTR-KQFSADTVGRNTYIFKFEDMGDREWVMHNGP

Query:  WLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECW
        W FD+ LI++  P +  +  D  F+ V LW+  F+L +   N++ A ++GN +G F +V+++      G+ +R++VR D+  PL RG  L+     G CW
Subjt:  WLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECW

Query:  ISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEEDFEFGVWLKFQG
        I I+YERLP+F + CGR+ H++++C +     +   ++ ++G WL+FQG
Subjt:  ISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEEDFEFGVWLKFQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] (e value: 9.9e-43; % identity: 34.86)
Query:  MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPW
        M+ E+L+ +W+K  L  +E    + ++ +  K+    L + LVG+LL+ RIIS   +   +  AW+   Q + +++G+N ++F F    D   VM  GPW
Subjt:  MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPW

Query:  LFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWI
         FDK LIVL  P +++  ++  F +V  W+ LF+LP+ + N++ A ++GN +G FV+VD +++G   G  +RI+V +DIT PL RG  ++     G CWI
Subjt:  LFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWI

Query:  SIRYERLPEFCFCCGRIGHVVREC-IEIKNKGEMNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKS--DSDEVLVKET
         I+YERLP+FC+ CG IGH   +C        + +    E+G WL+F G     ++    +    E   G S  +S E  V+ET
Subjt:  SIRYERLPEFCFCCGRIGHVVREC-IEIKNKGEMNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKS--DSDEVLVKET

XP_024033132.1 uncharacterized protein LOC112095437 [Citrus clementina] (e value: 1.3e-37; % identity: 32.48)
Query:  MEIEDLIDEWKKLNLMEKEKRHQI---SLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHN
        ME E+LI + + + L  +E    I    ++E+  K + G    CLVG++L  R ++   +K A++ AWRT   F  +++G N ++FKF    D++ V + 
Subjt:  MEIEDLIDEWKKLNLMEKEKRHQI---SLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHN

Query:  GPWLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGE
        GPW FD+ L+VL  P+        SF  V  W+R+ N+P+   + S   ++G+ +G+  ++  D +G   G  +RI+V ++IT PL +  +L   + D +
Subjt:  GPWLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGE

Query:  CWISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKSDSDEVLVKETALVANIGE--------
          + + YERLP+FCFCCG IGH  REC+E K +    +E+  FG WLK    +   K   + +K    S+  +   +EV   ++A + N G         
Subjt:  CWISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKSDSDEVLVKETALVANIGE--------

Query:  -GINLADERTEKEG
         G  L  E TE+EG
Subjt:  -GINLADERTEKEG

XP_024955847.1 uncharacterized protein LOC112498636 [Citrus sinensis] (e value: 3.3e-38; % identity: 35.23)
Query:  MEIEDLIDEWKKLNLMEKEKRHQISL-----EEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVM
        M+ E+LI + K +++ + E+ ++IS         E+   C     CLVG+++  R ++   +++AM   WRT K+   +++G N ++FKF    D++ VM
Subjt:  MEIEDLIDEWKKLNLMEKEKRHQISL-----EEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVM

Query:  HNGPWLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKID
          GPW FD  L+VL  P+     A+ SF     W+ L N+PI + +R    ++G  +G   E+DTD+EGN  G   R+++ +DIT PL R F+    + +
Subjt:  HNGPWLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKID

Query:  GECWISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEEDFEFGVWLK-FQGFSKILKR-PDSPQKVRDESKDGKSDSD
        G   I I YERLP+FCFCCG IGH  +ECIE K +    ++D  +G W+K    F K+ +   D  Q+ +++ K G   SD
Subjt:  GECWISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEEDFEFGVWLK-FQGFSKILKR-PDSPQKVRDESKDGKSDSD

TrEMBL top hits (e value, % identity, alignment)
A0A2Z6NZV1 Uncharacterized protein (e value: 3.9e-37; % identity: 31.76)
Query:  MEKEKRHQISLEEEEEKVLCGQLDHC---------LVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKL
        ME+EK+     E+EE+ +     + C         LVG+L +    +  A K  +  AWR +       +  N ++F+F    D + V+ NGPW FD+ L
Subjt:  MEKEKRHQISLEEEEEKVLCGQLDHC---------LVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKL

Query:  IVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYER
        ++L      ++ +D     V+ W+R+++LP   ++ + A K+GNI+G F EVD  K+ N  G  +R+K  +D+  PL RG  +     D   W+  +YER
Subjt:  IVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYER

Query:  LPEFCFCCGRIGHVVRECIEIKNKGEMNEEDF-----EFGVWLKFQGFSKILKRP
        LP FCF CG+IGH ++EC ++++  E N  D       +G WL+     +I + P
Subjt:  LPEFCFCCGRIGHVVRECIEIKNKGEMNEEDF-----EFGVWLKFQGFSKILKRP

A0A392M948 Zinc CCHC-type-like protein (Fragment) (e value: 3.0e-37; % identity: 36.45)
Query:  GRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRS
        G++ +    +  A K  M  AWR R       + +N ++FKF    + E V   GPW FD+ L++L +   N++ ++    +V  W+R+++LP+  ++  
Subjt:  GRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRS

Query:  SAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGRIGHVVRECIEIKNKG-----EMNEEDF
         A K+GNILG F E+D  K+ N  G  +RI+V MD+  PL RG  LS    D   W+  +YERLP FCF CGRIGH +R+C E+++       E+ E+D 
Subjt:  SAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGRIGHVVRECIEIKNKG-----EMNEEDF

Query:  EFGVWLKFQGFSKI
         FG WL+     K+
Subjt:  EFGVWLKFQGFSKI

A0A5C7IJL3 CCHC-type domain-containing protein (e value: 4.2e-39; % identity: 33.11)
Query:  EKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKLIVLGTPEANQ
        E  + HQIS +  EE V    +DHCLVG++LS + ++  A  + +   W    +   ++VG N ++F F++  DR  V   GPW FDK LIVL  PE   
Subjt:  EKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKLIVLGTPEANQ

Query:  RTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGR
          A   F K D W+++ ++PI   NR SA  +   +GE +E+ ++   +  G  +R+KVR+DI+ PL R   LS  K      + ++YER+PEFCF CGR
Subjt:  RTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGR

Query:  IGHVVRECIEIKNKGE-MNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKSDSDEVLVKETALVANIGEGINLADERTEKEGNNTRKEVEE
        +GH + EC +I+ K E M   +  FG+W++     K   +  S        +D    S + + K   L   +G   +   E         R  VE+
Subjt:  IGHVVRECIEIKNKGE-MNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKSDSDEVLVKETALVANIGEGINLADERTEKEGNNTRKEVEE

A0A6J1BSZ1 uncharacterized protein LOC111005481 (e value: 1.3e-40; % identity: 33.33)
Query:  MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTR-KQFSADTVGRNTYIFKFEDMGDREWVMHNGP
        M   +L++EWK   L  +E +  + ++    +     L+  L+ +LLS R IS   +KN +  AW+   K FS D +G N ++F F    DR  ++  GP
Subjt:  MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTR-KQFSADTVGRNTYIFKFEDMGDREWVMHNGP

Query:  WLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECW
        W FD+ LI++  P +  +  D  F+ V LW+  F+L +   N++ A ++GN +G F +V+++      G+ +R++VR D+  PL RG  L+     G CW
Subjt:  WLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECW

Query:  ISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEEDFEFGVWLKFQG
        I I+YERLP+F + CGR+ H++++C +     +   ++ ++G WL+FQG
Subjt:  ISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEEDFEFGVWLKFQG

A0A6J1DU55 uncharacterized protein LOC111023135 (e value: 4.8e-43; % identity: 34.86)
Query:  MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPW
        M+ E+L+ +W+K  L  +E    + ++ +  K+    L + LVG+LL+ RIIS   +   +  AW+   Q + +++G+N ++F F    D   VM  GPW
Subjt:  MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPW

Query:  LFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWI
         FDK LIVL  P +++  ++  F +V  W+ LF+LP+ + N++ A ++GN +G FV+VD +++G   G  +RI+V +DIT PL RG  ++     G CWI
Subjt:  LFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWI

Query:  SIRYERLPEFCFCCGRIGHVVREC-IEIKNKGEMNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKS--DSDEVLVKET
         I+YERLP+FC+ CG IGH   +C        + +    E+G WL+F G     ++    +    E   G S  +S E  V+ET
Subjt:  SIRYERLPEFCFCCGRIGHVVREC-IEIKNKGEMNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKS--DSDEVLVKET

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G42140.1 zinc ion binding;nucleic acid binding (e value: 7.7e-09; % identity: 21.02)
Query:  RTRKQFSADTVGRNTYIFKFEDMGDRE----WVMHNGPWLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTD
        R  K    + VGR   I K E +   E     ++  GPW F+  + V+      +  +D  FK++  W+++  +P+ +        IG  +G F+E +  
Subjt:  RTRKQFSADTVGRNTYIFKFEDMGDRE----WVMHNGPWLFDKKLIVLGTPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTD

Query:  KEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEED
        ++ ++                                 +  +YE+L  FC  CG + H   EC    N+G   ++D
Subjt:  KEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGRIGHVVRECIEIKNKGEMNEED
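
The middle line of each alignment block marks identical residues with the letter itself and conservative substitutions with `+`. A rough way to count both is sketched below. The function name `alignment_stats` is hypothetical, and the example uses only the short final segment of the A0A392M948 alignment, with its column spacing assumed to be reproduced correctly here. The result is therefore not expected to equal the whole-alignment percentage listed for that hit.

```python
def alignment_stats(midline: str, length: int) -> tuple[int, int, float]:
    """Count identities and positives from a BLAST-style midline.

    Letters mark identical residues; '+' marks conservative substitutions.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, 100.0 * identities / length


# Final segment of the A0A392M948 alignment (spacing assumed intact).
query = "EFGVWLKFQGFSKI"
midline = " FG WL+     K+"

identities, positives, pct_identity = alignment_stats(midline, len(query))
print(identities, positives, round(pct_identity, 2))
```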


Sequences
CDS sequence
ATGGAGATTGAAGATTTAATAGATGAATGGAAAAAGCTAAATCTGATGGAAAAAGAAAAGAGGCACCAGATATCACTGGAGGAAGAAGAAGAGAAAGTTTTATGCGGACA
GTTAGATCACTGCTTAGTGGGGAGACTCCTGTCAAACCGTATCATATCAAATATGGCGATCAAGAACGCAATGAATGGTGCTTGGCGAACAAGAAAACAATTCAGTGCAG
ACACAGTGGGACGGAATACGTACATTTTTAAATTCGAAGACATGGGGGACAGAGAATGGGTAATGCACAACGGACCATGGCTGTTTGACAAGAAATTAATAGTTCTGGGA
ACCCCGGAAGCAAACCAAAGAACTGCGGACTGGTCTTTCAAGAAGGTGGATCTTTGGCTGAGATTATTCAATCTTCCTATTGGATATAAAAACAGATCCTCGGCTGCAAA
GATTGGGAATATCTTAGGAGAATTTGTGGAGGTTGACACCGATAAAGAAGGAAATCTTAGGGGAAATCATATCAGGATCAAGGTGAGGATGGATATTACTTTACCTCTCT
TTAGAGGCTTCATGTTGTCAGGACCAAAGATAGATGGGGAATGTTGGATATCAATTAGATACGAAAGATTGCCTGAATTCTGCTTCTGTTGTGGGAGGATTGGTCATGTG
GTCCGAGAATGCATTGAGATTAAGAACAAAGGAGAGATGAATGAAGAAGACTTCGAATTTGGTGTCTGGCTTAAGTTCCAGGGTTTTTCGAAAATCCTTAAACGACCTGA
TTCCCCTCAGAAGGTTCGAGATGAAAGTAAAGATGGTAAAAGTGACTCAGATGAAGTATTGGTGAAAGAAACGGCTTTAGTTGCAAATATTGGAGAAGGAATAAACCTGG
CTGATGAACGAACAGAAAAGGAGGGGAATAACACAAGGAAAGAAGTGGAGGAAGAGTTGACTAAGGAGCACTTTCAAAGAAGATACTCGAATTTTGAATACACAGACATG
GAAATAAATTTGGGAATGGCAGAGAATCAATTAACCAATGAGAATAACCAAAATAATACTGATCTCTCACCAGGCTCTGAAAAATTGGTAGGAAGAAAACCAGCTGGAAA
AGAAAGACTAAAAGTGGTACGAATTTTGACCTGA
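
As a quick consistency check, the start of the CDS above can be translated with the standard genetic code. The sketch below is a minimal stand-alone translator, not the pipeline used to build this record. The names `CODON_TABLE` and `translate` are hypothetical, and the standard nuclear code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the CDS shown above.
cds_start = "ATGGAGATTGAAGATTTAATAGATGAATGGAAAAAGCTA"
print(translate(cds_start))  # MEIEDLIDEWKKL
```

The same function can be applied to the full CDS after joining its lines into one string.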
mRNA sequence
ATGGAGATTGAAGATTTAATAGATGAATGGAAAAAGCTAAATCTGATGGAAAAAGAAAAGAGGCACCAGATATCACTGGAGGAAGAAGAAGAGAAAGTTTTATGCGGACA
GTTAGATCACTGCTTAGTGGGGAGACTCCTGTCAAACCGTATCATATCAAATATGGCGATCAAGAACGCAATGAATGGTGCTTGGCGAACAAGAAAACAATTCAGTGCAG
ACACAGTGGGACGGAATACGTACATTTTTAAATTCGAAGACATGGGGGACAGAGAATGGGTAATGCACAACGGACCATGGCTGTTTGACAAGAAATTAATAGTTCTGGGA
ACCCCGGAAGCAAACCAAAGAACTGCGGACTGGTCTTTCAAGAAGGTGGATCTTTGGCTGAGATTATTCAATCTTCCTATTGGATATAAAAACAGATCCTCGGCTGCAAA
GATTGGGAATATCTTAGGAGAATTTGTGGAGGTTGACACCGATAAAGAAGGAAATCTTAGGGGAAATCATATCAGGATCAAGGTGAGGATGGATATTACTTTACCTCTCT
TTAGAGGCTTCATGTTGTCAGGACCAAAGATAGATGGGGAATGTTGGATATCAATTAGATACGAAAGATTGCCTGAATTCTGCTTCTGTTGTGGGAGGATTGGTCATGTG
GTCCGAGAATGCATTGAGATTAAGAACAAAGGAGAGATGAATGAAGAAGACTTCGAATTTGGTGTCTGGCTTAAGTTCCAGGGTTTTTCGAAAATCCTTAAACGACCTGA
TTCCCCTCAGAAGGTTCGAGATGAAAGTAAAGATGGTAAAAGTGACTCAGATGAAGTATTGGTGAAAGAAACGGCTTTAGTTGCAAATATTGGAGAAGGAATAAACCTGG
CTGATGAACGAACAGAAAAGGAGGGGAATAACACAAGGAAAGAAGTGGAGGAAGAGTTGACTAAGGAGCACTTTCAAAGAAGATACTCGAATTTTGAATACACAGACATG
GAAATAAATTTGGGAATGGCAGAGAATCAATTAACCAATGAGAATAACCAAAATAATACTGATCTCTCACCAGGCTCTGAAAAATTGGTAGGAAGAAAACCAGCTGGAAA
AGAAAGACTAAAAGTGGTACGAATTTTGACCTGA
Protein sequence
MEIEDLIDEWKKLNLMEKEKRHQISLEEEEEKVLCGQLDHCLVGRLLSNRIISNMAIKNAMNGAWRTRKQFSADTVGRNTYIFKFEDMGDREWVMHNGPWLFDKKLIVLG
TPEANQRTADWSFKKVDLWLRLFNLPIGYKNRSSAAKIGNILGEFVEVDTDKEGNLRGNHIRIKVRMDITLPLFRGFMLSGPKIDGECWISIRYERLPEFCFCCGRIGHV
VRECIEIKNKGEMNEEDFEFGVWLKFQGFSKILKRPDSPQKVRDESKDGKSDSDEVLVKETALVANIGEGINLADERTEKEGNNTRKEVEEELTKEHFQRRYSNFEYTDM
EINLGMAENQLTNENNQNNTDLSPGSEKLVGRKPAGKERLKVVRILT
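
The InterPro entry name "Zinc knuckle CX2CX4HX4C" can be read literally as a spacing pattern: Cys, two residues, Cys, four residues, His, four residues, Cys. The sketch below searches for that pattern in a fragment of the protein sequence above. It only illustrates the spacing. InterPro assigns domains with curated signatures rather than a plain regular expression, and the name `ZINC_KNUCKLE` is hypothetical.

```python
import re

# Literal reading of CX2CX4HX4C; '.' stands for any residue.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

# Fragment of the protein sequence above, spanning the CCHC region.
fragment = "ISIRYERLPEFCFCCGRIGHVVRECIEIKNKGE"

match = ZINC_KNUCKLE.search(fragment)
if match:
    print(match.start(), match.group())  # 11 CFCCGRIGHVVREC
```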