; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032169 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032169
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr11:26524548..26525501
RNA-Seq ExpressionLag0032169
SyntenyLag0032169
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006484927.1 uncharacterized protein LOC102626623 [Citrus sinensis]7.1e-4239.15Show/hide
Query:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR
        E+L  +   + L + E  AV          E     C VG++L +R VN + LR  M  AW   +   +ESLGDNIFV +F +  +KKR+L+ GPW+FD+
Subjt:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR

Query:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYE
        AL VLV P G  S     FT  +FWV+I+ +P   +  G    LG +IG V EV  +  G+ IGP  R+++ +D+T+PL+R++ L+ + +      I+Y+
Subjt:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYE

Query:  RLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFG
        RLPDFCF CG IGH  REC+       P ++L +G
Subjt:  RLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFG

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]1.8e-5338.3Show/hide
Query:  IEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD
        ++++ D W     +  E++ V +    P+L    V++C V ++ TS+ ++ +A+R VM   W  H ST  E LG NI+V+ F SL EK R+LSSGPW F+
Subjt:  IEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD

Query:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQY
        ++L VL SP+  + P  ++F  CAFW++I+ +PF  ++  MA  LG+++G V E+ G+G   W GP +R++V +DV++PLRR ++L+       WCP++Y
Subjt:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQY

Query:  ERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDSDNRQRRGSGRGLGRGMGRGR
        E+LPDFC+ CG IGHS REC   +     +    +GD L A  L++S     E+    G       +   GRG GRG  R R
Subjt:  ERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDSDNRQRRGSGRGLGRGMGRGR

XP_022156711.1 uncharacterized protein LOC111023555 [Momordica charantia]1.2e-4639.27Show/hide
Query:  IEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD
        ++D+   W    L + E   + +    P+L    +Q+CAVG++  S+ +  +A   VM   W  H ST IE+ G NI+V+ F ++ EK R+ S GPW FD
Subjt:  IEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD

Query:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQY
        ++L +LV  + A+ P  +D + CAFWV+I+ + F  +T  MA+ LG+ +G V EV G    DW+ P + ++V ++V +PLRR ++++       WCP++Y
Subjt:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQY

Query:  ERLPDFCFRCGCIGHSHRE
        ERLPDFC+ CGC+GHS RE
Subjt:  ERLPDFCFRCGCIGHSHRE

XP_024033132.1 uncharacterized protein LOC112095437 [Citrus clementina]3.5e-4140.76Show/hide
Query:  CAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDRALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFL
        C VG++L +R VN + L+  +  AW       +ESLG NIFV +F S  +KKR+ + GPW+FDRAL VL  P G  S     F+  +FW+RI+ VP   +
Subjt:  CAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDRALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFL

Query:  TPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGD
           + R LGS +G V ++  +  G+  G  +R+QV +++T+PL++++ L+ + D     P+ YERLPDFCF CGCIGH  REC+         + LPFG 
Subjt:  TPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGD

Query:  GLCAPPLRRST
         L A  L   T
Subjt:  GLCAPPLRRST

XP_024953751.1 uncharacterized protein LOC112498094 [Citrus sinensis]1.2e-4138.4Show/hide
Query:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR
        E+L  +   + LS+ E   V   +      E  V  C +G+V+ +R V+ + L+  M   W   R   IESLGDNIF+ +F S  +K+ IL  GPW+FDR
Subjt:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR

Query:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQ--
        AL VL+ P G       DF+  +FWV+I+ VP   +T  M  ALG  IG V EV  +  G+  G  +R+++ +D+TRPL++++ L+ +  +    P+Q  
Subjt:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQ--

Query:  YERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDS
        YERLPDFCF CG IGH +REC    S S   D L +G  L A  +   T +  +  G D  DS
Subjt:  YERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDS

TrEMBL top hitse value%identityAlignment
A0A1S8AC25 CCHC-type domain-containing protein (Fragment)1.3e-3836.78Show/hide
Query:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR
        E+L  +   + LS+ E   V       +  E  +  C VG+VL +R V+ + L+  M   W   R   IE LG+N+F+ +F S  +K+ I+  GPW+FDR
Subjt:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR

Query:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCP--IQ
        AL  L  P+G       DF+  +FWV+I+ VP   ++  MA  LG  IG V EV  +  G+  G  +R+++ +D+T+PL++++ L+ + +     P  + 
Subjt:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCP--IQ

Query:  YERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCA
        YERLPDFCF CG IGH +REC    S S   D L +G  L A
Subjt:  YERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCA

A0A6J1D765 uncharacterized protein LOC1110179028.7e-5438.3Show/hide
Query:  IEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD
        ++++ D W     +  E++ V +    P+L    V++C V ++ TS+ ++ +A+R VM   W  H ST  E LG NI+V+ F SL EK R+LSSGPW F+
Subjt:  IEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD

Query:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQY
        ++L VL SP+  + P  ++F  CAFW++I+ +PF  ++  MA  LG+++G V E+ G+G   W GP +R++V +DV++PLRR ++L+       WCP++Y
Subjt:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQY

Query:  ERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDSDNRQRRGSGRGLGRGMGRGR
        E+LPDFC+ CG IGHS REC   +     +    +GD L A  L++S     E+    G       +   GRG GRG  R R
Subjt:  ERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDSDNRQRRGSGRGLGRGMGRGR

A0A6J1DU55 uncharacterized protein LOC1110231352.3e-3835.42Show/hide
Query:  EDLADQWSKMGL-SEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD
        E+L   W K  L SE +  A+ V  +A  + E  +    VG++L  R ++ D L RV+L AW       +ES+G N+F+  F    +  R++ +GPW FD
Subjt:  EDLADQWSKMGL-SEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD

Query:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTD-SYRWCPIQ
        +AL VL  P  + + + L+F + AFW+ +  +P ++L   MA  LG+ IG  V+V     G   G  +R++V++D+T+PLRR +++ +D      W PIQ
Subjt:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTD-SYRWCPIQ

Query:  YERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDSDNRQRRGS
        YERLPDFC+ CG IGHS  +C      +  D R     G   P LR    KA   +G  G         GS
Subjt:  YERLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDSDNRQRRGS

A0A6J1DVS4 uncharacterized protein LOC1110235556.0e-4739.27Show/hide
Query:  IEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD
        ++D+   W    L + E   + +    P+L    +Q+CAVG++  S+ +  +A   VM   W  H ST IE+ G NI+V+ F ++ EK R+ S GPW FD
Subjt:  IEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFD

Query:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQY
        ++L +LV  + A+ P  +D + CAFWV+I+ + F  +T  MA+ LG+ +G V EV G    DW+ P + ++V ++V +PLRR ++++       WCP++Y
Subjt:  RALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQY

Query:  ERLPDFCFRCGCIGHSHRE
        ERLPDFC+ CGC+GHS RE
Subjt:  ERLPDFCFRCGCIGHSHRE

A0A803L2P2 Uncharacterized protein1.1e-3734.84Show/hide
Query:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR
        +D+AD+W++M ++E ES  V +   +    EA V +  +G++LT R VN +A +R M+ +W      +I SLG N++  +F+   +K+R+++  PW F++
Subjt:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR

Query:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYE
         L +L S SG + PT +  T   FW+RI  +PFN  +    RA+ + +G V+EV  E     +G   R++V LDV +PLRR   ++    +     ++YE
Subjt:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYE

Query:  RLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLR
        RLP FCF CG +GHS R+C+     S  +    +G  L A P++
Subjt:  RLPDFCFRCGCIGHSHRECVMENSPSMPDDRLPFGDGLCAPPLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G13450.1 unknown protein4.2e-0820.26Show/hide
Query:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR
        ++L D+   + L   E  A+ +P  A  + E+  ++  + R L  R  N  A+   +  AW          L D      F S  +   +L   PW ++ 
Subjt:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR

Query:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYE
          + + +     + T    T    WV++  +P  ++    A  +  E+G ++ +            +R+++   +T  LR  +R+  D+        QYE
Subjt:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYE

Query:  RLPDFCFRCGCIGHSHRECVMENSPSM
        RL   C  C  + H    C+     S+
Subjt:  RLPDFCFRCGCIGHSHRECVMENSPSM

AT2G17920.1 nucleic acid binding;zinc ion binding4.9e-0921Show/hide
Query:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR
        ++L D+   + L + E  ++ +P  A ++     ++  + R L  R  N  A+   +  AW          + D      F S  +   +    PW F+ 
Subjt:  EDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDR

Query:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYE
          + + S     +P L   T    WV++  +PF +++   A  +  EIG ++ +            +R++V + +T  LR   R+  ++        QYE
Subjt:  ALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYE

Query:  RLPDFCFRCGCIGHSHREC
        RL   C  C    H+   C
Subjt:  RLPDFCFRCGCIGHSHREC

AT3G31430.1 unknown protein2.0e-1029.5Show/hide
Query:  ILSSGPWNFDRALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRM---QVVL--DVTRPLRRVVR
        +L  GPW F+  + +L        P +  F    FWV+I  +PF FL  G+   +G  +G V++         +  + RM   +V+L  D+T PLR    
Subjt:  ILSSGPWNFDRALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRM---QVVL--DVTRPLRRVVR

Query:  LQLDTDSYRWCPIQYERLPDFCFRCGCIGHSHRECVMEN
         Q           +YERL  FC  CG + H    C+++N
Subjt:  LQLDTDSYRWCPIQYERLPDFCFRCGCIGHSHRECVMEN

AT3G42140.1 zinc ion binding;nucleic acid binding6.0e-0724.48Show/hide
Query:  ILSSGPWNFDRALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDT
        IL  GPW+F+  + V+   +   S    +F +  FW++I  +P  FLT  +  ++G  +G  +E                               L  D 
Subjt:  ILSSGPWNFDRALFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDT

Query:  DSYRWCPIQYERLPDFCFRCGCIGHSHRECVMENS--PSMPDD
           ++   QYE+L +FC  CG + H   EC    +  P   DD
Subjt:  DSYRWCPIQYERLPDFCFRCGCIGHSHRECVMENS--PSMPDD

AT3G47920.1 unknown protein4.8e-0424.3Show/hide
Query:  SPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYERLPDFCFRCGCI
        +P L   T    WV++  +P  ++    A  +  EIG ++ +            +R++V + +T  LR   R+  D+        QYERL   C  C  +
Subjt:  SPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYERLPDFCFRCGCI

Query:  GHSHREC
         H    C
Subjt:  GHSHREC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGGAATGGAAGAAGGGACCATCGAGGATCTCGCAGATCAATGGAGCAAGATGGGTCTGTCTGAGGCGGAGTCAAAGGCAGTCCCTGTGCCGGCGAATGCCCCCCT
CCTTGATGAAGCGACGGTTCAGGTATGTGCGGTGGGGAGAGTACTGACAAGCCGGTTTGTTAATCCAGATGCTCTTAGGAGGGTGATGTTGTTTGCGTGGAATGCCCACC
GGTCGACGTTGATCGAGTCGCTGGGAGATAATATTTTTGTGGTTCGGTTCTACTCCTTGGGTGAGAAGAAACGTATACTGAGTTCAGGGCCCTGGAATTTTGATAGAGCT
CTGTTTGTTTTAGTGTCTCCCTCTGGTGCTGATAGTCCAACGCTGCTTGATTTCACGAAATGTGCTTTCTGGGTGCGCATTAATCAGGTACCGTTCAATTTTCTTACTCC
TGGAATGGCTCGGGCCCTTGGTAGTGAGATAGGGCCGGTTGTTGAGGTGCCAGGTGAGGGATGGGGTGACTGGATTGGGCCTATAATGAGAATGCAGGTTGTGTTGGATG
TCACTAGACCTTTGCGTCGGGTGGTCCGTCTCCAATTAGATACTGATAGTTACAGGTGGTGCCCAATTCAGTACGAGCGTCTTCCCGATTTTTGTTTTAGGTGTGGGTGC
ATTGGGCACTCCCATCGTGAATGTGTTATGGAGAATTCACCGTCAATGCCCGATGATCGGCTTCCTTTCGGTGATGGGCTCTGTGCTCCACCTCTGAGGCGTTCAACCCC
GAAAGCCATGGAGGATGAGGGTGGGGATGGTAATGACAGTGATAATAGGCAGCGAAGGGGGTCGGGTAGAGGGTTGGGTCGAGGTATGGGTAGGGGTCGGGGGCCTGCTG
TGGTTGGGGAGGAGGTTGGGGGGTTGTTAGGGAACCGGGGCAGGAGGGGGGAGAGGTGTTGGAGAGAGAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGGAATGGAAGAAGGGACCATCGAGGATCTCGCAGATCAATGGAGCAAGATGGGTCTGTCTGAGGCGGAGTCAAAGGCAGTCCCTGTGCCGGCGAATGCCCCCCT
CCTTGATGAAGCGACGGTTCAGGTATGTGCGGTGGGGAGAGTACTGACAAGCCGGTTTGTTAATCCAGATGCTCTTAGGAGGGTGATGTTGTTTGCGTGGAATGCCCACC
GGTCGACGTTGATCGAGTCGCTGGGAGATAATATTTTTGTGGTTCGGTTCTACTCCTTGGGTGAGAAGAAACGTATACTGAGTTCAGGGCCCTGGAATTTTGATAGAGCT
CTGTTTGTTTTAGTGTCTCCCTCTGGTGCTGATAGTCCAACGCTGCTTGATTTCACGAAATGTGCTTTCTGGGTGCGCATTAATCAGGTACCGTTCAATTTTCTTACTCC
TGGAATGGCTCGGGCCCTTGGTAGTGAGATAGGGCCGGTTGTTGAGGTGCCAGGTGAGGGATGGGGTGACTGGATTGGGCCTATAATGAGAATGCAGGTTGTGTTGGATG
TCACTAGACCTTTGCGTCGGGTGGTCCGTCTCCAATTAGATACTGATAGTTACAGGTGGTGCCCAATTCAGTACGAGCGTCTTCCCGATTTTTGTTTTAGGTGTGGGTGC
ATTGGGCACTCCCATCGTGAATGTGTTATGGAGAATTCACCGTCAATGCCCGATGATCGGCTTCCTTTCGGTGATGGGCTCTGTGCTCCACCTCTGAGGCGTTCAACCCC
GAAAGCCATGGAGGATGAGGGTGGGGATGGTAATGACAGTGATAATAGGCAGCGAAGGGGGTCGGGTAGAGGGTTGGGTCGAGGTATGGGTAGGGGTCGGGGGCCTGCTG
TGGTTGGGGAGGAGGTTGGGGGGTTGTTAGGGAACCGGGGCAGGAGGGGGGAGAGGTGTTGGAGAGAGAGGTAG
Protein sequenceShow/hide protein sequence
MAGMEEGTIEDLADQWSKMGLSEAESKAVPVPANAPLLDEATVQVCAVGRVLTSRFVNPDALRRVMLFAWNAHRSTLIESLGDNIFVVRFYSLGEKKRILSSGPWNFDRA
LFVLVSPSGADSPTLLDFTKCAFWVRINQVPFNFLTPGMARALGSEIGPVVEVPGEGWGDWIGPIMRMQVVLDVTRPLRRVVRLQLDTDSYRWCPIQYERLPDFCFRCGC
IGHSHRECVMENSPSMPDDRLPFGDGLCAPPLRRSTPKAMEDEGGDGNDSDNRQRRGSGRGLGRGMGRGRGPAVVGEEVGGLLGNRGRRGERCWRER