; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026182 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026182
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr10:31719966..31721803
RNA-Seq ExpressionLag0026182
SyntenyLag0026182
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006484927.1 uncharacterized protein LOC102626623 [Citrus sinensis]1.6e-3636.8Show/hide
Query:  DDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDK
        ++L+ R + + L  EE D V  +G      E      LVGKIL +R VN E  R  ++  W+  ++ +V+ +GDN+FV KF T  +K  V   GPW  DK
Subjt:  DDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDK

Query:  SFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRY
        + IV+V P+      +   FT  S WVQIH +P + M +    KL   IG VE+V  D   E IGP  RV +++D+TKPL+  L+++  G ++++  ++Y
Subjt:  SFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRY

Query:  ERLPDFCYECGLVTLRVGVPLYHRGTLRTKL
        +RLPDFC+ CGL+  +    + ++G  + KL
Subjt:  ERLPDFCYECGLVTLRVGVPLYHRGTLRTKL

XP_015382389.1 uncharacterized protein LOC107175483 [Citrus sinensis]4.7e-3636.52Show/hide
Query:  DLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKS
        +L+ R + + L  EE D V  +G      E      LV KIL +R VN +  R  +++ W+  ++ +V+ +GDN+FV KF T  +K  V   GPW  DK+
Subjt:  DLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKS

Query:  FIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYE
         IV+V P+      +  +FT  S WVQIH +P + M +    KL   IGLVE+V  D   E IGP   V +++D+TK LR  L+++  G ++++  ++YE
Subjt:  FIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYE

Query:  RLPDFCYECGLVTLRVGVPLYHRGTLRTKL
        RLPDFC+ CGL+  +    + ++G  + KL
Subjt:  RLPDFCYECGLVTLRVGVPLYHRGTLRTKL

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]1.7e-4942.06Show/hide
Query:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD
        MD++   W+    T +E + V +   +P+L+ D++KL +V K+ TS++++AEA R V++ VWRVH   R + +G N++V  FK+L EK+ V  +GPWT +
Subjt:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD

Query:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR
        KS +V+  P A  + P +++F   + W+QIHN+PF  ++  MA  L   +G VE++ GDG D W GP +RV V +D++KPLR G+ +++   +++ CPLR
Subjt:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR

Query:  YERLPDFCYECGLV
        YE+LPDFCYECG +
Subjt:  YERLPDFCYECGLV

XP_022155933.1 uncharacterized protein LOC111022932 [Momordica charantia]5.3e-4038.12Show/hide
Query:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD
        MD++   W+   LT E+ + + +   +PL++ ++++ Y VGK+ TS++++ EAFR V++ +W+VH    ++  G N++V  FK++ EKN V  +GPW+ +
Subjt:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD

Query:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR
         S +V+  P A  D P++++F   +LW+QIH +PF+ M + MA  L   IG VE++  +G  EW GP +RV V +D++KP + GL +R    +E  CPLR
Subjt:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR

Query:  YE
        YE
Subjt:  YE

XP_022156711.1 uncharacterized protein LOC111023555 [Momordica charantia]5.5e-4540.19Show/hide
Query:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD
        MDD+   W+   L  EE D + +   +P+L+ D+I+L  VGK+  S+++  EAF  V+++VW++H   R++  G N++V  FKT+ EK  VF  GPWT D
Subjt:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD

Query:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR
        KS +++++P    + P ++  +  + WVQIH + F  MT+ MA  L   +G VE+V G    +W+ P + V V +++ KPLR GL ++    +++ CPLR
Subjt:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR

Query:  YERLPDFCYECGLV
        YERLPDFCY CG V
Subjt:  YERLPDFCYECGLV

TrEMBL top hitse value%identityAlignment
A0A5C7IB82 DUF4283 domain-containing protein4.9e-3139.9Show/hide
Query:  LVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYM
        LVGKIL  ++V  EAF+ VI K+W++ ++  V+ I  N F   F    +K+ V + GPW+ D  F+V+  P   + D   + F     WVQI NVP   M
Subjt:  LVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYM

Query:  TRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGN-QELLCPLRYERLPDFCYECGLVTL------RVGVPLYHRGTLR
        T  +   L   IG V+DV G G    +G  + V V LD+ KPLR  L I +LG+ +E + PL+YERLPDFC++CGL+        R+G   Y  G  R
Subjt:  TRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGN-QELLCPLRYERLPDFCYECGLVTL------RVGVPLYHRGTLR

A0A6J1D765 uncharacterized protein LOC1110179028.0e-5042.06Show/hide
Query:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD
        MD++   W+    T +E + V +   +P+L+ D++KL +V K+ TS++++AEA R V++ VWRVH   R + +G N++V  FK+L EK+ V  +GPWT +
Subjt:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD

Query:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR
        KS +V+  P A  + P +++F   + W+QIHN+PF  ++  MA  L   +G VE++ GDG D W GP +RV V +D++KPLR G+ +++   +++ CPLR
Subjt:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR

Query:  YERLPDFCYECGLV
        YE+LPDFCYECG +
Subjt:  YERLPDFCYECGLV

A0A6J1DP89 uncharacterized protein LOC1110229322.6e-4038.12Show/hide
Query:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD
        MD++   W+   LT E+ + + +   +PL++ ++++ Y VGK+ TS++++ EAFR V++ +W+VH    ++  G N++V  FK++ EKN V  +GPW+ +
Subjt:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD

Query:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR
         S +V+  P A  D P++++F   +LW+QIH +PF+ M + MA  L   IG VE++  +G  EW GP +RV V +D++KP + GL +R    +E  CPLR
Subjt:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR

Query:  YE
        YE
Subjt:  YE

A0A6J1DVS4 uncharacterized protein LOC1110235552.7e-4540.19Show/hide
Query:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD
        MDD+   W+   L  EE D + +   +P+L+ D+I+L  VGK+  S+++  EAF  V+++VW++H   R++  G N++V  FKT+ EK  VF  GPWT D
Subjt:  MDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD

Query:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR
        KS +++++P    + P ++  +  + WVQIH + F  MT+ MA  L   +G VE+V G    +W+ P + V V +++ KPLR GL ++    +++ CPLR
Subjt:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLR

Query:  YERLPDFCYECGLV
        YERLPDFCY CG V
Subjt:  YERLPDFCYECGLV

A0A6J1DX30 uncharacterized protein LOC1110248743.1e-3323.54Show/hide
Query:  DLVTRWQGLYLTFEETD-VVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDN-RVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD
        DL+  W+   LT EE +  + +    P  +   ++  LVGK+   R +     ++ +R  W++  +   V  +G N+F+  F   +++N ++++GPWT D
Subjt:  DLVTRWQGLYLTFEETD-VVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDN-RVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSD

Query:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIR---DLGNQELLC
        ++ +++  P+A    P  L FT + +WV+  ++P   +TR MA++L   +G  E+   D L+   G  +RV V LD++KPLR G+ +     +G   +  
Subjt:  KSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIR---DLGNQELLC

Query:  PLRYERLPDFCYECGLVTLR----VGVPLYHRGTLRTKLSNM----------------------------------------------------------
        P++YERLPDFCY CGL + R     G  L ++GT++  +  M                                                          
Subjt:  PLRYERLPDFCYECGLVTLR----VGVPLYHRGTLRTKLSNM----------------------------------------------------------

Query:  -----------------------------------------------------ARGVGFYGFPETELHHRSWSLLRQLRGSPDTPWLVGGDFNAILQHSE
                                                              R  GFYG P     H +W LLR++     +PWL+GGD NAIL + E
Subjt:  -----------------------------------------------------ARGVGFYGFPETELHHRSWSLLRQLRGSPDTPWLVGGDFNAILQHSE

Query:  KSEGEDLELVSTESWRFLYPQCEVSHLDY
         S     +    E++R +   C ++ + +
Subjt:  KSEGEDLELVSTESWRFLYPQCEVSHLDY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding1.0e-0923.12Show/hide
Query:  EETDVVALMGVEPLLSEDSI-KLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQD
        E+ + V  +G E L + + + K  ++ K+L S ++        +R++W+      V  +    F+ +F+   E       GPW    ++++V    + + 
Subjt:  EETDVVALMGVEPLLSEDSI-KLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQD

Query:  DPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYERLPDFCYECGL
        DP      T  +WV++ N+P+ Y  R + +++ R +G    V  + ++   G   RV + ++L KPL+  ++I   G++  +    YE L   C  CG+
Subjt:  DPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYERLPDFCYECGL

AT2G17920.1 nucleic acid binding;zinc ion binding1.0e-0925Show/hide
Query:  KLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPF
        +L ++ + L  R  N +A    + + W +       +I D      F++ M+   V R  PW  +  F   V     Q  P     TT+ LWVQ+  +PF
Subjt:  KLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPF

Query:  RYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYERLPDFCYEC
         Y++   A+++ + IG +  +            +RV V + +T  LR    I     +  L   +YERL   C  C
Subjt:  RYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYERLPDFCYEC

AT3G31430.1 unknown protein4.2e-1130.15Show/hide
Query:  VFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDL
        V R GPW  +   I++      + +PQ   F  +  WVQI  +PF+++ R +   + R +G V D   +          RV +  D+T PLR     +  
Subjt:  VFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHGLMIRDL

Query:  GNQELLCPLRYERLPDFCYECGLVTLRVGVPLYHRG
             L   RYERL  FC  CG++T   G  L   G
Subjt:  GNQELLCPLRYERLPDFCYECGLVTLRVGVPLYHRG

AT5G18636.1 unknown protein3.5e-0522.04Show/hide
Query:  LLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWV
        ++ E S  L L+ + L  R  N  +    + + W +       ++        F   ++   V R  PW  +  F+        Q  P +   TT+ LWV
Subjt:  LLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWV

Query:  QIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGP---IMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYERLPDFCYEC
        QI  +P  Y++    +++ + +G   ++      E   P    +RV V   +T  LR    I     +      +YERL   C  C
Subjt:  QIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGP---IMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYERLPDFCYEC

AT5G25200.1 unknown protein3.5e-0522.04Show/hide
Query:  LLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWV
        ++ E S  L L+ + L  R  N  +    + + W +       ++        F   ++   V R  PW  +  F+        Q  P +   TT+ LWV
Subjt:  LLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIGDNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWV

Query:  QIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGP---IMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYERLPDFCYEC
        QI  +P  Y++    +++ + +G   ++      E   P    +RV V   +T  LR    I     +      +YERL   C  C
Subjt:  QIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGP---IMRVWVALDLTKPLRHGLMIRDLGNQELLCPLRYERLPDFCYEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGTGGGTGGGTTTTTGGGTATTATTTTTGCTGGTTTGATTGAATCTGAGGGTTTGTATTGTACCTTTGTTCTTCTAGGGTGTGTTTGTGTTATCACGGTTCGAAT
GGATGATCTTGTGACTCGATGGCAAGGACTGTATCTCACTTTCGAGGAAACAGATGTCGTTGCTCTAATGGGGGTAGAACCTTTACTATCAGAGGATTCCATCAAATTAT
ATCTGGTAGGGAAGATTCTGACCAGTCGAAAGGTAAATGCAGAGGCTTTTCGCGATGTCATCAGGAAAGTTTGGCGAGTGCATCGTGACAACCGTGTGGATCTTATTGGT
GATAATGTCTTTGTTGCCAAGTTCAAAACTCTGATGGAAAAGAATCATGTCTTTCGAGCAGGGCCTTGGACTTCTGATAAGAGTTTCATTGTTGTGGTTATTCCTATTGC
GGATCAGGACGATCCACAGAATTTGTCATTCACAACTGTTTCTTTATGGGTCCAGATCCATAATGTTCCATTTCGGTACATGACTCGCTCTATGGCTGTGAAGTTGAGGC
GAACTATTGGCTTAGTAGAAGATGTTGCTGGTGATGGTTTGGATGAATGGATAGGTCCGATAATGCGGGTCTGGGTTGCATTGGATCTGACGAAGCCTCTTCGTCATGGT
CTAATGATTCGTGACCTTGGCAATCAAGAACTTCTATGTCCGCTAAGGTATGAACGCCTCCCTGATTTTTGTTATGAATGTGGTCTGGTCACTCTTCGCGTGGGTGTTCC
TCTGTACCATCGGGGTACCCTTCGGACAAAGTTGAGCAATATGGCGCGTGGTGTTGGGTTCTATGGTTTTCCAGAAACTGAGCTACATCATAGATCTTGGAGTCTTCTCC
GACAGCTACGTGGTTCTCCAGATACACCTTGGCTAGTGGGAGGAGATTTCAATGCGATACTTCAGCATTCAGAAAAATCAGAGGGGGAGGACCTCGAGTTAGTAAGCACC
GAATCGTGGCGGTTCCTTTACCCGCAGTGTGAGGTCTCTCACTTGGATTACCATAGTTTAAAGCATCAACCACTGGCCTTATGTTTACGACCCATTACTCCGTTACTCCG
ATCATCTGGGGTCGTATTTCCCGTTTTGAGGAGGTCGAAATCGAGCAATTACAAAATTCGTATTACAGCGGCGTCCTCCCGAGTCCAATCTGAGATTGAGAATCTTGGCT
CCTACTCTAATCGAACAAAGTTGCTTGCAGCTGAGAGGGATTTGGATCAGTTGCTCGCAAAGAGATTTTTTGGAAGCAGAGATCACGGGACCAATGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGTGGGTGGGTTTTTGGGTATTATTTTTGCTGGTTTGATTGAATCTGAGGGTTTGTATTGTACCTTTGTTCTTCTAGGGTGTGTTTGTGTTATCACGGTTCGAAT
GGATGATCTTGTGACTCGATGGCAAGGACTGTATCTCACTTTCGAGGAAACAGATGTCGTTGCTCTAATGGGGGTAGAACCTTTACTATCAGAGGATTCCATCAAATTAT
ATCTGGTAGGGAAGATTCTGACCAGTCGAAAGGTAAATGCAGAGGCTTTTCGCGATGTCATCAGGAAAGTTTGGCGAGTGCATCGTGACAACCGTGTGGATCTTATTGGT
GATAATGTCTTTGTTGCCAAGTTCAAAACTCTGATGGAAAAGAATCATGTCTTTCGAGCAGGGCCTTGGACTTCTGATAAGAGTTTCATTGTTGTGGTTATTCCTATTGC
GGATCAGGACGATCCACAGAATTTGTCATTCACAACTGTTTCTTTATGGGTCCAGATCCATAATGTTCCATTTCGGTACATGACTCGCTCTATGGCTGTGAAGTTGAGGC
GAACTATTGGCTTAGTAGAAGATGTTGCTGGTGATGGTTTGGATGAATGGATAGGTCCGATAATGCGGGTCTGGGTTGCATTGGATCTGACGAAGCCTCTTCGTCATGGT
CTAATGATTCGTGACCTTGGCAATCAAGAACTTCTATGTCCGCTAAGGTATGAACGCCTCCCTGATTTTTGTTATGAATGTGGTCTGGTCACTCTTCGCGTGGGTGTTCC
TCTGTACCATCGGGGTACCCTTCGGACAAAGTTGAGCAATATGGCGCGTGGTGTTGGGTTCTATGGTTTTCCAGAAACTGAGCTACATCATAGATCTTGGAGTCTTCTCC
GACAGCTACGTGGTTCTCCAGATACACCTTGGCTAGTGGGAGGAGATTTCAATGCGATACTTCAGCATTCAGAAAAATCAGAGGGGGAGGACCTCGAGTTAGTAAGCACC
GAATCGTGGCGGTTCCTTTACCCGCAGTGTGAGGTCTCTCACTTGGATTACCATAGTTTAAAGCATCAACCACTGGCCTTATGTTTACGACCCATTACTCCGTTACTCCG
ATCATCTGGGGTCGTATTTCCCGTTTTGAGGAGGTCGAAATCGAGCAATTACAAAATTCGTATTACAGCGGCGTCCTCCCGAGTCCAATCTGAGATTGAGAATCTTGGCT
CCTACTCTAATCGAACAAAGTTGCTTGCAGCTGAGAGGGATTTGGATCAGTTGCTCGCAAAGAGATTTTTTGGAAGCAGAGATCACGGGACCAATGGCTAG
Protein sequenceShow/hide protein sequence
MRVGGFLGIIFAGLIESEGLYCTFVLLGCVCVITVRMDDLVTRWQGLYLTFEETDVVALMGVEPLLSEDSIKLYLVGKILTSRKVNAEAFRDVIRKVWRVHRDNRVDLIG
DNVFVAKFKTLMEKNHVFRAGPWTSDKSFIVVVIPIADQDDPQNLSFTTVSLWVQIHNVPFRYMTRSMAVKLRRTIGLVEDVAGDGLDEWIGPIMRVWVALDLTKPLRHG
LMIRDLGNQELLCPLRYERLPDFCYECGLVTLRVGVPLYHRGTLRTKLSNMARGVGFYGFPETELHHRSWSLLRQLRGSPDTPWLVGGDFNAILQHSEKSEGEDLELVST
ESWRFLYPQCEVSHLDYHSLKHQPLALCLRPITPLLRSSGVVFPVLRRSKSSNYKIRITAASSRVQSEIENLGSYSNRTKLLAAERDLDQLLAKRFFGSRDHGTNG