; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015384 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015384
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr12:11956748..11958700
RNA-Seq ExpressionLag0015384
SyntenyLag0015384
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]6.7e-5540.96Show/hide
Query:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKID-NGLQVDRLGRNLFIFRFSAEADRVLVFRQGP
        M +S+L+++W    LT++E++I+V  D  A+E TG  L   L+ KLL  R ++  V++   + AWK+D     VD +G N+F+F F+  +DR  + R GP
Subjt:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKID-NGLQVDRLGRNLFIFRFSAEADRVLVFRQGP

Query:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW
        WTF+R L+++  P+   +P D  F +V  W+H F+L L   N+ MA RLGNA+G+F+DV+S  N   WG+ LR++VR D+ + + RGI++  DGP+ G W
Subjt:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW

Query:  VPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTG
        +PI+YERLP+   HCG + H+ +DC+         S   QY  WLRF G
Subjt:  VPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]8.7e-6343.96Show/hide
Query:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPW
        MD  +L+ DW +  LT++E+EI++  D +AV+     L + L+GKLL  R ++A+V+ R    AWK+++ L V+ +G+NLF+F F  E D   V + GPW
Subjt:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPW

Query:  TFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWV
         F++ L+VL  P      S+  F+ V FWIH+F+LP+ W N+ MA RLGNA+G F DVD       WGASLRI+V ID+ + +RRGI+I  DGP+ G W+
Subjt:  TFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWV

Query:  PIRYERLPELCSHCGIIGHLSRDC-ARVLRMEREFSAPPQYEDWLRFTGKSMITASVAVRGEMQVRDQEISHS
        PI+YERLP+ C  CG+IGH S DC AR L  + +  A  +Y  WLRF G S   A    +G+   R+     S
Subjt:  PIRYERLPELCSHCGIIGHLSRDC-ARVLRMEREFSAPPQYEDWLRFTGKSMITASVAVRGEMQVRDQEISHS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.9e-4938.96Show/hide
Query:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKID-NGLQVDRLGRNLFIFRFSAEADRVLVFRQGP
        M +  L+++W    LT++EEE ++  D  A   TG  L   L+GKL   RP+   VM+   R AWK++ N  +V  LG NLF+F F+   DR  +++ GP
Subjt:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKID-NGLQVDRLGRNLFIFRFSAEADRVLVFRQGP

Query:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW
        WTF+R L+++  P+  I PS+  F+ +  W+  F+LPL    R MA RLGNALG F++ D  +    WG++LR++V +D+++ +RRGI++  DGP+ G W
Subjt:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW

Query:  VPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTG
        +PI+YERLP+ C HCG+     +                QY  WLR+ G
Subjt:  VPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTG

XP_028117212.1 uncharacterized protein LOC114314884 [Camellia sinensis]5.2e-3934.58Show/hide
Query:  VDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPWTFERFL
        ++D +   + T+EEE  VV D+  +  T    G CL+GKLL  RP   + +R    + WK   G+     G NL + +F    D+  V   GPW+F++ L
Subjt:  VDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPWTFERFL

Query:  LVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRYER
        ++L    R ++PS+  FS++ FW+H   LPL    R +   LGN LG F D++  +  + WG +L I++ I++++ +RRG+++   G    +W+ ++YER
Subjt:  LVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRYER

Query:  LPELCSHCGIIGHLSRDCA-RVLRMEREFSAPPQYEDWLR
        LP  C HCG++GH   DC  R+L          QY  WLR
Subjt:  LPELCSHCGIIGHLSRDCA-RVLRMEREFSAPPQYEDWLR

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]2.0e-3836.03Show/hide
Query:  SSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLG---FCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGP
        + SL+D    LSLT++E+ +  +      E T + +G    CL+GKLL  RP   E M+    + W+   G+QV  +G NLF+F F    D+  V   GP
Subjt:  SSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLG---FCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGP

Query:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW
        WTF++ LL+L      ++PSD   + V FW+HV  LPL   N+ + E +GNA+G F D+D  +  + WG ++RI+V +D+ + +RRG+++        +W
Subjt:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW

Query:  VPIRYERLPELCSHCGIIGHLSRDC-ARVLRMEREFSAPPQYEDWLR
        V  +YERLP  C  CG +GH  R+C  ++   +       QY  WLR
Subjt:  VPIRYERLPELCSHCGIIGHLSRDC-ARVLRMEREFSAPPQYEDWLR

TrEMBL top hitse value%identityAlignment
A0A2N9GF83 CCHC-type domain-containing protein1.0e-3736.36Show/hide
Query:  LVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPWTFERF
        LV+DW R SLT DE       D+EA+     F   CLLGKLL  +      ++      W +  G+    +G NLF+F+F  E+D   VF+  PW F+  
Subjt:  LVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPWTFERF

Query:  LLVLVFPIRGIRPSDH-PFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRY
        LLVL     G  P++   F+S CFW+ +  +PL +  +   ER+G A+GI + VD   + + WG  LR+++ +D+ + ++RG R+   G    +W+  +Y
Subjt:  LLVLVFPIRGIRPSDH-PFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRY

Query:  ERLPELCSHCGIIGHLSRDCARVLR-MEREFSAPPQYEDWLR
        ERLP  C HCG +GH  R+C   LR      S   QY  WLR
Subjt:  ERLPELCSHCGIIGHLSRDCARVLR-MEREFSAPPQYEDWLR

A0A6J1BSZ1 uncharacterized protein LOC1110054813.2e-5540.96Show/hide
Query:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKID-NGLQVDRLGRNLFIFRFSAEADRVLVFRQGP
        M +S+L+++W    LT++E++I+V  D  A+E TG  L   L+ KLL  R ++  V++   + AWK+D     VD +G N+F+F F+  +DR  + R GP
Subjt:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKID-NGLQVDRLGRNLFIFRFSAEADRVLVFRQGP

Query:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW
        WTF+R L+++  P+   +P D  F +V  W+H F+L L   N+ MA RLGNA+G+F+DV+S  N   WG+ LR++VR D+ + + RGI++  DGP+ G W
Subjt:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW

Query:  VPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTG
        +PI+YERLP+   HCG + H+ +DC+         S   QY  WLRF G
Subjt:  VPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTG

A0A6J1DU55 uncharacterized protein LOC1110231354.2e-6343.96Show/hide
Query:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPW
        MD  +L+ DW +  LT++E+EI++  D +AV+     L + L+GKLL  R ++A+V+ R    AWK+++ L V+ +G+NLF+F F  E D   V + GPW
Subjt:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPW

Query:  TFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWV
         F++ L+VL  P      S+  F+ V FWIH+F+LP+ W N+ MA RLGNA+G F DVD       WGASLRI+V ID+ + +RRGI+I  DGP+ G W+
Subjt:  TFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWV

Query:  PIRYERLPELCSHCGIIGHLSRDC-ARVLRMEREFSAPPQYEDWLRFTGKSMITASVAVRGEMQVRDQEISHS
        PI+YERLP+ C  CG+IGH S DC AR L  + +  A  +Y  WLRF G S   A    +G+   R+     S
Subjt:  PIRYERLPELCSHCGIIGHLSRDC-ARVLRMEREFSAPPQYEDWLRFTGKSMITASVAVRGEMQVRDQEISHS

A0A6J1DX30 uncharacterized protein LOC1110248749.1e-5038.96Show/hide
Query:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKID-NGLQVDRLGRNLFIFRFSAEADRVLVFRQGP
        M +  L+++W    LT++EEE ++  D  A   TG  L   L+GKL   RP+   VM+   R AWK++ N  +V  LG NLF+F F+   DR  +++ GP
Subjt:  MDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKID-NGLQVDRLGRNLFIFRFSAEADRVLVFRQGP

Query:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW
        WTF+R L+++  P+  I PS+  F+ +  W+  F+LPL    R MA RLGNALG F++ D  +    WG++LR++V +D+++ +RRGI++  DGP+ G W
Subjt:  WTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLW

Query:  VPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTG
        +PI+YERLP+ C HCG+     +                QY  WLR+ G
Subjt:  VPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTG

A0A6P6S3G2 uncharacterized protein LOC1136872936.8e-3733.59Show/hide
Query:  LSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPWTFERFLLVLVFP
        L L  DEEE+ ++   +A +        C+LGKL   +    E +    +  W    GL    LG NLF+F+F+   D+  VF  GPW F+  LLV+   
Subjt:  LSLTTDEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPWTFERFLLVLVFP

Query:  IRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRYERLPELCS
        I  ++ ++    +  FW+ V+ LPL W N   AE LGN LG+++  + R +   WG  LRI+V+I L   ++R + ++ +G +    V  +YERLP LC 
Subjt:  IRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRYERLPELCS

Query:  HCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTGKSMITASVAVRGEMQVRDQE
        +CG IGH  RDC   L      +  PQY  WLR   +   +     R  + + D E
Subjt:  HCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTGKSMITASVAVRGEMQVRDQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding3.2e-1524.76Show/hide
Query:  DEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPW-TFERFLLVLVFPIRGI
        ++EE  +   +E +E        C++ K+L  + +   V+ R  R  WK    + V  L R  F+ RF  E + +     GPW     +LLV  +  R  
Subjt:  DEEEISVVADQEAVERTGIFLGFCLLGKLLCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPW-TFERFLLVLVFPIRGI

Query:  RPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRYERLPELCSHCGI
         P      +   W+ +  +P ++++R +   +   LG    VD        G   R+ + ++L + ++  + I  D         + YE L ++CS CGI
Subjt:  RPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRYERLPELCSHCGI

Query:  IGHLSRDCAR
         GHL   C R
Subjt:  IGHLSRDCAR

AT3G31430.1 unknown protein1.8e-1027.34Show/hide
Query:  FIFRFSAEADRVLVFRQGPWTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLN
        F F F+ E     V R+GPW F  ++++L    +   P    F  + FW+ +  +P  + NR + E +G ALG   D D     +      R+ +  D+ 
Subjt:  FIFRFSAEADRVLVFRQGPWTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQDVDSRNNFLFWGASLRIKVRIDLN

Query:  RSVRRGIRIYPDGPLSGLWVPIRYERLPELCSHCGIIGH
          +R          ++ L +  RYERL   C  CG++ H
Subjt:  RSVRRGIRIYPDGPLSGLWVPIRYERLPELCSHCGIIGH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATGCGACGTAATTCTAAAATCTTACAGCCGTTTCCAAATCTGCCGGATTACAACGGCGGATGTTCAGCCTTCGCGCTGCTATATCAACGGGAGAGTGGAAGCAC
GGAAACATCACCTCTTTCGACGGTATTTTCTTCTTCCTTACTATACTGTTGTTTTCAATACTTTGTTCTGTGTCTTGGAAACATGGATTCCTCCTCTTTGGTCGATGATT
GGTCGCGGTTGAGTCTCACTACGGATGAGGAGGAAATCTCGGTGGTGGCGGATCAAGAAGCGGTGGAGCGGACGGGTATCTTCCTAGGGTTCTGTCTCCTGGGTAAGTTG
TTGTGTCATAGACCACTGGCGGCTGAAGTGATGCGACGAAATTTTCGTGCGGCGTGGAAGATCGACAACGGGTTGCAAGTGGATCGTTTGGGAAGAAATTTGTTTATATT
CAGATTCAGTGCTGAAGCCGATCGGGTCCTCGTGTTTCGTCAAGGTCCCTGGACTTTTGAAAGATTTCTCCTGGTTCTAGTGTTCCCAATTCGTGGAATAAGACCGTCTG
ATCACCCATTTTCATCTGTGTGTTTTTGGATCCATGTGTTCGAGTTACCTTTGGATTGGTTCAATCGAATTATGGCAGAACGTCTGGGAAATGCTTTGGGAATATTTCAA
GACGTTGACAGTCGAAATAACTTCCTATTCTGGGGAGCAAGCCTGCGAATCAAGGTAAGAATCGATCTAAATCGGTCGGTCCGTCGCGGGATTCGAATTTACCCGGATGG
CCCTCTCAGTGGTCTCTGGGTGCCGATAAGATACGAACGACTGCCGGAGCTCTGTTCTCATTGTGGGATAATTGGCCATTTATCGCGGGATTGTGCTCGAGTTTTGAGGA
TGGAACGAGAATTTAGTGCTCCTCCACAGTATGAAGATTGGCTTCGGTTCACGGGTAAAAGTATGATTACGGCTTCTGTAGCAGTAAGAGGGGAGATGCAAGTTCGTGAT
CAAGAAATTTCACATAGCCAGCGACTGGCGCTTGAGGTGGAACCGGTTGCTAATCCGATTCCGTCTGTGAACGAATTACCTTCTCGTAGATCTTCGAGTATTCGAATAAT
TGAGGCTGTCGATGGTGTCTGCCCCCAAACTCCAAAACCCATGGCGCCGTCTCATTATTCGCTCTCTGGTGATGTTCTTTCCAAATTAAAGGGCAAGGAGAAGGTGACTG
AAGTGGAGTTGGTGACCAACGGTACTGCCTCTAAATCGGGGAAGCGGCGGACGGACAATCCAGAATGGCTGCCCGAAAGTTCGAGTCGGCGACCGTTACTCGGAGCTGAT
CATGTGGGCCGGCAGTCGTTGGGTCTTGGAAGCGTTTCAGGAGAAATTAAAGGGCCATTATTGCAGCCGATGGGACCTCTTGTCGGTTTTGGGATTGATAGGTATAAAAA
GGAATTGAAGGCTAGTAACTACCGGTTATCCAAATCATTATTATCAATTTTTAATGCTGCGGAAAATCATCCTGTTATTAATAAATCAAGCAATCTGGAAGGGGTTATTG
AAGATGATGGCGTGGAAAGTGATCCGGAAATGGTGGAGGAAGTGGGCTTTGAATCTCAAAATGGTGGCACATGGGCCCACCAAAATGCTCAAGAGGGCTTGCATCCAGAC
ATCAATGGCGAAATCCAGAGTGAGCACAATCAAGATAAGAATGACCAGATAGGAGGAGAAGTCAGTGACGCAGGCAGCTTTGTTTTTTCGTCGAAAAAACTCCCATCAAA
ACACAAAGGATCAATGTGGAAGAAGAGAGCCCGGGCGGGATTTGTTCCCCAAGGAGTGAATGTGGAAGCTGTGGAAGAAGCACAAAAGAGGAAGGATGGACCTTTGTTAT
TCTCTCCTGGGAATATTAAGTGTCCTAAGGTTGATGATGATAAACAGGCGGGGACTGCAGAGCAGCCTCGCCCAGAATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGATGATGCGACGTAATTCTAAAATCTTACAGCCGTTTCCAAATCTGCCGGATTACAACGGCGGATGTTCAGCCTTCGCGCTGCTATATCAACGGGAGAGTGGAAGCAC
GGAAACATCACCTCTTTCGACGGTATTTTCTTCTTCCTTACTATACTGTTGTTTTCAATACTTTGTTCTGTGTCTTGGAAACATGGATTCCTCCTCTTTGGTCGATGATT
GGTCGCGGTTGAGTCTCACTACGGATGAGGAGGAAATCTCGGTGGTGGCGGATCAAGAAGCGGTGGAGCGGACGGGTATCTTCCTAGGGTTCTGTCTCCTGGGTAAGTTG
TTGTGTCATAGACCACTGGCGGCTGAAGTGATGCGACGAAATTTTCGTGCGGCGTGGAAGATCGACAACGGGTTGCAAGTGGATCGTTTGGGAAGAAATTTGTTTATATT
CAGATTCAGTGCTGAAGCCGATCGGGTCCTCGTGTTTCGTCAAGGTCCCTGGACTTTTGAAAGATTTCTCCTGGTTCTAGTGTTCCCAATTCGTGGAATAAGACCGTCTG
ATCACCCATTTTCATCTGTGTGTTTTTGGATCCATGTGTTCGAGTTACCTTTGGATTGGTTCAATCGAATTATGGCAGAACGTCTGGGAAATGCTTTGGGAATATTTCAA
GACGTTGACAGTCGAAATAACTTCCTATTCTGGGGAGCAAGCCTGCGAATCAAGGTAAGAATCGATCTAAATCGGTCGGTCCGTCGCGGGATTCGAATTTACCCGGATGG
CCCTCTCAGTGGTCTCTGGGTGCCGATAAGATACGAACGACTGCCGGAGCTCTGTTCTCATTGTGGGATAATTGGCCATTTATCGCGGGATTGTGCTCGAGTTTTGAGGA
TGGAACGAGAATTTAGTGCTCCTCCACAGTATGAAGATTGGCTTCGGTTCACGGGTAAAAGTATGATTACGGCTTCTGTAGCAGTAAGAGGGGAGATGCAAGTTCGTGAT
CAAGAAATTTCACATAGCCAGCGACTGGCGCTTGAGGTGGAACCGGTTGCTAATCCGATTCCGTCTGTGAACGAATTACCTTCTCGTAGATCTTCGAGTATTCGAATAAT
TGAGGCTGTCGATGGTGTCTGCCCCCAAACTCCAAAACCCATGGCGCCGTCTCATTATTCGCTCTCTGGTGATGTTCTTTCCAAATTAAAGGGCAAGGAGAAGGTGACTG
AAGTGGAGTTGGTGACCAACGGTACTGCCTCTAAATCGGGGAAGCGGCGGACGGACAATCCAGAATGGCTGCCCGAAAGTTCGAGTCGGCGACCGTTACTCGGAGCTGAT
CATGTGGGCCGGCAGTCGTTGGGTCTTGGAAGCGTTTCAGGAGAAATTAAAGGGCCATTATTGCAGCCGATGGGACCTCTTGTCGGTTTTGGGATTGATAGGTATAAAAA
GGAATTGAAGGCTAGTAACTACCGGTTATCCAAATCATTATTATCAATTTTTAATGCTGCGGAAAATCATCCTGTTATTAATAAATCAAGCAATCTGGAAGGGGTTATTG
AAGATGATGGCGTGGAAAGTGATCCGGAAATGGTGGAGGAAGTGGGCTTTGAATCTCAAAATGGTGGCACATGGGCCCACCAAAATGCTCAAGAGGGCTTGCATCCAGAC
ATCAATGGCGAAATCCAGAGTGAGCACAATCAAGATAAGAATGACCAGATAGGAGGAGAAGTCAGTGACGCAGGCAGCTTTGTTTTTTCGTCGAAAAAACTCCCATCAAA
ACACAAAGGATCAATGTGGAAGAAGAGAGCCCGGGCGGGATTTGTTCCCCAAGGAGTGAATGTGGAAGCTGTGGAAGAAGCACAAAAGAGGAAGGATGGACCTTTGTTAT
TCTCTCCTGGGAATATTAAGTGTCCTAAGGTTGATGATGATAAACAGGCGGGGACTGCAGAGCAGCCTCGCCCAGAATCATGA
Protein sequenceShow/hide protein sequence
MMMRRNSKILQPFPNLPDYNGGCSAFALLYQRESGSTETSPLSTVFSSSLLYCCFQYFVLCLGNMDSSSLVDDWSRLSLTTDEEEISVVADQEAVERTGIFLGFCLLGKL
LCHRPLAAEVMRRNFRAAWKIDNGLQVDRLGRNLFIFRFSAEADRVLVFRQGPWTFERFLLVLVFPIRGIRPSDHPFSSVCFWIHVFELPLDWFNRIMAERLGNALGIFQ
DVDSRNNFLFWGASLRIKVRIDLNRSVRRGIRIYPDGPLSGLWVPIRYERLPELCSHCGIIGHLSRDCARVLRMEREFSAPPQYEDWLRFTGKSMITASVAVRGEMQVRD
QEISHSQRLALEVEPVANPIPSVNELPSRRSSSIRIIEAVDGVCPQTPKPMAPSHYSLSGDVLSKLKGKEKVTEVELVTNGTASKSGKRRTDNPEWLPESSSRRPLLGAD
HVGRQSLGLGSVSGEIKGPLLQPMGPLVGFGIDRYKKELKASNYRLSKSLLSIFNAAENHPVINKSSNLEGVIEDDGVESDPEMVEEVGFESQNGGTWAHQNAQEGLHPD
INGEIQSEHNQDKNDQIGGEVSDAGSFVFSSKKLPSKHKGSMWKKRARAGFVPQGVNVEAVEEAQKRKDGPLLFSPGNIKCPKVDDDKQAGTAEQPRPES