; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018857 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018857
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr5:35475488..35478299
RNA-Seq ExpressionLag0018857
SyntenyLag0018857
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO61345.1 reverse transcriptase [Corchorus capsularis]4.0e-4229.45Show/hide
Query:  LSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTF
        L+++E+  +      + E+  +  NCL+GK++S+R +N++  R  M  VW    G+ +  +G+NLF+   +S   ++R+ +++PW F+  L++LK+   +
Subjt:  LSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTF

Query:  DLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGM
        D  EDI  D  +FW Q H+LPLG  N+S+ +++G   GTV  IDT      WG+FLR R ++ + +PL RG+ +      KI+++ +YE+LPDFCY CG 
Subjt:  DLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGM

Query:  IDHNDGSCP----LLKTQVKPEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGR----------GRFGNKWWTKQSWRSEPMADERDSGHDFDHDEDVDS
        + H +  C     + + + K  K++G W+RA   PR   V   G G    R GR           R G     ++      +   RD+    D  +D D 
Subjt:  IDHNDGSCP----LLKTQVKPEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGR----------GRFGNKWWTKQSWRSEPMADERDSGHDFDHDEDVDS

Query:  VDSQNLKGRSGEQQAETDHRTATATGVDNLSPLHSKLQTKKTG
        + + N +  +G+       +        + SP+ +K +    G
Subjt:  VDSQNLKGRSGEQQAETDHRTATATGVDNLSPLHSKLQTKKTG

OMP03234.1 hypothetical protein COLO4_10561 [Corchorus olitorius]2.0e-4133.07Show/hide
Query:  LSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTF
        L+++E+  +  +   ++ES  + + CL+GK++S+R +NID  R  +  VW    G+ +  +G+ L++   +S   ++R+ ++ PW F+  L++LK    F
Subjt:  LSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTF

Query:  DLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGM
           E+I  +   FW+Q H+LPLG   +SV + +G   G V+ IDT      WG+FLR+R  + +++PL RG+ +      KI+V+ +YE+LPDFCY CG 
Subjt:  DLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGM

Query:  IDHNDGSCP----LLKTQVKPEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGR
        ++H +  C     + +   K +K++G W+RA   PR N +   G G    R G+
Subjt:  IDHNDGSCP----LLKTQVKPEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGR

TXG60685.1 hypothetical protein EZV62_015258 [Acer yangbiense]3.1e-3940.44Show/hide
Query:  SESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQ
        SE    + +CLVGKV++ + +N +AF+K +  +W     + I+ VGDN F+    S+  R  I    PW FD NLI+L+ PT       + F    FWVQ
Subjt:  SESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQ

Query:  FHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAI-KYERLPDFCYFCGMIDHNDGSCP----LL
         HNLPL   N  +A+ L  +IGTV+ + T+   ECWGRF+RV++++ I +PL R L +N   S ++  AI  YERLP+FCY CG+I H    CP     L
Subjt:  FHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAI-KYERLPDFCYFCGMIDHNDGSCP----LL

Query:  KTQVKPEKQFGSWMRAATPPR-KNR
        +       ++GSW+RAA+  + KNR
Subjt:  KTQVKPEKQFGSWMRAATPPR-KNR

XP_038708494.1 uncharacterized protein LOC120003548 [Tripterygium wilfordii]3.1e-3934.15Show/hide
Query:  DVIEKLSTLKLSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGN
        D+ ++   L L+ +ES  +  +    +  ++  +  L G+++S R  N +AF  TM N+W   RG+    VG+NL+L+   S    +++++ SPW FD +
Subjt:  DVIEKLSTLKLSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGN

Query:  LILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGP-SRKIMVAIKYE
         +LLK       P D++F+E   WV+F+NLP  + N+ V   LG+ +G + ++D +     WG++LRVRI++ I +PL R + + G      I V ++YE
Subjt:  LILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGP-SRKIMVAIKYE

Query:  RLPDFCYFCGMIDHNDGSCPLLKTQVKPEKQFGSWMRAATPPRKNR
        RLP+FCY CG + H D  C + +T+    K +G W+R ++PPR  R
Subjt:  RLPDFCYFCGMIDHNDGSCPLLKTQVKPEKQFGSWMRAATPPRKNR

XP_040990949.1 uncharacterized protein LOC121238174 [Juglans microcarpa x Juglans regia]1.1e-3935.66Show/hide
Query:  DVIEKLSTLKLSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGN
        D+ E  +   L+++E   I  ++  +  +  + + CLVG+V++ + IN +AF+ TM  VW  C G  I  VGDNL++   Q+ G  +RI    PW FD N
Subjt:  DVIEKLSTLKLSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGN

Query:  LILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYER
        L+ LK       P ++ F+    WVQ HNLPLG  N  +   +GS IG V+ ++ +    CWG+++RV+I+M + +PL RG  +     +K  +  KYER
Subjt:  LILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYER

Query:  LPDFCYFCGMIDHNDGSCPLLKTQVKPEK-----QFGSWMRAAT
        LP FC+ CG I H    C  +    + +K     Q+GSW+RA++
Subjt:  LPDFCYFCGMIDHNDGSCPLLKTQVKPEK-----QFGSWMRAAT

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase1.9e-4229.45Show/hide
Query:  LSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTF
        L+++E+  +      + E+  +  NCL+GK++S+R +N++  R  M  VW    G+ +  +G+NLF+   +S   ++R+ +++PW F+  L++LK+   +
Subjt:  LSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTF

Query:  DLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGM
        D  EDI  D  +FW Q H+LPLG  N+S+ +++G   GTV  IDT      WG+FLR R ++ + +PL RG+ +      KI+++ +YE+LPDFCY CG 
Subjt:  DLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGM

Query:  IDHNDGSCP----LLKTQVKPEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGR----------GRFGNKWWTKQSWRSEPMADERDSGHDFDHDEDVDS
        + H +  C     + + + K  K++G W+RA   PR   V   G G    R GR           R G     ++      +   RD+    D  +D D 
Subjt:  IDHNDGSCP----LLKTQVKPEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGR----------GRFGNKWWTKQSWRSEPMADERDSGHDFDHDEDVDS

Query:  VDSQNLKGRSGEQQAETDHRTATATGVDNLSPLHSKLQTKKTG
        + + N +  +G+       +        + SP+ +K +    G
Subjt:  VDSQNLKGRSGEQQAETDHRTATATGVDNLSPLHSKLQTKKTG

A0A1R3K847 Uncharacterized protein9.5e-4233.07Show/hide
Query:  LSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTF
        L+++E+  +  +   ++ES  + + CL+GK++S+R +NID  R  +  VW    G+ +  +G+ L++   +S   ++R+ ++ PW F+  L++LK    F
Subjt:  LSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTF

Query:  DLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGM
           E+I  +   FW+Q H+LPLG   +SV + +G   G V+ IDT      WG+FLR+R  + +++PL RG+ +      KI+V+ +YE+LPDFCY CG 
Subjt:  DLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGM

Query:  IDHNDGSCP----LLKTQVKPEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGR
        ++H +  C     + +   K +K++G W+RA   PR N +   G G    R G+
Subjt:  IDHNDGSCP----LLKTQVKPEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGR

A0A2N9F689 Uncharacterized protein2.6e-3931.01Show/hide
Query:  NMDVIEKLSTLKLSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVW--YFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWC
        N+DVI  L  L L++ E+  I   E     SQ K + CL+ K+++QR  N    + TMTN+W     RG+   ++GDNLFL        ++ ++  SPWC
Subjt:  NMDVIEKLSTLKLSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVW--YFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWC

Query:  FDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAI
        FD  L++LK         +I  DE  FWVQ ++LP+      V + +G  +GTV  ++       WGRFLRVR+++ + +PL R   +  G    I+V  
Subjt:  FDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAI

Query:  KYERLPDFCYFCGMIDHNDGSCPL-LKTQVK-----PEKQFGSWMRAATPPRK--------NRVYASGRGFDGYRGGRGRFGNKWWTKQ--------SWR
        +YERLP+FC++CG +DH +  C + L+ + K      E Q+G W+RA     K        N V A+  G    R        K  T Q           
Subjt:  KYERLPDFCYFCGMIDHNDGSCPL-LKTQVK-----PEKQFGSWMRAATPPRK--------NRVYASGRGFDGYRGGRGRFGNKWWTKQ--------SWR

Query:  SEPMADERDSGHDFDHDEDVDSVDSQNLKGRSGEQQAETDHRTATATGVDNLSPLHSKLQTKKTGIKCPVSLGNANIT----FSFQK
         E   +E++ G       DV+ VDS   K   G++++ T   T+++ G   +     ++Q    G++    LG   +T    FS QK
Subjt:  SEPMADERDSGHDFDHDEDVDSVDSQNLKGRSGEQQAETDHRTATATGVDNLSPLHSKLQTKKTGIKCPVSLGNANIT----FSFQK

A0A5C7HUW5 CCHC-type domain-containing protein1.5e-3940.44Show/hide
Query:  SESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQ
        SE    + +CLVGKV++ + +N +AF+K +  +W     + I+ VGDN F+    S+  R  I    PW FD NLI+L+ PT       + F    FWVQ
Subjt:  SESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQ

Query:  FHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAI-KYERLPDFCYFCGMIDHNDGSCP----LL
         HNLPL   N  +A+ L  +IGTV+ + T+   ECWGRF+RV++++ I +PL R L +N   S ++  AI  YERLP+FCY CG+I H    CP     L
Subjt:  FHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAI-KYERLPDFCYFCGMIDHNDGSCP----LL

Query:  KTQVKPEKQFGSWMRAATPPR-KNR
        +       ++GSW+RAA+  + KNR
Subjt:  KTQVKPEKQFGSWMRAATPPR-KNR

A0A6J5TXP7 CCHC-type domain-containing protein3.7e-3831.66Show/hide
Query:  LKLSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPT
        L L+ KE  G++      +    +L  CLVG V++ +  N +AF++TM   W   R + + ++ DNLFL    ++  ++ ++   PW FD  L+LL+ P 
Subjt:  LKLSQKESSGINSSELGLSESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPT

Query:  TFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFC
            P  ++    +FWVQ HNLPL     +  + +G+R+G  + +      EC G++L +R+++ + +PL R + +    S K ++  KYERLPDFCY C
Subjt:  TFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFC

Query:  GMIDHNDGSCPLLKTQVK--PEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGRGRFGN
        G I H    C  +    K   EK +GSW+ +     + R    G+  +G R  +  FGN
Subjt:  GMIDHNDGSCPLLKTQVK--PEKQFGSWMRAATPPRKNRVYASGRGFDGYRGGRGRFGN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding3.1e-1325.67Show/hide
Query:  QNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKA-PTTFDLPEDIIFDEPNFWVQFHNLPL
        + C++ KV+  + I I    + +  +W     +++ ++    F+I  +        +   PW   GN +L++   + FD   D I   P  WV+  N+P 
Subjt:  QNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKA-PTTFDLPEDIIFDEPNFWVQFHNLPL

Query:  GLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGMIDHNDGSCP
           +  +   +   +G  L++D N +N   GRF RV I++ + +PL   + ING         + YE L   C  CG+  H   SCP
Subjt:  GLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGMIDHNDGSCP

AT3G31430.1 unknown protein2.7e-1229.55Show/hide
Query:  DRIVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTING
        + ++   PW F+  +ILL+       P+  +F    FWVQ   +P    N  V + +G  +G VL  D N        F RV +   I  PL        
Subjt:  DRIVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTING

Query:  GPSRKIMVAIKYERLPDFCYFCGMIDHNDGSC
              ++  +YERL  FC  CGM+ H+ G+C
Subjt:  GPSRKIMVAIKYERLPDFCYFCGMIDHNDGSC

AT3G42140.1 zinc ion binding;nucleic acid binding2.2e-0624.43Show/hide
Query:  IVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGP
        I+   PW F+  + +++  T   L  D  F    FW+Q   +PL      +   +G R+G  L ++TN                     L R +++    
Subjt:  IVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGP

Query:  SRKIMVAIKYERLPDFCYFCGMIDHNDGSCP
             +  +YE+L +FC  CGM+ H+   CP
Subjt:  SRKIMVAIKYERLPDFCYFCGMIDHNDGSCP

AT5G36228.1 nucleic acid binding;zinc ion binding2.0e-1224.11Show/hide
Query:  VEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPS
        +  +PW F+   I L+    F   + + F +   WV    +PL   ++   +++ S +G V+ +D NE       F+RV+++M   EPL     +     
Subjt:  VEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRNDSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPS

Query:  RKIMVAIKYERLPDFCYFCGMIDHNDGSCPLLKTQVKPEKQ
         + M+  +YE+L   C  C  ++H    CP +  Q + + +
Subjt:  RKIMVAIKYERLPDFCYFCGMIDHNDGSCPLLKTQVKPEKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAAGAGGCCCACGCCCCCCTTCGCTCTTCTCCTCGGCCTCGACCCATTGCCGAGGCTGAGGAAGAGGTCGGCCTCGACCCAATGCCGAGGGCGACCAAGGGCAT
GGGCCTAAGCCCGTTGTGGATCGACCCATCGTTGTCTCTCAGAGCTCTAAGTTTCTACTGTTCTCCGATCAGAGGAGTTTCTCTCAGGCTGGTTTTTGCAGCCTGCCTTC
TTCCTTCCTCTGAAGGATTATTGCCAAATATGGATGTAATCGAAAAGCTCTCCACGCTCAAACTTTCACAGAAGGAATCTAGTGGAATAAACTCATCGGAGTTAGGCTTG
TCTGAAAGCCAGACTAAGCTCCAAAATTGTTTGGTAGGCAAGGTTGTATCTCAAAGAATTATCAACATTGATGCTTTTCGGAAAACTATGACAAACGTATGGTATTTCTG
CCGAGGGATTTCCATCGATAATGTTGGGGACAACCTCTTCCTTATCAATCCACAATCCATCGGGGGAAGAGACAGAATCGTGGAAGAAAGTCCATGGTGTTTTGACGGTA
ATTTAATACTACTCAAAGCCCCAACAACTTTTGATTTACCGGAAGATATAATTTTTGATGAACCAAATTTTTGGGTTCAATTCCACAATCTTCCGCTTGGATTACGAAAC
GACTCAGTAGCACAAATGCTTGGAAGCCGAATAGGGACTGTTCTGAGAATTGATACGAACGAATTGAATGAATGTTGGGGTCGTTTTCTTCGAGTACGGATAAAGATGAA
GATTGACGAGCCTTTATGTAGGGGATTAACTATCAATGGCGGACCATCGAGAAAAATCATGGTGGCAATTAAATACGAGAGATTACCAGATTTCTGTTATTTTTGCGGAA
TGATTGACCACAACGATGGTTCATGCCCACTGCTAAAAACACAGGTGAAACCAGAAAAACAGTTTGGATCATGGATGCGAGCGGCTACTCCTCCAAGGAAAAATCGTGTT
TACGCAAGCGGCCGAGGTTTCGATGGGTACCGAGGGGGCAGAGGACGTTTTGGAAACAAATGGTGGACCAAACAATCGTGGCGATCTGAACCAATGGCGGATGAACGAGA
CTCCGGCCACGATTTCGATCACGATGAAGACGTTGATTCAGTCGATTCACAAAACTTGAAAGGACGCAGTGGCGAACAACAAGCTGAAACCGATCACCGGACGGCCACTG
CAACCGGCGTTGACAACCTGTCTCCTCTGCATTCAAAGCTACAAACGAAAAAGACGGGTATTAAATGTCCAGTATCCTTAGGAAACGCTAATATCACGTTTTCTTTCCAA
AAAAATGCAGATGGCGTTTTTCTCGGAAATGAGGGTGATGGAATAAATGATTGCAACGACCAGAGATCCCTAAAAACCAGCGAAATCATAGAGGGTGAAGGGAAAGTTAG
CGGCGGCAAGCTATTAGGAGATTTAGGGCAAATGGTTACTGATGCTCATGAAAATCAACTTGGGCCCATTGCAAAGGATAATACCATCTCAGCCCACATAACCACTTGCC
CAGATCAGTCCAGTGAGCAGCATTTAAATGTGAACCTTGAATCTATTAATGGGCCAACTAATTCAAGCCCAATTTCCTCTAATGATCAGCAGCAGCGGACGAAACCACAT
GACCCAATTTCCCACGAAGACACTAGAATGGTCGAAATCTGGCCCGATTTTCAGTCTAGTCAAACGGAAATTTTCATGACTAACGATAAACAGAAGCAGCCCACAAGCGT
TGTCTCAGAAAGTGGACAAAAGGGAAAGCAGGAAGAAGACATTTATAGTTTCAAGCCGTTAAAAGGTTGGAAAAGAAGAAATAGGCAGGAAGAACATATGGAGGCTGACC
AATCCACATCTTACACGAAATTGTTTAGGAAAAAAAGAGCTGCAGAGGATGAGAATTTGACAGGAAGCTCAGACAAGAAATTCAAATCTCAAATTCCACTAAACAATGAA
ATTCTATCGGCGGAGCCTGATCAACAGGCCCGCCGGGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAAGAGGCCCACGCCCCCCTTCGCTCTTCTCCTCGGCCTCGACCCATTGCCGAGGCTGAGGAAGAGGTCGGCCTCGACCCAATGCCGAGGGCGACCAAGGGCAT
GGGCCTAAGCCCGTTGTGGATCGACCCATCGTTGTCTCTCAGAGCTCTAAGTTTCTACTGTTCTCCGATCAGAGGAGTTTCTCTCAGGCTGGTTTTTGCAGCCTGCCTTC
TTCCTTCCTCTGAAGGATTATTGCCAAATATGGATGTAATCGAAAAGCTCTCCACGCTCAAACTTTCACAGAAGGAATCTAGTGGAATAAACTCATCGGAGTTAGGCTTG
TCTGAAAGCCAGACTAAGCTCCAAAATTGTTTGGTAGGCAAGGTTGTATCTCAAAGAATTATCAACATTGATGCTTTTCGGAAAACTATGACAAACGTATGGTATTTCTG
CCGAGGGATTTCCATCGATAATGTTGGGGACAACCTCTTCCTTATCAATCCACAATCCATCGGGGGAAGAGACAGAATCGTGGAAGAAAGTCCATGGTGTTTTGACGGTA
ATTTAATACTACTCAAAGCCCCAACAACTTTTGATTTACCGGAAGATATAATTTTTGATGAACCAAATTTTTGGGTTCAATTCCACAATCTTCCGCTTGGATTACGAAAC
GACTCAGTAGCACAAATGCTTGGAAGCCGAATAGGGACTGTTCTGAGAATTGATACGAACGAATTGAATGAATGTTGGGGTCGTTTTCTTCGAGTACGGATAAAGATGAA
GATTGACGAGCCTTTATGTAGGGGATTAACTATCAATGGCGGACCATCGAGAAAAATCATGGTGGCAATTAAATACGAGAGATTACCAGATTTCTGTTATTTTTGCGGAA
TGATTGACCACAACGATGGTTCATGCCCACTGCTAAAAACACAGGTGAAACCAGAAAAACAGTTTGGATCATGGATGCGAGCGGCTACTCCTCCAAGGAAAAATCGTGTT
TACGCAAGCGGCCGAGGTTTCGATGGGTACCGAGGGGGCAGAGGACGTTTTGGAAACAAATGGTGGACCAAACAATCGTGGCGATCTGAACCAATGGCGGATGAACGAGA
CTCCGGCCACGATTTCGATCACGATGAAGACGTTGATTCAGTCGATTCACAAAACTTGAAAGGACGCAGTGGCGAACAACAAGCTGAAACCGATCACCGGACGGCCACTG
CAACCGGCGTTGACAACCTGTCTCCTCTGCATTCAAAGCTACAAACGAAAAAGACGGGTATTAAATGTCCAGTATCCTTAGGAAACGCTAATATCACGTTTTCTTTCCAA
AAAAATGCAGATGGCGTTTTTCTCGGAAATGAGGGTGATGGAATAAATGATTGCAACGACCAGAGATCCCTAAAAACCAGCGAAATCATAGAGGGTGAAGGGAAAGTTAG
CGGCGGCAAGCTATTAGGAGATTTAGGGCAAATGGTTACTGATGCTCATGAAAATCAACTTGGGCCCATTGCAAAGGATAATACCATCTCAGCCCACATAACCACTTGCC
CAGATCAGTCCAGTGAGCAGCATTTAAATGTGAACCTTGAATCTATTAATGGGCCAACTAATTCAAGCCCAATTTCCTCTAATGATCAGCAGCAGCGGACGAAACCACAT
GACCCAATTTCCCACGAAGACACTAGAATGGTCGAAATCTGGCCCGATTTTCAGTCTAGTCAAACGGAAATTTTCATGACTAACGATAAACAGAAGCAGCCCACAAGCGT
TGTCTCAGAAAGTGGACAAAAGGGAAAGCAGGAAGAAGACATTTATAGTTTCAAGCCGTTAAAAGGTTGGAAAAGAAGAAATAGGCAGGAAGAACATATGGAGGCTGACC
AATCCACATCTTACACGAAATTGTTTAGGAAAAAAAGAGCTGCAGAGGATGAGAATTTGACAGGAAGCTCAGACAAGAAATTCAAATCTCAAATTCCACTAAACAATGAA
ATTCTATCGGCGGAGCCTGATCAACAGGCCCGCCGGGAGCCATGA
Protein sequenceShow/hide protein sequence
MKKEAHAPLRSSPRPRPIAEAEEEVGLDPMPRATKGMGLSPLWIDPSLSLRALSFYCSPIRGVSLRLVFAACLLPSSEGLLPNMDVIEKLSTLKLSQKESSGINSSELGL
SESQTKLQNCLVGKVVSQRIINIDAFRKTMTNVWYFCRGISIDNVGDNLFLINPQSIGGRDRIVEESPWCFDGNLILLKAPTTFDLPEDIIFDEPNFWVQFHNLPLGLRN
DSVAQMLGSRIGTVLRIDTNELNECWGRFLRVRIKMKIDEPLCRGLTINGGPSRKIMVAIKYERLPDFCYFCGMIDHNDGSCPLLKTQVKPEKQFGSWMRAATPPRKNRV
YASGRGFDGYRGGRGRFGNKWWTKQSWRSEPMADERDSGHDFDHDEDVDSVDSQNLKGRSGEQQAETDHRTATATGVDNLSPLHSKLQTKKTGIKCPVSLGNANITFSFQ
KNADGVFLGNEGDGINDCNDQRSLKTSEIIEGEGKVSGGKLLGDLGQMVTDAHENQLGPIAKDNTISAHITTCPDQSSEQHLNVNLESINGPTNSSPISSNDQQQRTKPH
DPISHEDTRMVEIWPDFQSSQTEIFMTNDKQKQPTSVVSESGQKGKQEEDIYSFKPLKGWKRRNRQEEHMEADQSTSYTKLFRKKRAAEDENLTGSSDKKFKSQIPLNNE
ILSAEPDQQARREP