CuGenDBv2

Lcy04g014630 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy04g014630
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: CCHC-type domain-containing protein
Genome location: Chr04:46956266..46961344
RNA-Seq Expression: Lcy04g014630
Synteny: Lcy04g014630
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR025558 - Domain of unknown function DUF4283
    IPR025836 - Zinc knuckle CX2CX4HX4C
    IPR036875 - Zinc finger, CCHC-type superfamily
    IPR040256 - Uncharacterized protein At4g02000-like
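The genome location in this record follows a `seqid:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the genomic span of the gene could be computed as below. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Chr04:46956266..46961344")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus covers 5079 bp on Chr04.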


Homology
GenBank top hits (e value, % identity, alignment)
PQQ10307.1 uncharacterized protein Pyn_17609 [Prunus yedoensis var. nudiflora] (e value: 3.5e-40; % identity: 32.44)
Query:  MEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFF
        MEE+I   ++  ++ E+++    ++  + +++G +   SLVGK+ +T   ++ A K +M+  W T ++  V+ +G N+FLFIF ++ DR  ++ +GPW F
Subjt:  MEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFF

Query:  DRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIR
        D++L++LE P+ +     +  K+ D W+++ N+P+C     + ++IG+ +G  L+     +   LG  +R+RV L+++KPL RG  + L   +  +V  R
Subjt:  DRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIR

Query:  YERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPRKENNK
        YERLPEFCF CG +GH+ K+C           E+  +G+W+K       TK+  S R  N+K
Subjt:  YERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPRKENNK

TXG66887.1 hypothetical protein EZV62_008162 [Acer yangbiense] (e value: 1.0e-39; % identity: 34.36)
Query:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPW
        M+ EE+        L +++ + ++++     +  G +    LVGK+++ + +++ A +  M   W+T  D  VE++ +N FLF F +Q DR WI++ GPW
Subjt:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPW

Query:  FFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVT
         FD  LLVLE P+  +  + L F  V  WL++ N P+    +++ + IG  IGE  + D        G  +RL+V ++++KPL+R   + L+   E  + 
Subjt:  FFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVT

Query:  IRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPR
        +RYE+LPE+CF+CG+IGH  +DC  RR E  GG ++FE+G WM+     G+    GS R
Subjt:  IRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] (e value: 1.1e-43; % identity: 38.4)
Query:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTR-KDFNVEIIGNNIFLFIFESQEDRDWIITNGP
        M    L+ EWK FKL  EE      +D+      G     SL+ KL+S R IS   +KN++  AWK   K F+V+IIG NIFLF F    DR+ I+  GP
Subjt:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTR-KDFNVEIIGNNIFLFIFESQEDRDWIITNGP

Query:  WFFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLD-DSSEIW
        W FDR+L++++ P S  + +D++F+ V +W+  F+L +   N+ +A ++G+ IG F + ++N N    G+ +R+RVR ++ KPL RG  + LD      W
Subjt:  WFFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLD-DSSEIW

Query:  VTIRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGY
        + I+YERLP+F ++CG + HI+KDC       D   +  ++G W++FQG+
Subjt:  VTIRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGY

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] (e value: 1.1e-46; % identity: 36.77)
Query:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPW
        M+ E L+ +W++FKL  EE +    +D +         + SLVGKL++ R IS   +   ++ AWK      VE IG N+FLF F  + D + ++  GPW
Subjt:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPW

Query:  FFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLD-DSSEIWV
        FFD++L+VL+ P S +   +LEF  V  W+ +F+LP+ + N+ +A ++G+ IG F++ D N+     G S+R+RV ++ITKPLRRG  I +D      W+
Subjt:  FFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLD-DSSEIWV

Query:  TIRYERLPEFCFNCGVIGHIVKDCPHR-RNETDGGEEEFEFGMWMKFQGYTG---KTKNQGSPRKEN---NKTDNQVLRGTYISETDGKSSEGLDVDLNL
         I+YERLP+FC+ CGVIGH   DC  R     D      E+G W++F G      K +   SP +E+   + + N   RG  + ET  + SE  + D   
Subjt:  TIRYERLPEFCFNCGVIGHIVKDCPHR-RNETDGGEEEFEFGMWMKFQGYTG---KTKNQGSPRKEN---NKTDNQVLRGTYISETDGKSSEGLDVDLNL

Query:  VSPIAEETED
            AE+T D
Subjt:  VSPIAEETED

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] (e value: 4.5e-40; % identity: 31.4)
Query:  MEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFF
        M + + +  R   +  E+D +  +  E  + + G++   LVGKL++ R  +  AMKN+++  W+  K   V +IG+N+F+F+F    D+  ++++GPW F
Subjt:  MEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFF

Query:  DRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIR
        D+ LL+L   +   +  D++   V  W+ V NLP+   N+K+ + +G+ +G+F++ D        G ++R+RV L++ KPLRRG  + L  +  IWV  +
Subjt:  DRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIR

Query:  YERLPEFCFNCGVIGHIVKDCPHRRNETDGGE-EEFEFGMWMKFQGYTGKTKNQGSPR
        YERLP +C+ CG +GH  ++C  + +  DG   +  ++G W++        K++GS R
Subjt:  YERLPEFCFNCGVIGHIVKDCPHRRNETDGGE-EEFEFGMWMKFQGYTGKTKNQGSPR

TrEMBL top hits (e value, % identity, alignment)
A0A314YVX1 CCHC-type domain-containing protein (e value: 1.7e-40; % identity: 32.44)
Query:  MEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFF
        MEE+I   ++  ++ E+++    ++  + +++G +   SLVGK+ +T   ++ A K +M+  W T ++  V+ +G N+FLFIF ++ DR  ++ +GPW F
Subjt:  MEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFF

Query:  DRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIR
        D++L++LE P+ +     +  K+ D W+++ N+P+C     + ++IG+ +G  L+     +   LG  +R+RV L+++KPL RG  + L   +  +V  R
Subjt:  DRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIR

Query:  YERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPRKENNK
        YERLPEFCF CG +GH+ K+C           E+  +G+W+K       TK+  S R  N+K
Subjt:  YERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPRKENNK

A0A392MLA7 Zinc CCHC-type-like protein (Fragment) (e value: 8.3e-40; % identity: 32.89)
Query:  EWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFFDRSLLV
        EW++ KL  EE++ I  +++EE  E      +SLVGKL +    +    K  +  AW+ +    V+ +  N+FLF F S+ D D ++ NGPW FDR+L+V
Subjt:  EWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFFDRSLLV

Query:  LEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPE
        L+  + +++  DLE    + W R+++LP+  R+E +AKK+G  IG F+E D +K+  +LG  +R++  +++ KPL+RG +++     ++ V  ++ERLP 
Subjt:  LEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPE

Query:  FCFNCGVIGHIVKDCPHRRNETDGG-----EEEFEFGMWMK---FQGYTGKTKNQGSPRKENNKTDNQVLRGTYISETDGKSSEGLDVDLNLVSPIAEET
        FCF CG IGH +KDC    ++ D       E+E  FG W++      +TG+ K + S    +    ++       S  D +  E   V+  +VSP+   +
Subjt:  FCFNCGVIGHIVKDCPHRRNETDGG-----EEEFEFGMWMK---FQGYTGKTKNQGSPRKENNKTDNQVLRGTYISETDGKSSEGLDVDLNLVSPIAEET

Query:  EDPE
          PE
Subjt:  EDPE

A0A5C7ICE5 CCHC-type domain-containing protein (e value: 4.9e-40; % identity: 34.36)
Query:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPW
        M+ EE+        L +++ + ++++     +  G +    LVGK+++ + +++ A +  M   W+T  D  VE++ +N FLF F +Q DR WI++ GPW
Subjt:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPW

Query:  FFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVT
         FD  LLVLE P+  +  + L F  V  WL++ N P+    +++ + IG  IGE  + D        G  +RL+V ++++KPL+R   + L+   E  + 
Subjt:  FFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVT

Query:  IRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPR
        +RYE+LPE+CF+CG+IGH  +DC  RR E  GG ++FE+G WM+     G+    GS R
Subjt:  IRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPR

A0A6J1BSZ1 uncharacterized protein LOC111005481 (e value: 5.6e-44; % identity: 38.4)
Query:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTR-KDFNVEIIGNNIFLFIFESQEDRDWIITNGP
        M    L+ EWK FKL  EE      +D+      G     SL+ KL+S R IS   +KN++  AWK   K F+V+IIG NIFLF F    DR+ I+  GP
Subjt:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTR-KDFNVEIIGNNIFLFIFESQEDRDWIITNGP

Query:  WFFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLD-DSSEIW
        W FDR+L++++ P S  + +D++F+ V +W+  F+L +   N+ +A ++G+ IG F + ++N N    G+ +R+RVR ++ KPL RG  + LD      W
Subjt:  WFFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLD-DSSEIW

Query:  VTIRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGY
        + I+YERLP+F ++CG + HI+KDC       D   +  ++G W++FQG+
Subjt:  VTIRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGY

A0A6J1DU55 uncharacterized protein LOC111023135 (e value: 5.4e-47; % identity: 36.77)
Query:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPW
        M+ E L+ +W++FKL  EE +    +D +         + SLVGKL++ R IS   +   ++ AWK      VE IG N+FLF F  + D + ++  GPW
Subjt:  MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPW

Query:  FFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLD-DSSEIWV
        FFD++L+VL+ P S +   +LEF  V  W+ +F+LP+ + N+ +A ++G+ IG F++ D N+     G S+R+RV ++ITKPLRRG  I +D      W+
Subjt:  FFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLD-DSSEIWV

Query:  TIRYERLPEFCFNCGVIGHIVKDCPHR-RNETDGGEEEFEFGMWMKFQGYTG---KTKNQGSPRKEN---NKTDNQVLRGTYISETDGKSSEGLDVDLNL
         I+YERLP+FC+ CGVIGH   DC  R     D      E+G W++F G      K +   SP +E+   + + N   RG  + ET  + SE  + D   
Subjt:  TIRYERLPEFCFNCGVIGHIVKDCPHR-RNETDGGEEEFEFGMWMKFQGYTG---KTKNQGSPRKEN---NKTDNQVLRGTYISETDGKSSEGLDVDLNL

Query:  VSPIAEETED
            AE+T D
Subjt:  VSPIAEETED

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G16676.1 unknown protein (e value: 4.5e-06; % identity: 28.42)
Query:  LPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPEFCFNCGVIGHIVKDCPHRRNE
        +P+ Y  E    +I  G+G+ +  D +         IR+R+R  IT  +R    I  D      +  +YERL   C +C  + H    CP+R+ E
Subjt:  LPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPEFCFNCGVIGHIVKDCPHRRNE

AT3G42140.1 zinc ion binding; nucleic acid binding (e value: 6.1e-11; % identity: 22.67)
Query:  KDFNVEIIGNNIFL----FIFESQEDRDWIITNGPWFFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNY
        K+ + E++G  + +    F+F+S+E    I+  GPW F+  + V++     +   D EFK +  W+++  +P+ +   +I   IG  +G FLE       
Subjt:  KDFNVEIIGNNIFL----FIFESQEDRDWIITNGPWFFDRSLLVLEAPNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNY

Query:  AQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEE
          LG  + +                         +  +YE+L  FC  CG++ H   +CP   N+    +++
Subjt:  AQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEE

AT5G36228.1 nucleic acid binding; zinc ion binding (e value: 4.1e-15; % identity: 26.26)
Query:  IGGQASQ--SLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFFDRSLLVLEAPNSKQRTMDLE----FKFVD
        +G  AS   SL+G++++ +  S       +   W      +  I+ +  F   F S+ D    +   PW F+   + L      QR  D        F+D
Subjt:  IGGQASQ--SLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFFDRSLLVLEAPNSKQRTMDLE----FKFVD

Query:  VWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPEFCFNCGVIGHIVKDCPH
        VW+ +  +P+ Y +E+  + I S +GE +  D N+        IR++VR++ T+PLR    +R        +   YE+L   C NC  + H V  CP+
Subjt:  VWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPEFCFNCGVIGHIVKDCPH


Sequences
CDS sequence
ATGGAAATGGAAGAGTTGATTAACGAATGGAAGAGGTTCAAATTGATGGATGAAGAAAAAGATAGAATCTTTACGCTGGACACAGAAGAAGCAAATGAAATTGGGGGGCA
GGCGAGTCAATCCTTGGTTGGCAAGCTGATATCGACCAGATTCATATCCAAATTGGCCATGAAGAATTCCATGATCGGTGCCTGGAAGACAAGGAAGGACTTTAATGTTG
AGATCATTGGAAACAATATCTTCCTATTCATTTTTGAAAGTCAAGAGGACAGAGATTGGATAATTACAAATGGACCTTGGTTTTTCGACAGAAGCCTTCTTGTTCTTGAA
GCGCCAAACTCAAAACAGAGAACCATGGACTTGGAGTTTAAATTTGTTGATGTCTGGCTTCGCGTCTTCAACCTGCCAATCTGTTATCGAAATGAGAAAATAGCAAAAAA
AATTGGCAGTGGCATAGGGGAATTCCTGGAGCAAGACAACAACAAAAATTATGCTCAACTAGGCAATAGCATCAGACTTCGTGTAAGACTGAATATCACAAAACCATTAC
GCAGGGGCTTCATGATTAGATTAGATGATTCTAGTGAAATTTGGGTAACCATCAGGTACGAACGATTACCTGAGTTTTGTTTTAATTGTGGAGTAATCGGTCACATTGTT
AAAGATTGCCCGCATAGAAGAAATGAAACGGATGGAGGGGAAGAAGAGTTCGAATTTGGAATGTGGATGAAGTTCCAAGGCTACACTGGGAAAACAAAGAATCAAGGGTC
TCCAAGAAAGGAAAATAATAAAACAGACAATCAAGTTTTAAGAGGGACATATATCTCTGAAACGGATGGCAAAAGCTCTGAGGGTCTGGATGTGGATCTTAATTTAGTTA
GTCCAATTGCTGAAGAAACGGAAGATCCAGAAACAACTGAGGAAGGAAGAGTTAGACAAATGATTATGGAAGGCAATAGAAATCCTGATGAAGAAGGGAGTCGCTGGGAT
ATTCTTCAAATGATGGAGGATGAATCACAGGTTAACTACAACAATGTGAGTCCTATCGAGGTGGAAGAAAGTCAGATTGTAAAAGAAACCCCCTTGCAGAAAAAGAGGAG
TTGGAAACGAAGGGAAAGGATTAATCAGTCAGATAACACTCATCAAGATAGCTCGGGTTCCAAGAAGAGGAAAGGAATAGGAACAGATGGGATCCAGAGGAAGAAGGCAA
AAGGGTTAGGGACAGATGGAGTGACTTCTGATGATGCAAAAAAGAACAACGCTACTATTAGCAGTGAAATGGAATTTGGTCCTAGAAATCGAGTGTCGAAGCAACATGCA
ACCCTTGATGTTTGGTCTCCTCCCCCTGATGGATTCTGGAAATTTAATGTCGACGCGGCCTGGCTTCATCTTGTCCAGCGACAGGCATTGGCATCATTGGAAGGAAATCG
GATGGCGATATTGGAGTGA
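As a quick consistency check, the opening and closing codons of the CDS can be translated and compared with the start (MEMEELINEW) and end (MAILE) of the protein sequence given in this record. The sketch below assumes the standard genetic code; the helper name `translate` is hypothetical, and the codon table is built from the common compact TCAG-ordered amino acid string.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGAAATGGAAGAGTTGATTAACGAATGG"  # first 30 bases of the CDS
cds_end = "ATGGCGATATTGGAGTGA"                # last 18 bases of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

Only short excerpts are used here to keep the example small; the same function can be applied to the full CDS once its lines are joined.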
mRNA sequence
ATGGAAATGGAAGAGTTGATTAACGAATGGAAGAGGTTCAAATTGATGGATGAAGAAAAAGATAGAATCTTTACGCTGGACACAGAAGAAGCAAATGAAATTGGGGGGCA
GGCGAGTCAATCCTTGGTTGGCAAGCTGATATCGACCAGATTCATATCCAAATTGGCCATGAAGAATTCCATGATCGGTGCCTGGAAGACAAGGAAGGACTTTAATGTTG
AGATCATTGGAAACAATATCTTCCTATTCATTTTTGAAAGTCAAGAGGACAGAGATTGGATAATTACAAATGGACCTTGGTTTTTCGACAGAAGCCTTCTTGTTCTTGAA
GCGCCAAACTCAAAACAGAGAACCATGGACTTGGAGTTTAAATTTGTTGATGTCTGGCTTCGCGTCTTCAACCTGCCAATCTGTTATCGAAATGAGAAAATAGCAAAAAA
AATTGGCAGTGGCATAGGGGAATTCCTGGAGCAAGACAACAACAAAAATTATGCTCAACTAGGCAATAGCATCAGACTTCGTGTAAGACTGAATATCACAAAACCATTAC
GCAGGGGCTTCATGATTAGATTAGATGATTCTAGTGAAATTTGGGTAACCATCAGGTACGAACGATTACCTGAGTTTTGTTTTAATTGTGGAGTAATCGGTCACATTGTT
AAAGATTGCCCGCATAGAAGAAATGAAACGGATGGAGGGGAAGAAGAGTTCGAATTTGGAATGTGGATGAAGTTCCAAGGCTACACTGGGAAAACAAAGAATCAAGGGTC
TCCAAGAAAGGAAAATAATAAAACAGACAATCAAGTTTTAAGAGGGACATATATCTCTGAAACGGATGGCAAAAGCTCTGAGGGTCTGGATGTGGATCTTAATTTAGTTA
GTCCAATTGCTGAAGAAACGGAAGATCCAGAAACAACTGAGGAAGGAAGAGTTAGACAAATGATTATGGAAGGCAATAGAAATCCTGATGAAGAAGGGAGTCGCTGGGAT
ATTCTTCAAATGATGGAGGATGAATCACAGGTTAACTACAACAATGTGAGTCCTATCGAGGTGGAAGAAAGTCAGATTGTAAAAGAAACCCCCTTGCAGAAAAAGAGGAG
TTGGAAACGAAGGGAAAGGATTAATCAGTCAGATAACACTCATCAAGATAGCTCGGGTTCCAAGAAGAGGAAAGGAATAGGAACAGATGGGATCCAGAGGAAGAAGGCAA
AAGGGTTAGGGACAGATGGAGTGACTTCTGATGATGCAAAAAAGAACAACGCTACTATTAGCAGTGAAATGGAATTTGGTCCTAGAAATCGAGTGTCGAAGCAACATGCA
ACCCTTGATGTTTGGTCTCCTCCCCCTGATGGATTCTGGAAATTTAATGTCGACGCGGCCTGGCTTCATCTTGTCCAGCGACAGGCATTGGCATCATTGGAAGGAAATCG
GATGGCGATATTGGAGTGA
Protein sequence
MEMEELINEWKRFKLMDEEKDRIFTLDTEEANEIGGQASQSLVGKLISTRFISKLAMKNSMIGAWKTRKDFNVEIIGNNIFLFIFESQEDRDWIITNGPWFFDRSLLVLE
APNSKQRTMDLEFKFVDVWLRVFNLPICYRNEKIAKKIGSGIGEFLEQDNNKNYAQLGNSIRLRVRLNITKPLRRGFMIRLDDSSEIWVTIRYERLPEFCFNCGVIGHIV
KDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPRKENNKTDNQVLRGTYISETDGKSSEGLDVDLNLVSPIAEETEDPETTEEGRVRQMIMEGNRNPDEEGSRWD
ILQMMEDESQVNYNNVSPIEVEESQIVKETPLQKKRSWKRRERINQSDNTHQDSSGSKKRKGIGTDGIQRKKAKGLGTDGVTSDDAKKNNATISSEMEFGPRNRVSKQHA
TLDVWSPPPDGFWKFNVDAAWLHLVQRQALASLEGNRMAILE
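The InterPro annotation IPR025836 in this record names the zinc knuckle pattern CX2CX4HX4C. As a rough illustration only, the regular expression below is a literal transcription of that pattern (it is not a substitute for InterPro's signature-based detection) and is applied to a fragment of the protein taken from the alignments on this page.

```python
import re

# Literal reading of CX2CX4HX4C: C, any 2 residues, C, any 4, H, any 4, C.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

# Fragment of the Lcy04g014630 protein as shown in the query alignment lines.
fragment = "YERLPEFCFNCGVIGHIVKDCPHRRNETDGGEEEFEFGMWMKFQGYTGKTKNQGSPRKENNK"

match = ZINC_KNUCKLE.search(fragment)
if match is not None:
    # start() is a 0-based offset within the fragment, not the full protein.
    print(match.group(), match.start())
```

The matched stretch, CFNCGVIGHIVKDC, is the same region where the homologous hits in the alignments conserve the cysteine and histidine residues.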