CuGenDBv2

Lag0038053 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038053
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr2:11996340..11998187
RNA-Seq Expression: Lag0038053
Synteny: Lag0038053
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
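
The InterPro entry name "Zinc knuckle CX2CX4HX4C" spells out a residue spacing that can be read as a simple pattern: Cys, any two residues, Cys, any four, His, any four, Cys. The sketch below is only an illustration of that reading, not how InterPro assigns domains (InterPro uses curated signatures); the function name `find_zinc_knuckles` is hypothetical, and the test fragment is copied from the protein sequence on this page.

```python
import re

# "CX2CX4HX4C" read literally as a regular expression:
# C, any 2 residues, C, any 4 residues, H, any 4 residues, C.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# Fragment of the Lag0038053 protein sequence shown on this page.
fragment = "SYEKLLDFCYYCGKLGHVFQECGEEAANLSNDNL"
print(find_zinc_knuckles(fragment))  # [(9, 'CYYCGKLGHVFQEC')]
```

A literal pattern like this can over- or under-call compared with a profile-based search, so a hit is best treated as a quick sanity check on the annotation.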


Homology
GenBank top hits (e value, % identity, alignment)
TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] | e value: 3.5e-34 | identity: 34.23%
Query:  EDKEVSNLLAKLKLAEDK-RLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY
        E  ++S    KL L +D   +  I+    E  ++    ++  K +T K IN E F   ++ IW  +  V +E  G N++  +F+   D+ RI++GGPW +
Subjt:  EDKEVSNLLAKLKLAEDK-RLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY

Query:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV
        D  +LVL E  GS  + +L+F+YV FWI  H LP  C  R+    LG ++G  +  +  E G+  G+ +R +V  ++  PLKRG  V +G   +   + +
Subjt:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV

Query:  SYEKLLDFCYYCGKLGHVFQEC
         YE+L +FCYYCGK+GH+ ++C
Subjt:  SYEKLLDFCYYCGKLGHVFQEC

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] | e value: 3.2e-32 | identity: 32.37%
Query:  SNLLA-----KLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLE-GLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY
        SNLL      KL   EDK  V I+   +E T K  + ++ CK+L+ + I+  V    +   W L+     ++  G N++L  F    D++RI++ GPW++
Subjt:  SNLLA-----KLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLE-GLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY

Query:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV
        D A+++++ P   T   +++F+ VS W+HF  L   C  +  A  LGN +G FE  E +      G  LR +V+ ++ +PL RG  + +       WIP+
Subjt:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV

Query:  SYEKLLDFCYYCGKLGHVFQECGEEAANLSNDNL-YGVEMR
         YE+L DF Y+CG+L H+ ++C +   +  + NL YG  +R
Subjt:  SYEKLLDFCYYCGKLGHVFQECGEEAANLSNDNL-YGVEMR

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia] | e value: 2.9e-33 | identity: 30.72%
Query:  EVSNLLAKLKLAEDK-RLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDDA
        E+++L    K   D+   V I+      T  + K  +  K+ T+K I+ E    +M  +W +    + E  G N+Y+  FK L +KSR++  GPW+++ +
Subjt:  EVSNLLAKLKLAEDK-RLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDDA

Query:  ILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSYE
        +LVL  P  +    ++ F + +FWI  H +P  C   + A  LG  +G  E  E D      G  +R +VK ++ +PL+RG  +K  S  +  W P+ YE
Subjt:  ILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSYE

Query:  KLLDFCYYCGKLGHVFQECGE--EAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGR---------GRSGRGRFGRGGYNFHRKDMLGDESR
        KL DFCY CGK+GH  +EC +  +    ++   YG  +R T   K +     S  +++  W  GR         GR GRG + R   N+  +D+ G ES 
Subjt:  KLLDFCYYCGKLGHVFQECGE--EAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGR---------GRSGRGRFGRGGYNFHRKDMLGDESR

Query:  NNRNLNSDSWRKKNPETPT
        + R +     R  N E+ T
Subjt:  NNRNLNSDSWRKKNPETPT

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] | e value: 1.5e-32 | identity: 30.09%
Query:  NLLA-----KLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDD
        NLLA     KL   ED+  + ++ + ++  ++    ++  K+L  + I+ +V S ++   W +E  + +E  G N++L  F    D +R++K GPW +D 
Subjt:  NLLA-----KLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDD

Query:  AILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSY
        A++VL++P  S +I ELEF  V+FWIH   LP     +  A  LGN +G F   +C+E G   G +LR +V  +I +PL+RG  + I       WIP+ Y
Subjt:  AILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSY

Query:  EKLLDFCYYCGKLGHVFQECGEEAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGRGRSGRGRFGRGGYNFHRKDMLGDESRNNRNLNSDSW
        E+L DFCY+CG +GH   +C         D  Y     D++ +            +   W R  G     + GR G +  R+D  G  S N+        
Subjt:  EKLLDFCYYCGKLGHVFQECGEEAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGRGRSGRGRFGRGGYNFHRKDMLGDESRNNRNLNSDSW

Query:  RKKNPETPTQPQSSEQTMEEVSEEERSQRKGGENSSPAS
         K+     T+ Q SEQT ++  + + +  + G+    AS
Subjt:  RKKNPETPTQPQSSEQTMEEVSEEERSQRKGGENSSPAS

XP_024041282.1 uncharacterized protein LOC112098886 [Citrus clementina] | e value: 2.9e-33 | identity: 34.72%
Query:  KILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKY
        K+L T+ +N E   + M R+W     VKIE  G NV++ KF    DK  II GGPW +D A++VL EP G  ++++ +F+++SFW+  H +P +C  ++ 
Subjt:  KILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKY

Query:  AEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSYEKLLDFCYYCGKLGHVFQECGEEAANLSNDNLYGVEMRDT
        A  LG V+G  E  E D  G+  G+ LR ++  ++ + LK+    + G   E   I V YE+LLDFC+ CG++ H ++EC    +   ++  YG  ++  
Subjt:  AEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSYEKLLDFCYYCGKLGHVFQECGEEAANLSNDNLYGVEMRDT

Query:  KGSKGIYRSWKSDLKD
          +K + +S   D  D
Subjt:  KGSKGIYRSWKSDLKD

TrEMBL top hits (e value, % identity, alignment)
A0A5C7H9Y2 CCHC-type domain-containing protein | e value: 1.7e-34 | identity: 34.23%
Query:  EDKEVSNLLAKLKLAEDK-RLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY
        E  ++S    KL L +D   +  I+    E  ++    ++  K +T K IN E F   ++ IW  +  V +E  G N++  +F+   D+ RI++GGPW +
Subjt:  EDKEVSNLLAKLKLAEDK-RLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY

Query:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV
        D  +LVL E  GS  + +L+F+YV FWI  H LP  C  R+    LG ++G  +  +  E G+  G+ +R +V  ++  PLKRG  V +G   +   + +
Subjt:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV

Query:  SYEKLLDFCYYCGKLGHVFQEC
         YE+L +FCYYCGK+GH+ ++C
Subjt:  SYEKLLDFCYYCGKLGHVFQEC

A0A5C7I6E5 CCHC-type domain-containing protein | e value: 2.7e-32 | identity: 30%
Query:  KEVSNLLAKLKLAEDKRLVAIEDEDIEDTDKDFKNAIAC---KILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY
        +E++ L   L + E +  +     ++  TD+  K    C   K+LT++ +  EVF  +M++IW + G V IE    N++   F    D+ R+++GGPWS+
Subjt:  KEVSNLLAKLKLAEDKRLVAIEDEDIEDTDKDFKNAIAC---KILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY

Query:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV
        D AI++ +EP G+  I  L F YV FW+  H LP +C   +    LG+++G     +        G  +R +V   +++PL+R   V +    + + + +
Subjt:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV

Query:  SYEKLLDFCYYCGKLGHVFQECGEE--AANLSND--NLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGRGRSGRGRFGRGGYNFHRKDMLGDESRNNRN
         YE+LLD+C+ CG+LGH+  EC E+    +LS+D     GV +R     K  +  +     D+ IWSR +   G  R   G                NRN
Subjt:  SYEKLLDFCYYCGKLGHVFQECGEE--AANLSND--NLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGRGRSGRGRFGRGGYNFHRKDMLGDESRNNRN

Query:  LNSDSWRKKN
        L S +WR+++
Subjt:  LNSDSWRKKN

A0A6J1BSZ1 uncharacterized protein LOC111005481 | e value: 1.6e-32 | identity: 32.37%
Query:  SNLLA-----KLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLE-GLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY
        SNLL      KL   EDK  V I+   +E T K  + ++ CK+L+ + I+  V    +   W L+     ++  G N++L  F    D++RI++ GPW++
Subjt:  SNLLA-----KLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLE-GLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSY

Query:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV
        D A+++++ P   T   +++F+ VS W+HF  L   C  +  A  LGN +G FE  E +      G  LR +V+ ++ +PL RG  + +       WIP+
Subjt:  DDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPV

Query:  SYEKLLDFCYYCGKLGHVFQECGEEAANLSNDNL-YGVEMR
         YE+L DF Y+CG+L H+ ++C +   +  + NL YG  +R
Subjt:  SYEKLLDFCYYCGKLGHVFQECGEEAANLSNDNL-YGVEMR

A0A6J1D765 uncharacterized protein LOC111017902 | e value: 1.4e-33 | identity: 30.72%
Query:  EVSNLLAKLKLAEDK-RLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDDA
        E+++L    K   D+   V I+      T  + K  +  K+ T+K I+ E    +M  +W +    + E  G N+Y+  FK L +KSR++  GPW+++ +
Subjt:  EVSNLLAKLKLAEDK-RLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDDA

Query:  ILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSYE
        +LVL  P  +    ++ F + +FWI  H +P  C   + A  LG  +G  E  E D      G  +R +VK ++ +PL+RG  +K  S  +  W P+ YE
Subjt:  ILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSYE

Query:  KLLDFCYYCGKLGHVFQECGE--EAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGR---------GRSGRGRFGRGGYNFHRKDMLGDESR
        KL DFCY CGK+GH  +EC +  +    ++   YG  +R T   K +     S  +++  W  GR         GR GRG + R   N+  +D+ G ES 
Subjt:  KLLDFCYYCGKLGHVFQECGE--EAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGR---------GRSGRGRFGRGGYNFHRKDMLGDESR

Query:  NNRNLNSDSWRKKNPETPT
        + R +     R  N E+ T
Subjt:  NNRNLNSDSWRKKNPETPT

A0A6J1DU55 uncharacterized protein LOC111023135 | e value: 7.0e-33 | identity: 30.09%
Query:  NLLA-----KLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDD
        NLLA     KL   ED+  + ++ + ++  ++    ++  K+L  + I+ +V S ++   W +E  + +E  G N++L  F    D +R++K GPW +D 
Subjt:  NLLA-----KLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMMARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDD

Query:  AILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSY
        A++VL++P  S +I ELEF  V+FWIH   LP     +  A  LGN +G F   +C+E G   G +LR +V  +I +PL+RG  + I       WIP+ Y
Subjt:  AILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSY

Query:  EKLLDFCYYCGKLGHVFQECGEEAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGRGRSGRGRFGRGGYNFHRKDMLGDESRNNRNLNSDSW
        E+L DFCY+CG +GH   +C         D  Y     D++ +            +   W R  G     + GR G +  R+D  G  S N+        
Subjt:  EKLLDFCYYCGKLGHVFQECGEEAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGRGRSGRGRFGRGGYNFHRKDMLGDESRNNRNLNSDSW

Query:  RKKNPETPTQPQSSEQTMEEVSEEERSQRKGGENSSPAS
         K+     T+ Q SEQT ++  + + +  + G+    AS
Subjt:  RKKNPETPTQPQSSEQTMEEVSEEERSQRKGGENSSPAS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G42140.1 zinc ion binding;nucleic acid binding | e value: 8.5e-07 | identity: 24.83%
Query:  IIKGGPWSYDDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGS
        I++ GPWS++D + V++  + +    + EFK + FWI    +P      +   ++G  MG F                       +E  L R  +V    
Subjt:  IIKGGPWSYDDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETLRDKVKGNIEEPLKRGTNVKIGS

Query:  KAERTWIPVSYEKLLDFCYYCGKLGHVFQEC---GEEAANLSNDN
              +   YEKL +FC  CG L H   EC   G +  +  +D+
Subjt:  KAERTWIPVSYEKLLDFCYYCGKLGHVFQEC---GEEAANLSNDN


Sequences
CDS sequence
ATGGAAAAAAAGGGCAACCAGAAGGTTTGCAACCTGGAGCAAGAGCTAAGCAATAGAGTTATGAAGACGAATCCGACGATAATCTCGGCCACTTCGACAGAGAAAGGGCC
ATCAAAAATCACGGAAGTATCAGGGTATCCCAACCACGGGGAACCAGAAGATAAAGAAGTTTCAAATTTATTAGCGAAGCTAAAATTAGCAGAGGACAAGAGGCTGGTGG
CGATCGAAGACGAAGACATTGAAGATACTGATAAAGACTTCAAGAACGCTATAGCCTGCAAGATTCTAACAACAAAATATATTAATCCTGAAGTGTTCTCAATCATGATG
GCAAGGATCTGGGGCCTGGAGGGCCTGGTGAAGATAGAGAAATCAGGGACCAACGTGTACCTATGCAAATTTAAATGGCTCAGAGATAAATCTAGAATAATCAAAGGCGG
GCCGTGGTCTTATGACGACGCTATTCTCGTATTAGAAGAACCAAAAGGGAGTACTAGTATAGAAGAACTGGAGTTTAAATATGTCTCTTTCTGGATTCACTTCCATAAAT
TACCCAGAGTTTGTTTTTGCAGGAAATATGCAGAGGCTTTGGGGAACGTTATGGGAACCTTCGAATCAGCTGAATGTGATGAATATGGGAAAATGGGAGGAGAAACATTG
CGGGACAAGGTAAAAGGGAATATCGAAGAACCGTTGAAAAGAGGCACAAACGTCAAAATAGGTTCGAAGGCTGAAAGAACATGGATTCCCGTTTCTTACGAAAAACTTCT
GGATTTTTGTTATTATTGCGGGAAACTCGGCCATGTTTTCCAAGAATGTGGGGAAGAAGCAGCAAATCTCAGCAATGATAACCTTTACGGGGTAGAGATGCGTGATACTA
AGGGAAGCAAAGGGATTTACAGGAGCTGGAAATCAGACCTTAAGGATCAGAACATCTGGTCAAGAGGTAGAGGAAGAAGCGGTCGGGGAAGATTTGGAAGAGGAGGATAC
AATTTTCATAGAAAAGATATGCTGGGGGATGAATCTCGTAACAATAGAAACTTGAACAGCGACAGTTGGCGCAAAAAGAACCCAGAAACTCCCACCCAACCCCAAAGCTC
AGAACAAACCATGGAAGAAGTTAGCGAAGAGGAAAGAAGTCAAAGAAAAGGTGGTGAAAATTCATCGCCGGCGAGTGGCGACATCAACGGCAGATCTCCGATGGAGTTAT
CCCCAGCACACAGCTCAAGAATTCAAACGGCTACTGACAAGGAAGCATCAGATAAAGACAACAGTAAAGGAAAAGAGAAAATCTCTGACACAGAGGGGTCTTTTCAACCG
AACAATTCAGAATATAAGATAAGAGAGGATAAACCCAAAAGCCCAGACAGCAGTCCAAGAATACATAATCAGGGCTATCAGTCAGGGGCCCATAGAGGAAAACTCCAATA
CAAAGAAATGGAAAGAAGGCTGCGGCCCATGGGAAATGAAGTCGCCTCTGATATATCTCCAGAAAAGGAACCCGAAAAGAAACAAGACAACCAGATGCTCCACCAATCTG
TCACGAATAGGAAGCATCATCCGGGAAAATCATGGAAAAGAAGAGCTCGAGAAGGAAATCTAGAAAAAAATCCAGTGCTTAATACTCAAGCTCAAACCGAGAAAAAACAT
GGGAGAGAGGACCAAGAGGAGACAGAACAAAGCAAGAGGACTCGGGTAGACTACTTCGGCATGGCCGTTGGGATATCGGCGGAGGCTGCGGAGCAGCCCCGCCGGACGCC
ATGA
mRNA sequence
ATGGAAAAAAAGGGCAACCAGAAGGTTTGCAACCTGGAGCAAGAGCTAAGCAATAGAGTTATGAAGACGAATCCGACGATAATCTCGGCCACTTCGACAGAGAAAGGGCC
ATCAAAAATCACGGAAGTATCAGGGTATCCCAACCACGGGGAACCAGAAGATAAAGAAGTTTCAAATTTATTAGCGAAGCTAAAATTAGCAGAGGACAAGAGGCTGGTGG
CGATCGAAGACGAAGACATTGAAGATACTGATAAAGACTTCAAGAACGCTATAGCCTGCAAGATTCTAACAACAAAATATATTAATCCTGAAGTGTTCTCAATCATGATG
GCAAGGATCTGGGGCCTGGAGGGCCTGGTGAAGATAGAGAAATCAGGGACCAACGTGTACCTATGCAAATTTAAATGGCTCAGAGATAAATCTAGAATAATCAAAGGCGG
GCCGTGGTCTTATGACGACGCTATTCTCGTATTAGAAGAACCAAAAGGGAGTACTAGTATAGAAGAACTGGAGTTTAAATATGTCTCTTTCTGGATTCACTTCCATAAAT
TACCCAGAGTTTGTTTTTGCAGGAAATATGCAGAGGCTTTGGGGAACGTTATGGGAACCTTCGAATCAGCTGAATGTGATGAATATGGGAAAATGGGAGGAGAAACATTG
CGGGACAAGGTAAAAGGGAATATCGAAGAACCGTTGAAAAGAGGCACAAACGTCAAAATAGGTTCGAAGGCTGAAAGAACATGGATTCCCGTTTCTTACGAAAAACTTCT
GGATTTTTGTTATTATTGCGGGAAACTCGGCCATGTTTTCCAAGAATGTGGGGAAGAAGCAGCAAATCTCAGCAATGATAACCTTTACGGGGTAGAGATGCGTGATACTA
AGGGAAGCAAAGGGATTTACAGGAGCTGGAAATCAGACCTTAAGGATCAGAACATCTGGTCAAGAGGTAGAGGAAGAAGCGGTCGGGGAAGATTTGGAAGAGGAGGATAC
AATTTTCATAGAAAAGATATGCTGGGGGATGAATCTCGTAACAATAGAAACTTGAACAGCGACAGTTGGCGCAAAAAGAACCCAGAAACTCCCACCCAACCCCAAAGCTC
AGAACAAACCATGGAAGAAGTTAGCGAAGAGGAAAGAAGTCAAAGAAAAGGTGGTGAAAATTCATCGCCGGCGAGTGGCGACATCAACGGCAGATCTCCGATGGAGTTAT
CCCCAGCACACAGCTCAAGAATTCAAACGGCTACTGACAAGGAAGCATCAGATAAAGACAACAGTAAAGGAAAAGAGAAAATCTCTGACACAGAGGGGTCTTTTCAACCG
AACAATTCAGAATATAAGATAAGAGAGGATAAACCCAAAAGCCCAGACAGCAGTCCAAGAATACATAATCAGGGCTATCAGTCAGGGGCCCATAGAGGAAAACTCCAATA
CAAAGAAATGGAAAGAAGGCTGCGGCCCATGGGAAATGAAGTCGCCTCTGATATATCTCCAGAAAAGGAACCCGAAAAGAAACAAGACAACCAGATGCTCCACCAATCTG
TCACGAATAGGAAGCATCATCCGGGAAAATCATGGAAAAGAAGAGCTCGAGAAGGAAATCTAGAAAAAAATCCAGTGCTTAATACTCAAGCTCAAACCGAGAAAAAACAT
GGGAGAGAGGACCAAGAGGAGACAGAACAAAGCAAGAGGACTCGGGTAGACTACTTCGGCATGGCCGTTGGGATATCGGCGGAGGCTGCGGAGCAGCCCCGCCGGACGCC
ATGA
Protein sequence
MEKKGNQKVCNLEQELSNRVMKTNPTIISATSTEKGPSKITEVSGYPNHGEPEDKEVSNLLAKLKLAEDKRLVAIEDEDIEDTDKDFKNAIACKILTTKYINPEVFSIMM
ARIWGLEGLVKIEKSGTNVYLCKFKWLRDKSRIIKGGPWSYDDAILVLEEPKGSTSIEELEFKYVSFWIHFHKLPRVCFCRKYAEALGNVMGTFESAECDEYGKMGGETL
RDKVKGNIEEPLKRGTNVKIGSKAERTWIPVSYEKLLDFCYYCGKLGHVFQECGEEAANLSNDNLYGVEMRDTKGSKGIYRSWKSDLKDQNIWSRGRGRSGRGRFGRGGY
NFHRKDMLGDESRNNRNLNSDSWRKKNPETPTQPQSSEQTMEEVSEEERSQRKGGENSSPASGDINGRSPMELSPAHSSRIQTATDKEASDKDNSKGKEKISDTEGSFQP
NNSEYKIREDKPKSPDSSPRIHNQGYQSGAHRGKLQYKEMERRLRPMGNEVASDISPEKEPEKKQDNQMLHQSVTNRKHHPGKSWKRRAREGNLEKNPVLNTQAQTEKKH
GREDQEETEQSKRTRVDYFGMAVGISAEAAEQPRRTP
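
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `CODON_TABLE` and `translate` are illustrative names, not part of any CuGenDB tooling, and the two test strings are the first 21 and last 15 nucleotides of the CDS shown on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order for each of the three positions; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Start and end of the Lag0038053 CDS as listed on this page.
print(translate("ATGGAAAAAAAGGGCAACCAG"))  # MEKKGNQ
print(translate("CGCCGGACGCCATGA"))        # RRTP*
```

Pasting the full CDS into `translate` and removing the trailing `*` should reproduce the protein sequence; any mismatch would point to a copy error or a non-standard code.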