; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017468 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017468
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr5:4038206..4039504
RNA-Seq ExpressionLag0017468
SyntenyLag0017468
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036875 - Zinc finger, CCHC-type superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]2.0e-3931.38Show/hide
Query:  EKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDRNLIVLEAPKNDQRT
        ++ED ++   EE  E +    + L+GKL +    +  A K  ++ AW+ +    ++ ++ N FLF+F +  D + + RNGPW FDRNL++L     +++ 
Subjt:  EKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDRNLIVLEAPKNDQRT

Query:  VDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRYERVPDFCFHCGLIG
         D++ + V+FW+R+ +LP   R++ +A+K+GN +G F E+DP ++ N  G  +R++  LD+ KPL+RG   K ++K    W+  +YER+P+FCF CG IG
Subjt:  VDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRYERVPDFCFHCGLIG

Query:  HVAKECSKNSDGEERN-----NKNFEFGMWMKFQG----FARPPKSTET--------PRNNEEKKANNDPKKNDRDEAGGKSEEGLNVAL
        H  KEC    D +E N      K+  +G W++       F  P K   +        P +++ K   ++ KK+   E   +   G+   L
Subjt:  HVAKECSKNSDGEERN-----NKNFEFGMWMKFQG----FARPPKSTET--------PRNNEEKKANNDPKKNDRDEAGGKSEEGLNVAL

OMO70721.1 Zinc finger, CCHC-type [Corchorus olitorius]2.9e-3834.85Show/hide
Query:  DSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFD
        + L D WK F L   E++D++  ++ E   +  Q   +L+GKLL+ +  +K A  N +   WK  KE  +  ++ N FLFKF +  DKE +    PW F 
Subjt:  DSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFD

Query:  RNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIR
         NL++      D R  D  F K  FWIR+ +L +G R   +A  IG  +GE +++D   +   W + +R+RV +D+TKPLRR  + K ++        + 
Subjt:  RNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIR

Query:  YERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWM
        YER P FC+ CG IGHV+++CS     +E   +  ++G WM
Subjt:  YERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWM

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]2.7e-4438.4Show/hide
Query:  MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQY-LIGKLLSNRIISKVALKNALSTAWKTR-KEFNIEIVDTNTFLFKFESLDDKEWITRNG
        M   +L++EWK F L   E++ I    +   LE  G+  +  LI KLLS R IS   LKN L  AWK   K F+++I+  N FLF F    D+  I R G
Subjt:  MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQY-LIGKLLSNRIISKVALKNALSTAWKTR-KEFNIEIVDTNTFLFKFESLDDKEWITRNG

Query:  PWFFDRNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETC
        PW FDR LI+++ P +  + +D++F  V  W+   +L +   N  +A ++GN +G F +++ N N   WG+ +R+RV+ D+ KPL RG     +     C
Subjt:  PWFFDRNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETC

Query:  WISIRYERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWMKFQG
        WI I+YER+PDF +HCG + H+ K+CS      +  +KN ++G W++FQG
Subjt:  WISIRYERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWMKFQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]4.5e-4738.15Show/hide
Query:  MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPW
        M+ ++L+ +W++F L   E E  M  + +     +      L+GKLL+ RIIS   L   L  AWK   +  +E +  N FLF F    D   + + GPW
Subjt:  MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPW

Query:  FFDRNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWI
        FFD+ LIVL+ P + +   ++EFN+V FWI + +LP+ + N  +A ++GN +G F+++D N     WG S+RIRV +DITKPLRRG     +     CWI
Subjt:  FFDRNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWI

Query:  SIRYERVPDFCFHCGLIGHVAKEC-SKNSDGEERNNKNFEFGMWMKFQG
         I+YER+PDFC+ CG+IGH + +C ++    ++ +    E+G W++F G
Subjt:  SIRYERVPDFCFHCGLIGHVAKEC-SKNSDGEERNNKNFEFGMWMKFQG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]4.5e-3929.13Show/hide
Query:  LIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKE-FNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDR
        L++EWK F L   E+E  +  +         +  Q L+GKL   R I+   +KN + TAWK     F ++ +  N FLF F    D+  I ++GPW FDR
Subjt:  LIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKE-FNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDR

Query:  NLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRY
         L+++  P       +++F K+  W+R  +LP+G     +A ++GN LG F E D +     WG+++R+RV LDI+KPLRRG     +      WI I+Y
Subjt:  NLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRY

Query:  ERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWMKFQGFARP--PKSTETPRNNEEKKANNDPKKNDRDEAGGKSEEGLN-------VALNLES
        ER+PDFC+HCGL                 + K  ++G W+++QG  +P  P+  +   +  +K  NN    +      G   +G+        +A+ +ES
Subjt:  ERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWMKFQGFARP--PKSTETPRNNEEKKANNDPKKNDRDEAGGKSEEGLN-------VALNLES

Query:  PMDEEIQ---DPNLRGQTGVISMPSEELMNSYKRFENWGMLHLTKEVPRLESP-NSALAGLEVSQTGSISSSRTNPTWKRR
        P+ E  +   +P+ +G++ V+    E+ +N              KE+  L  P  S    ++ S + S++    +P +  R
Subjt:  PMDEEIQ---DPNLRGQTGVISMPSEELMNSYKRFENWGMLHLTKEVPRLESP-NSALAGLEVSQTGSISSSRTNPTWKRR

TrEMBL top hitse value%identityAlignment
A0A2Z6NZV1 Uncharacterized protein9.7e-4031.38Show/hide
Query:  EKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDRNLIVLEAPKNDQRT
        ++ED ++   EE  E +    + L+GKL +    +  A K  ++ AW+ +    ++ ++ N FLF+F +  D + + RNGPW FDRNL++L     +++ 
Subjt:  EKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDRNLIVLEAPKNDQRT

Query:  VDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRYERVPDFCFHCGLIG
         D++ + V+FW+R+ +LP   R++ +A+K+GN +G F E+DP ++ N  G  +R++  LD+ KPL+RG   K ++K    W+  +YER+P+FCF CG IG
Subjt:  VDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRYERVPDFCFHCGLIG

Query:  HVAKECSKNSDGEERN-----NKNFEFGMWMKFQG----FARPPKSTET--------PRNNEEKKANNDPKKNDRDEAGGKSEEGLNVAL
        H  KEC    D +E N      K+  +G W++       F  P K   +        P +++ K   ++ KK+   E   +   G+   L
Subjt:  HVAKECSKNSDGEERN-----NKNFEFGMWMKFQG----FARPPKSTET--------PRNNEEKKANNDPKKNDRDEAGGKSEEGLNVAL

A0A6J1BSZ1 uncharacterized protein LOC1110054811.3e-4438.4Show/hide
Query:  MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQY-LIGKLLSNRIISKVALKNALSTAWKTR-KEFNIEIVDTNTFLFKFESLDDKEWITRNG
        M   +L++EWK F L   E++ I    +   LE  G+  +  LI KLLS R IS   LKN L  AWK   K F+++I+  N FLF F    D+  I R G
Subjt:  MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQY-LIGKLLSNRIISKVALKNALSTAWKTR-KEFNIEIVDTNTFLFKFESLDDKEWITRNG

Query:  PWFFDRNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETC
        PW FDR LI+++ P +  + +D++F  V  W+   +L +   N  +A ++GN +G F +++ N N   WG+ +R+RV+ D+ KPL RG     +     C
Subjt:  PWFFDRNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETC

Query:  WISIRYERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWMKFQG
        WI I+YER+PDF +HCG + H+ K+CS      +  +KN ++G W++FQG
Subjt:  WISIRYERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWMKFQG

A0A6J1DU55 uncharacterized protein LOC1110231352.2e-4738.15Show/hide
Query:  MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPW
        M+ ++L+ +W++F L   E E  M  + +     +      L+GKLL+ RIIS   L   L  AWK   +  +E +  N FLF F    D   + + GPW
Subjt:  MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPW

Query:  FFDRNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWI
        FFD+ LIVL+ P + +   ++EFN+V FWI + +LP+ + N  +A ++GN +G F+++D N     WG S+RIRV +DITKPLRRG     +     CWI
Subjt:  FFDRNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWI

Query:  SIRYERVPDFCFHCGLIGHVAKEC-SKNSDGEERNNKNFEFGMWMKFQG
         I+YER+PDFC+ CG+IGH + +C ++    ++ +    E+G W++F G
Subjt:  SIRYERVPDFCFHCGLIGHVAKEC-SKNSDGEERNNKNFEFGMWMKFQG

A0A6J1DX30 uncharacterized protein LOC1110248742.2e-3929.13Show/hide
Query:  LIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKE-FNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDR
        L++EWK F L   E+E  +  +         +  Q L+GKL   R I+   +KN + TAWK     F ++ +  N FLF F    D+  I ++GPW FDR
Subjt:  LIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKE-FNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDR

Query:  NLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRY
         L+++  P       +++F K+  W+R  +LP+G     +A ++GN LG F E D +     WG+++R+RV LDI+KPLRRG     +      WI I+Y
Subjt:  NLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRY

Query:  ERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWMKFQGFARP--PKSTETPRNNEEKKANNDPKKNDRDEAGGKSEEGLN-------VALNLES
        ER+PDFC+HCGL                 + K  ++G W+++QG  +P  P+  +   +  +K  NN    +      G   +G+        +A+ +ES
Subjt:  ERVPDFCFHCGLIGHVAKECSKNSDGEERNNKNFEFGMWMKFQGFARP--PKSTETPRNNEEKKANNDPKKNDRDEAGGKSEEGLN-------VALNLES

Query:  PMDEEIQ---DPNLRGQTGVISMPSEELMNSYKRFENWGMLHLTKEVPRLESP-NSALAGLEVSQTGSISSSRTNPTWKRR
        P+ E  +   +P+ +G++ V+    E+ +N              KE+  L  P  S    ++ S + S++    +P +  R
Subjt:  PMDEEIQ---DPNLRGQTGVISMPSEELMNSYKRFENWGMLHLTKEVPRLESP-NSALAGLEVSQTGSISSSRTNPTWKRR

A0A803N409 Uncharacterized protein3.7e-3932.95Show/hide
Query:  DSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFD
        D +I EW++ +L E E + +    E + L  + Q    L+GKL + +  +  A+K  L+T WK      I ++DTN F+F+F + +DK+W+    PWFFD
Subjt:  DSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFD

Query:  RNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIR
          L++L     D++  D++ ++   W+R+ ++P   R   V R++G+ LG F++ D   +   WG  +RI+V +DI KPL RG +F     ++  WI I+
Subjt:  RNLIVLEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIR

Query:  YERVPDFCFHCGLIGHVAKECS-KNSDGEERNNKNFEFGMWMKFQGFARPPKSTETPRNNEEKK
        YE++ DFCF+CG + H  K C+ K  D E+ +   +++G W++    A P K+++      EK+
Subjt:  YERVPDFCFHCGLIGHVAKECS-KNSDGEERNNKNFEFGMWMKFQGFARPPKSTETPRNNEEKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41590.1 unknown protein8.5e-1224.68Show/hide
Query:  WKRFNLLELEKEDIMSFNEEEKLEIKGQTNQY-LIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDRNLIV
        W     LEL +ED   F   E   +   TN+  +I + L+ R+ +  ++  AL   W    + +  I+D     F F++  D   + R  PW F+   + 
Subjt:  WKRFNLLELEKEDIMSFNEEEKLEIKGQTNQY-LIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDRNLIV

Query:  LEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRYERVP
          A + +          +D W++I  +P+ Y ++    +I  +LGE L +D +   +     IR+RV+  IT  L R F     +  ET  I  +YER+ 
Subjt:  LEAPKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRYERVP

Query:  DFCFHCGLIGH-------------VAKECSKNSDGEERNNKNFEFGMWMKFQGFAR--PPKSTETPRNNEEKKA---NNDPKKNDRDEAGGKS---EEGL
          C  C    H             +A+E +   D   R++ N +  M        +  PP+ +  P N++E  A   + D  +ND     G+S    + L
Subjt:  DFCFHCGLIGH-------------VAKECSKNSDGEERNNKNFEFGMWMKFQGFAR--PPKSTETPRNNEEKKA---NNDPKKNDRDEAGGKS---EEGL

Query:  NVALNLESPMDE
        + A N  +P  E
Subjt:  NVALNLESPMDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATAGACAGCCTGATAGACGAATGGAAGAGGTTCAACCTATTGGAGTTAGAAAAAGAAGACATCATGTCTTTCAATGAGGAAGAAAAGCTGGAGATAAAGGGACA
AACAAATCAATACCTGATAGGCAAACTGCTCTCAAACAGAATAATATCAAAAGTGGCGCTCAAGAATGCTTTATCAACCGCCTGGAAGACAAGGAAAGAATTTAATATTG
AAATTGTGGACACAAATACTTTCCTCTTCAAATTCGAAAGCCTGGATGACAAAGAATGGATAACAAGAAACGGTCCATGGTTCTTTGACAGGAATTTAATAGTTCTTGAA
GCTCCAAAGAACGACCAGCGAACAGTTGACATAGAGTTCAATAAAGTTGACTTCTGGATAAGAATAATAAACCTTCCAATAGGATACCGAAACGATATAGTGGCAAGAAA
GATTGGCAACAATCTAGGTGAATTTCTGGAAATTGACCCAAACAGAAATGAAAATCCATGGGGCAACAGCATAAGGATCAGGGTCAAATTAGACATCACCAAACCACTAA
GGAGAGGTTTCATGTTCAAAAAAGAAAACAAGACCGAAACTTGTTGGATCTCTATCCGTTACGAAAGAGTCCCTGACTTCTGCTTCCACTGTGGACTAATTGGGCATGTA
GCAAAGGAGTGTTCCAAGAACTCTGATGGAGAGGAAAGAAACAACAAAAATTTTGAATTTGGCATGTGGATGAAATTCCAGGGCTTTGCTAGGCCACCAAAAAGCACTGA
AACTCCAAGGAACAACGAGGAGAAGAAAGCTAATAATGATCCAAAAAAGAATGATAGAGATGAAGCAGGGGGAAAAAGTGAAGAAGGACTGAATGTTGCCCTAAATTTGG
AAAGTCCGATGGATGAAGAAATTCAAGATCCGAATCTAAGAGGACAGACAGGGGTTATTTCAATGCCTTCAGAAGAATTGATGAATTCTTATAAAAGATTTGAAAATTGG
GGTATGTTGCATCTGACCAAGGAAGTGCCAAGACTAGAGAGTCCCAACAGTGCCCTTGCTGGGTTAGAAGTCAGTCAAACCGGTTCAATATCTTCCTCCAGGACAAATCC
GACTTGGAAGAGAAGAGGAAGACGTTCTCAATCTGAGGAAATTCAAATTGGAGTATCAAGATCAAAGAAAAGGAAATGTGAGGAAGTGGAAGACAACAAGAAGAAAAAGT
CCAAGGAGACAAAGGAAGTCGAGGAGGTTGTCACGGTTTCTGAACAGAATTTGGCAGTGGCTGTTTCACAGCCCTGCCTGGGATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAATAGACAGCCTGATAGACGAATGGAAGAGGTTCAACCTATTGGAGTTAGAAAAAGAAGACATCATGTCTTTCAATGAGGAAGAAAAGCTGGAGATAAAGGGACA
AACAAATCAATACCTGATAGGCAAACTGCTCTCAAACAGAATAATATCAAAAGTGGCGCTCAAGAATGCTTTATCAACCGCCTGGAAGACAAGGAAAGAATTTAATATTG
AAATTGTGGACACAAATACTTTCCTCTTCAAATTCGAAAGCCTGGATGACAAAGAATGGATAACAAGAAACGGTCCATGGTTCTTTGACAGGAATTTAATAGTTCTTGAA
GCTCCAAAGAACGACCAGCGAACAGTTGACATAGAGTTCAATAAAGTTGACTTCTGGATAAGAATAATAAACCTTCCAATAGGATACCGAAACGATATAGTGGCAAGAAA
GATTGGCAACAATCTAGGTGAATTTCTGGAAATTGACCCAAACAGAAATGAAAATCCATGGGGCAACAGCATAAGGATCAGGGTCAAATTAGACATCACCAAACCACTAA
GGAGAGGTTTCATGTTCAAAAAAGAAAACAAGACCGAAACTTGTTGGATCTCTATCCGTTACGAAAGAGTCCCTGACTTCTGCTTCCACTGTGGACTAATTGGGCATGTA
GCAAAGGAGTGTTCCAAGAACTCTGATGGAGAGGAAAGAAACAACAAAAATTTTGAATTTGGCATGTGGATGAAATTCCAGGGCTTTGCTAGGCCACCAAAAAGCACTGA
AACTCCAAGGAACAACGAGGAGAAGAAAGCTAATAATGATCCAAAAAAGAATGATAGAGATGAAGCAGGGGGAAAAAGTGAAGAAGGACTGAATGTTGCCCTAAATTTGG
AAAGTCCGATGGATGAAGAAATTCAAGATCCGAATCTAAGAGGACAGACAGGGGTTATTTCAATGCCTTCAGAAGAATTGATGAATTCTTATAAAAGATTTGAAAATTGG
GGTATGTTGCATCTGACCAAGGAAGTGCCAAGACTAGAGAGTCCCAACAGTGCCCTTGCTGGGTTAGAAGTCAGTCAAACCGGTTCAATATCTTCCTCCAGGACAAATCC
GACTTGGAAGAGAAGAGGAAGACGTTCTCAATCTGAGGAAATTCAAATTGGAGTATCAAGATCAAAGAAAAGGAAATGTGAGGAAGTGGAAGACAACAAGAAGAAAAAGT
CCAAGGAGACAAAGGAAGTCGAGGAGGTTGTCACGGTTTCTGAACAGAATTTGGCAGTGGCTGTTTCACAGCCCTGCCTGGGATTATGA
Protein sequenceShow/hide protein sequence
MEIDSLIDEWKRFNLLELEKEDIMSFNEEEKLEIKGQTNQYLIGKLLSNRIISKVALKNALSTAWKTRKEFNIEIVDTNTFLFKFESLDDKEWITRNGPWFFDRNLIVLE
APKNDQRTVDIEFNKVDFWIRIINLPIGYRNDIVARKIGNNLGEFLEIDPNRNENPWGNSIRIRVKLDITKPLRRGFMFKKENKTETCWISIRYERVPDFCFHCGLIGHV
AKECSKNSDGEERNNKNFEFGMWMKFQGFARPPKSTETPRNNEEKKANNDPKKNDRDEAGGKSEEGLNVALNLESPMDEEIQDPNLRGQTGVISMPSEELMNSYKRFENW
GMLHLTKEVPRLESPNSALAGLEVSQTGSISSSRTNPTWKRRGRRSQSEEIQIGVSRSKKRKCEEVEDNKKKKSKETKEVEEVVTVSEQNLAVAVSQPCLGL