CuGenDBv2

Lag0036337 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036337
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr3:44549460..44551433
RNA-Seq Expression: Lag0036337
Synteny: Lag0036337
Gene Ontology terms: NA
InterPro domains: IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
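
The genome location listed in this record has the form chromosome:start..end. A minimal Python sketch for splitting such a string and computing the span is given here. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:44549460..44551433")
print(chrom, start, end, span_length(start, end))
```

If the database turned out to use 0-based or half-open coordinates, only `span_length` would need to change.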


Homology
GenBank top hits (e value, % identity, alignment)
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] (e value: 4.4e-54; identity: 40.96%)
Query:  MDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKIN-QGLLVDRLGVNLFIFRFVNEMDRVRVVRQGP
        M  SNL++EW    LTSEE++++V  D  A++ +G+ L   L+ KLL  + +   V++   + AWK++ +   VD +G N+F+F F    DR R++R GP
Subjt:  MDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKIN-QGLLVDRLGVNLFIFRFVNEMDRVRVVRQGP

Query:  WVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLW
        W F++ L+++  P+   KP D  F  VS W+H FDL L   N+TMA R+GNA+G++E+V+S      WG+ LR+RV+ D+  PL RGI++  DGP+ G W
Subjt:  WVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLW

Query:  VPFRYERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTG
        +P +YERLP     C  L HI +DC+         +   QYG WLRF G
Subjt:  VPFRYERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] (e value: 1.5e-62; identity: 44.98%)
Query:  MDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPW
        MD  NL+ +W +  LTSEE+E+++  D +AV  + + L + L+GKLL  + + A+V+ +    AWK+   L V+ +G NLF+F F  E D  RV++ GPW
Subjt:  MDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPW

Query:  VFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWV
         F+K L+VL  P      ++  F+ V+FWIH+FDLP+ W N+TMA R+GNA+G + +VD       WG SLRIRV +D+T PLRRGI+I  DGP+ G W+
Subjt:  VFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWV

Query:  PFRYERLPGICSFCRILGHITRDC-AQFLRSDQGFAPPPQYGDWLRFTG
        P +YERLP  C FC ++GH + DC A++L +        +YG WLRF G
Subjt:  PFRYERLPGICSFCRILGHITRDC-AQFLRSDQGFAPPPQYGDWLRFTG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 5.2e-47; identity: 31.17%)
Query:  NLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKI-NQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFE
        +L++EW    LTSEEEE ++  D  A   +G  L   L+GKL   +P+   VM+   R AWK+ N    V  LG NLF+F F   +DR ++ + GPW F+
Subjt:  NLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKI-NQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFE

Query:  KFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFR
        + L+++  P+  + P++  F+ +  W+  FDLPL    + MA R+GNA+G +EE D  +    WG++LR+RV LD++ PLRRGI++  DGP+ G W+P +
Subjt:  KFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFR

Query:  YERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTGRGMIMQPFAKREPVRSPRRANPGSHRLALDVLSVQDPAAPADELPRRVQRGIRIE
        YERLP  C  C +     +                QYG WLR+ G      P  K+   +       G++  +     V   +      P      I +E
Subjt:  YERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTGRGMIMQPFAKREPVRSPRRANPGSHRLALDVLSVQDPAAPADELPRRVQRGIRIE

Query:  EPGERSMPRQIENLMPAMYSLSGGPVRYSKGKEKV-IEEA----PVFKGGRPTVSCSRRNGCSSSNYRP
         P   +  +  E   P+    S  PV   +G++++ ++E     P  K G P++  S  +  +  +  P
Subjt:  EPGERSMPRQIENLMPAMYSLSGGPVRYSKGKEKV-IEEA----PVFKGGRPTVSCSRRNGCSSSNYRP

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] (e value: 3.9e-42; identity: 40.08%)
Query:  NLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEK
        +L+D    LSLTSEE+ + V    E+          CL+GKLL  +P   E M+    + W+  +G+ V  +G NLF+F F + +D+ RV+  GPW F+K
Subjt:  NLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEK

Query:  FLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRY
         LL+L      ++P+D   + V FW+HV +LPL   N+ + E +GNAVG + ++D  +G + WG ++RIRV LD+  PLRRG+++        +WV F+Y
Subjt:  FLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRY

Query:  ERLPGICSFCRILGHITRDCAQFLRSDQGF-APPPQYGDWLR
        ERLP  C FC  LGH  R+C   L S  G      QYG WLR
Subjt:  ERLPGICSFCRILGHITRDCAQFLRSDQGF-APPPQYGDWLR

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis] (e value: 2.5e-41; identity: 39.18%)
Query:  NLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLG---FCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWV
        +LVD    LSLTSEE+ +     R   D +  ++G    CL+GKLL  +P   E M+    + W+  +G+ V  +G NLF+F F + +D+ RV+  GPW 
Subjt:  NLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLG---FCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWV

Query:  FEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVP
        F+K LL+L      ++P+D   + V FW+HV +LPL   N+ + + +GNAVG + ++D  +G + WG ++RIRV +D+  PLRRG+++        +WV 
Subjt:  FEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVP

Query:  FRYERLPGICSFCRILGHITRDCAQFLR-SDQGFAPPPQYGDWLR
        F+YERLP  C FC  LGH  R+C   L  +D       QYG WLR
Subjt:  FRYERLPGICSFCRILGHITRDCAQFLR-SDQGFAPPPQYGDWLR

TrEMBL top hits (e value, % identity, alignment)
A0A2N9E6P9 CCHC-type domain-containing protein (e value: 2.7e-41; identity: 39.83%)
Query:  LVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEKF
        LV+EW + SLT E+E    T + +A+  S      CLLGKL+  KP     ++      W + +G     +G NLFIF+F +E +R RV+   PW+F  +
Subjt:  LVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEKF

Query:  LLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRYE
        LL L         A   FS   FW+ +  +PL++  +   +R+GNA+G   +VD   G + WG  LRIR+ LD T P+ RG R+        LWV F+YE
Subjt:  LLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRYE

Query:  RLPGICSFCRILGHITRDCAQFLRSD-QGFAPPPQYGDWLR
        RLP +C FC +LGH  RDC   LRS  QG     QYG WLR
Subjt:  RLPGICSFCRILGHITRDCAQFLRSD-QGFAPPPQYGDWLR

A0A6J1BSZ1 uncharacterized protein LOC111005481 (e value: 2.1e-54; identity: 40.96%)
Query:  MDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKIN-QGLLVDRLGVNLFIFRFVNEMDRVRVVRQGP
        M  SNL++EW    LTSEE++++V  D  A++ +G+ L   L+ KLL  + +   V++   + AWK++ +   VD +G N+F+F F    DR R++R GP
Subjt:  MDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKIN-QGLLVDRLGVNLFIFRFVNEMDRVRVVRQGP

Query:  WVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLW
        W F++ L+++  P+   KP D  F  VS W+H FDL L   N+TMA R+GNA+G++E+V+S      WG+ LR+RV+ D+  PL RGI++  DGP+ G W
Subjt:  WVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLW

Query:  VPFRYERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTG
        +P +YERLP     C  L HI +DC+         +   QYG WLRF G
Subjt:  VPFRYERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTG

A0A6J1DU55 uncharacterized protein LOC111023135 (e value: 7.3e-63; identity: 44.98%)
Query:  MDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPW
        MD  NL+ +W +  LTSEE+E+++  D +AV  + + L + L+GKLL  + + A+V+ +    AWK+   L V+ +G NLF+F F  E D  RV++ GPW
Subjt:  MDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPW

Query:  VFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWV
         F+K L+VL  P      ++  F+ V+FWIH+FDLP+ W N+TMA R+GNA+G + +VD       WG SLRIRV +D+T PLRRGI+I  DGP+ G W+
Subjt:  VFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWV

Query:  PFRYERLPGICSFCRILGHITRDC-AQFLRSDQGFAPPPQYGDWLRFTG
        P +YERLP  C FC ++GH + DC A++L +        +YG WLRF G
Subjt:  PFRYERLPGICSFCRILGHITRDC-AQFLRSDQGFAPPPQYGDWLRFTG

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 2.5e-47; identity: 31.17%)
Query:  NLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKI-NQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFE
        +L++EW    LTSEEEE ++  D  A   +G  L   L+GKL   +P+   VM+   R AWK+ N    V  LG NLF+F F   +DR ++ + GPW F+
Subjt:  NLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKI-NQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFE

Query:  KFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFR
        + L+++  P+  + P++  F+ +  W+  FDLPL    + MA R+GNA+G +EE D  +    WG++LR+RV LD++ PLRRGI++  DGP+ G W+P +
Subjt:  KFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFR

Query:  YERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTGRGMIMQPFAKREPVRSPRRANPGSHRLALDVLSVQDPAAPADELPRRVQRGIRIE
        YERLP  C  C +     +                QYG WLR+ G      P  K+   +       G++  +     V   +      P      I +E
Subjt:  YERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTGRGMIMQPFAKREPVRSPRRANPGSHRLALDVLSVQDPAAPADELPRRVQRGIRIE

Query:  EPGERSMPRQIENLMPAMYSLSGGPVRYSKGKEKV-IEEA----PVFKGGRPTVSCSRRNGCSSSNYRP
         P   +  +  E   P+    S  PV   +G++++ ++E     P  K G P++  S  +  +  +  P
Subjt:  EPGERSMPRQIENLMPAMYSLSGGPVRYSKGKEKV-IEEA----PVFKGGRPTVSCSRRNGCSSSNYRP

A0A6P6S3G2 uncharacterized protein LOC113687293 (e value: 1.5e-39; identity: 38.2%)
Query:  LSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEKFLLVLVLP
        L L  +EEE+ + P  +A D    L   C+LGKL   K    E +    +  W  ++GL    LG NLF+F+F + +D+ +V   GPW F+  LLV+   
Subjt:  LSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEKFLLVLVLP

Query:  IRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRYERLPGICS
        I  ++  +      SFW+ V++LPL W N   AE +GN +GVYE  + R     WG  LRIRVK+ L  PL+R + ++ +G +    V F+YERLP +C 
Subjt:  IRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRYERLPGICS

Query:  FCRILGHITRDCAQFLRSDQGFAPPPQYGDWLR
        +C  +GH  RDC   L +       PQYG WLR
Subjt:  FCRILGHITRDCAQFLRSDQGFAPPPQYGDWLR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G31430.1 unknown protein (e value: 8.6e-16; identity: 25.3%)
Query:  RMRWNLEDSSIFVVKQVCWHICLLFFFPLLADSMDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKI
        + R+ + +S+  +  +   H  L F+ PL   S    NL      ++L  ++   ++  D   V+ +     F L G+ + P+      +  +    W  
Subjt:  RMRWNLEDSSIFVVKQVCWHICLLFFFPLLADSMDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLLCPKPLGAEVMQKNFRAAWKI

Query:  NQGLLVDR-LGVNLFIFRFVNEMDRVRVVRQGPWVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLF
          GL+  R +    F F F  E     V+R+GPW F  ++++L    +  +P    F F+ FW+ +  +P  + N+ + E IG A+G   + D     + 
Subjt:  NQGLLVDR-LGVNLFIFRFVNEMDRVRVVRQGPWVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEEVDSRNGFLF

Query:  WGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRYERLPGICSFCRILGH
             R+ +  D+THPLR          ++ L + FRYERL G C  C +L H
Subjt:  WGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRYERLPGICSFCRILGH

AT5G36228.1 nucleic acid binding;zinc ion binding (e value: 7.3e-15; identity: 24.35%)
Query:  LLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFN
        LLG++L P+    E         W +   +    L    F  RF +E+D +  +R+ PWVF ++   + L      P +   +F+  W+H+  +PL + +
Subjt:  LLGKLLCPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFN

Query:  QTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLR--RGIRIYPDGPLSGLWVPFRYERLPGICSFCRILGHITRDCAQFLRSDQ
        +   E I + +G    +D         T +R++V++D T PLR  R +R           + F YE+L  +C+ C  + H    C   +  ++
Subjt:  QTMAERIGNAVGVYEEVDSRNGFLFWGTSLRIRVKLDLTHPLR--RGIRIYPDGPLSGLWVPFRYERLPGICSFCRILGHITRDCAQFLRSDQ


Sequences
CDS sequence
ATGGAGCACCGGTTTATTTTTCAGTGGGATACGTTTTCAAGTCTGGAGATTCTCTCCAGCGGATTTGTAGCTTTCTTGCTACTATTTCAGCGGATGCGATGGAACCTGGA
AGACTCATCGATTTTTGTGGTCAAGCAGGTCTGTTGGCATATCTGTCTGCTCTTTTTCTTTCCTTTGCTTGCTGACTCAATGGATCCTTCAAATTTGGTGGATGAGTGGG
CGCAGTTGAGTCTTACTTCGGAGGAAGAAGAACTTTCGGTCACTCCAGACCGTGAAGCTGTAGATCGTTCCGGGGAATTGCTCGGTTTTTGCTTGTTGGGAAAACTGCTA
TGTCCCAAGCCTCTCGGTGCGGAAGTTATGCAGAAAAACTTCAGGGCAGCGTGGAAGATTAATCAAGGATTACTAGTGGATCGGCTGGGCGTGAATCTGTTCATTTTCCG
ATTCGTAAATGAGATGGATCGAGTTAGGGTGGTTCGTCAGGGGCCGTGGGTCTTTGAGAAATTTTTGCTGGTGTTAGTGCTCCCTATTCGTGGTTTGAAGCCTGCTGATC
ATCCATTTTCATTTGTATCCTTCTGGATTCATGTGTTTGATTTACCCCTAGATTGGTTTAATCAAACCATGGCGGAACGCATTGGCAATGCAGTGGGAGTCTATGAGGAA
GTCGATAGCCGTAACGGTTTCTTATTCTGGGGTACAAGTTTGCGCATCCGAGTTAAACTTGATTTAACTCATCCTCTCCGGCGTGGCATTCGTATTTATCCTGATGGTCC
TCTTAGCGGCCTGTGGGTTCCTTTCAGGTATGAGCGTCTACCGGGAATTTGTTCCTTTTGTCGCATTCTTGGTCATATTACTCGTGATTGTGCCCAGTTCCTTAGATCGG
ACCAAGGTTTTGCTCCTCCCCCACAATATGGGGATTGGTTACGTTTTACAGGGAGAGGGATGATTATGCAACCGTTTGCGAAAAGGGAACCTGTTCGTAGCCCGCGAAGA
GCGAATCCAGGGAGTCATCGATTAGCTCTTGATGTGTTGTCGGTTCAAGACCCAGCTGCTCCAGCGGATGAGTTGCCCAGACGGGTCCAAAGGGGTATCCGAATTGAAGA
ACCAGGCGAAAGGAGTATGCCAAGGCAAATTGAAAATTTGATGCCGGCGATGTATTCTCTTTCGGGCGGTCCGGTGAGGTATTCTAAAGGAAAGGAGAAGGTTATTGAAG
AGGCGCCTGTGTTCAAAGGCGGTCGTCCAACGGTCTCGTGTTCCAGGCGGAATGGATGCTCCTCTTCGAATTATCGACCGGATTTAACCATAGCGGGAGTTCAGGCCACC
GTTCCGGTGGGCGGTTCTTCAACGGTGGGAGAACATTCAATGCTGAAGACTGATTATGTGCAATCAATGAAGGATTGCAACGCTCGACTGTCAAAGAATTTACTTTCCAT
TTTCAAGAAGGCATCCGTTTCAACTATGGTCGTAACGTCTGAATCGAGCAAGGGAGTAACGCCAGAACATGGCCACGTGCTGGAGGCTGAGGATGATTTGGAAAGTGATC
CTGAGATGACTGAAATGGTTGAGGAGGAGGTGTTGGCCATAGAGGATGCTGGTATTGAGGCCCAAAACGATTTTGGTGGTCTATCTGAGGCCCATTCGGAGGCTGATATT
GATGATGCCTGTGAGGCCCAGCCGCAGGACTTTTTGGAGCAGGCCGAAGTGGATGGAATTTCGTCTACTGGGCTTCAGGGTGAAGGATTTGTTTTCACTTCACAAAGGTC
GTTGATTCAATGTGCTGCTAGGGGACCTACTTGGAAAAAGAGAGCACGTGCTGGGAAAGTTCCTTTGGGCTTGAGTTTGGAAGGTATGAATGAATTTCAGAAGCGGAAAG
ATGGTCCGATTTTGTTCTCTCCCGGGAACATTAAGCGTCCTAAATTTGCTAATGATGTTTGTAAAAAGGCGAGGGTTGCTGAGCAGCCTCGCCTTGCGCCATGA
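
The CDS can be checked against the protein sequence in this record by translating it codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code applies (the record does not say which table was used). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first 18 and last 12 nucleotides of the CDS are used as sample input.

```python
# Standard genetic code laid out in the conventional T, C, A, G order:
# the amino acid for codon b1 b2 b3 sits at index 16*i1 + 4*i2 + i3.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons and last four codons of the CDS sequence in this record.
print(translate("ATGGAGCACCGGTTTATT"))
print(translate("CTTGCGCCATGA"))
```

Joining all CDS lines into one string and passing it to `translate` would give the full protein followed by `*` for the terminal stop codon.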
mRNA sequence
ATGGAGCACCGGTTTATTTTTCAGTGGGATACGTTTTCAAGTCTGGAGATTCTCTCCAGCGGATTTGTAGCTTTCTTGCTACTATTTCAGCGGATGCGATGGAACCTGGA
AGACTCATCGATTTTTGTGGTCAAGCAGGTCTGTTGGCATATCTGTCTGCTCTTTTTCTTTCCTTTGCTTGCTGACTCAATGGATCCTTCAAATTTGGTGGATGAGTGGG
CGCAGTTGAGTCTTACTTCGGAGGAAGAAGAACTTTCGGTCACTCCAGACCGTGAAGCTGTAGATCGTTCCGGGGAATTGCTCGGTTTTTGCTTGTTGGGAAAACTGCTA
TGTCCCAAGCCTCTCGGTGCGGAAGTTATGCAGAAAAACTTCAGGGCAGCGTGGAAGATTAATCAAGGATTACTAGTGGATCGGCTGGGCGTGAATCTGTTCATTTTCCG
ATTCGTAAATGAGATGGATCGAGTTAGGGTGGTTCGTCAGGGGCCGTGGGTCTTTGAGAAATTTTTGCTGGTGTTAGTGCTCCCTATTCGTGGTTTGAAGCCTGCTGATC
ATCCATTTTCATTTGTATCCTTCTGGATTCATGTGTTTGATTTACCCCTAGATTGGTTTAATCAAACCATGGCGGAACGCATTGGCAATGCAGTGGGAGTCTATGAGGAA
GTCGATAGCCGTAACGGTTTCTTATTCTGGGGTACAAGTTTGCGCATCCGAGTTAAACTTGATTTAACTCATCCTCTCCGGCGTGGCATTCGTATTTATCCTGATGGTCC
TCTTAGCGGCCTGTGGGTTCCTTTCAGGTATGAGCGTCTACCGGGAATTTGTTCCTTTTGTCGCATTCTTGGTCATATTACTCGTGATTGTGCCCAGTTCCTTAGATCGG
ACCAAGGTTTTGCTCCTCCCCCACAATATGGGGATTGGTTACGTTTTACAGGGAGAGGGATGATTATGCAACCGTTTGCGAAAAGGGAACCTGTTCGTAGCCCGCGAAGA
GCGAATCCAGGGAGTCATCGATTAGCTCTTGATGTGTTGTCGGTTCAAGACCCAGCTGCTCCAGCGGATGAGTTGCCCAGACGGGTCCAAAGGGGTATCCGAATTGAAGA
ACCAGGCGAAAGGAGTATGCCAAGGCAAATTGAAAATTTGATGCCGGCGATGTATTCTCTTTCGGGCGGTCCGGTGAGGTATTCTAAAGGAAAGGAGAAGGTTATTGAAG
AGGCGCCTGTGTTCAAAGGCGGTCGTCCAACGGTCTCGTGTTCCAGGCGGAATGGATGCTCCTCTTCGAATTATCGACCGGATTTAACCATAGCGGGAGTTCAGGCCACC
GTTCCGGTGGGCGGTTCTTCAACGGTGGGAGAACATTCAATGCTGAAGACTGATTATGTGCAATCAATGAAGGATTGCAACGCTCGACTGTCAAAGAATTTACTTTCCAT
TTTCAAGAAGGCATCCGTTTCAACTATGGTCGTAACGTCTGAATCGAGCAAGGGAGTAACGCCAGAACATGGCCACGTGCTGGAGGCTGAGGATGATTTGGAAAGTGATC
CTGAGATGACTGAAATGGTTGAGGAGGAGGTGTTGGCCATAGAGGATGCTGGTATTGAGGCCCAAAACGATTTTGGTGGTCTATCTGAGGCCCATTCGGAGGCTGATATT
GATGATGCCTGTGAGGCCCAGCCGCAGGACTTTTTGGAGCAGGCCGAAGTGGATGGAATTTCGTCTACTGGGCTTCAGGGTGAAGGATTTGTTTTCACTTCACAAAGGTC
GTTGATTCAATGTGCTGCTAGGGGACCTACTTGGAAAAAGAGAGCACGTGCTGGGAAAGTTCCTTTGGGCTTGAGTTTGGAAGGTATGAATGAATTTCAGAAGCGGAAAG
ATGGTCCGATTTTGTTCTCTCCCGGGAACATTAAGCGTCCTAAATTTGCTAATGATGTTTGTAAAAAGGCGAGGGTTGCTGAGCAGCCTCGCCTTGCGCCATGA
Protein sequence
MEHRFIFQWDTFSSLEILSSGFVAFLLLFQRMRWNLEDSSIFVVKQVCWHICLLFFFPLLADSMDPSNLVDEWAQLSLTSEEEELSVTPDREAVDRSGELLGFCLLGKLL
CPKPLGAEVMQKNFRAAWKINQGLLVDRLGVNLFIFRFVNEMDRVRVVRQGPWVFEKFLLVLVLPIRGLKPADHPFSFVSFWIHVFDLPLDWFNQTMAERIGNAVGVYEE
VDSRNGFLFWGTSLRIRVKLDLTHPLRRGIRIYPDGPLSGLWVPFRYERLPGICSFCRILGHITRDCAQFLRSDQGFAPPPQYGDWLRFTGRGMIMQPFAKREPVRSPRR
ANPGSHRLALDVLSVQDPAAPADELPRRVQRGIRIEEPGERSMPRQIENLMPAMYSLSGGPVRYSKGKEKVIEEAPVFKGGRPTVSCSRRNGCSSSNYRPDLTIAGVQAT
VPVGGSSTVGEHSMLKTDYVQSMKDCNARLSKNLLSIFKKASVSTMVVTSESSKGVTPEHGHVLEAEDDLESDPEMTEMVEEEVLAIEDAGIEAQNDFGGLSEAHSEADI
DDACEAQPQDFLEQAEVDGISSTGLQGEGFVFTSQRSLIQCAARGPTWKKRARAGKVPLGLSLEGMNEFQKRKDGPILFSPGNIKRPKFANDVCKKARVAEQPRLAP
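
The InterPro entry IPR025836 in this record names its zinc knuckle by the spacing pattern CX2CX4HX4C. As a rough illustration, that spacing can be written as a regular expression and searched for in the protein sequence. This is only a sketch: the function name `find_zinc_knuckles` is hypothetical, the regex is a literal reading of the pattern name rather than the InterPro signature itself, and a short fragment of the protein is used as input.

```python
import re

# Literal reading of "CX2CX4HX4C": Cys, any 2 residues, Cys,
# any 4 residues, His, any 4 residues, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein):
    """Return (1-based start, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.group(0)) for m in ZINC_KNUCKLE.finditer(protein)]


# Fragment copied from the protein sequence in this record.
FRAGMENT = "VPFRYERLPGICSFCRILGHITRDCAQFLRSDQ"
print(find_zinc_knuckles(FRAGMENT))
```

A regex hit only shows that the residue spacing is present; it is not a substitute for the profile-based domain assignment that InterPro reports.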