; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg013917 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg013917
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold3:35103535..35105178
RNA-Seq ExpressionSpg013917
SyntenySpg013917
Gene Ontology termsGO:0008152 - metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4268866.1 unnamed protein product [Prunus armeniaca]2.2e-3532.7Show/hide
Query:  LRLSQKESCGISKEELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPS
        L L+ KE  G+  E       N  L   L   V T++  N +AF++ M +AW   R++ + ++ DN+FL    + + ++ +I    W F++ L+LL TP 
Subjt:  LRLSQKESCGISKEELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPS

Query:  TFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYIC
            P  +V   A +WVQ HN+P+        R +G R+G+ + +      EC G++L +RV+L   +PL R + L    + K ++  KYERLPDFCY C
Subjt:  TFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYIC

Query:  GMLGHNDGSCP-----ENKKSEKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMG
        G +GH    C      E +  EK +GSWL ++ + + R     GR+ + G++  GR   RS G
Subjt:  GMLGHNDGSCP-----ENKKSEKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMG

OMP03234.1 hypothetical protein COLO4_10561 [Corchorus olitorius]1.4e-3735.24Show/hide
Query:  KVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKV
        K+ + R +N D  R V+   W     L +  +G+ +++   +S   ++R+ ++  W FN+ L++L     F   EEI  +   +W+Q H++P+G   E V
Subjt:  KVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKV

Query:  ARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCP-------ENKKSEKQFGSWL
         + +G   GEV++IDT       G+FLR+R  L  N+PL RG+ L     GK+LV  +YE+LPDFCY+CG L H +  C        ++ K +K++G WL
Subjt:  ARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCP-------ENKKSEKQFGSWL

Query:  RAETYHMPRS
        RAE   +PRS
Subjt:  RAETYHMPRS

TXG54301.1 hypothetical protein EZV62_019557 [Acer yangbiense]2.9e-3532.26Show/hide
Query:  EELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTA
        EE+G +G   ++ + L  K+ + R +N +AFR V+   W+   ++ I+ V +N+FL      + R+++ +   W F++CL++L  P    +  ++ F+ A
Subjt:  EELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTA

Query:  IYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGK-VLVGIKYERLPDFCYICGMLGHNDGSCPE
         +W+Q H++PI   N + A+ L  +IGEV++I T+  D C G+FLRV+V++  ++PL R L L    +G  V+VG+KYERLP+FCY CG LGH    CP+
Subjt:  IYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGK-VLVGIKYERLPDFCYICGMLGHNDGSCPE

Query:  NKKSEK-------QFGSWLRA---ETYHMPRSPFTG-------GRSYDSGRRGRGR----FRYRSMGRQNWGSDDSEEE
         +  ++       +FG W+RA       +   P TG       G ++ + R G+ R    FR+++      G D + ++
Subjt:  NKKSEK-------QFGSWLRA---ETYHMPRSPFTG-------GRSYDSGRRGRGR----FRYRSMGRQNWGSDDSEEE

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]2.6e-3631Show/hide
Query:  NLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVP
        N++  +  K+ T++ I+ +A R VM   W        + +G NI+++  +S   + R++    W FN+ L++L +P+  +QP ++ F+   +W+Q HN+P
Subjt:  NLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVP

Query:  IGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCPENKK-----SEK
            + ++A ILG ++G+V +I+ +  D   G F+RVRVK+  ++PL RG+ L       +   ++YE+LPDFCY CG +GH+   C +  K     S +
Subjt:  IGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCPENKK-----SEK

Query:  QFGSWLRA-----------ETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMGRQNWGSDDSEEEENEATLLE
        Q+G WLRA           E        F  G   + GR GRG +R R    +NW   D  E  +   + E
Subjt:  QFGSWLRA-----------ETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMGRQNWGSDDSEEEENEATLLE

XP_024035600.1 uncharacterized protein LOC112096408 [Citrus clementina]2.9e-3531.67Show/hide
Query:  DIVEKLNSLRLSQKESCGIS-KEELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNR
        +++ K  ++ L  +E   IS    + + G  L + + L  K+   R +  +  R  M +AW   ++  ++++GDNIFL    S   + RI+    W F+R
Subjt:  DIVEKLNSLRLSQKESCGIS-KEELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNR

Query:  CLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYE
         L+++  P      ++  F  A++WVQ HN+PI   ++++ + +GGRIG+V +++T++  EC G F+RVR+ +    PL + L L     G++ + I YE
Subjt:  CLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYE

Query:  RLPDFCYICGMLGHNDGSCPENK---KSEKQFGSWLRAET
        +LPDFC+ CG++GH    C + K   K +  +G+W+RA+T
Subjt:  RLPDFCYICGMLGHNDGSCPENK---KSEKQFGSWLRAET

TrEMBL top hitse value%identityAlignment
A0A1R3K847 Uncharacterized protein6.8e-3835.24Show/hide
Query:  KVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKV
        K+ + R +N D  R V+   W     L +  +G+ +++   +S   ++R+ ++  W FN+ L++L     F   EEI  +   +W+Q H++P+G   E V
Subjt:  KVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKV

Query:  ARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCP-------ENKKSEKQFGSWL
         + +G   GEV++IDT       G+FLR+R  L  N+PL RG+ L     GK+LV  +YE+LPDFCY+CG L H +  C        ++ K +K++G WL
Subjt:  ARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCP-------ENKKSEKQFGSWL

Query:  RAETYHMPRS
        RAE   +PRS
Subjt:  RAETYHMPRS

A0A5C7HBM7 CCHC-type domain-containing protein1.4e-3532.26Show/hide
Query:  EELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTA
        EE+G +G   ++ + L  K+ + R +N +AFR V+   W+   ++ I+ V +N+FL      + R+++ +   W F++CL++L  P    +  ++ F+ A
Subjt:  EELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTA

Query:  IYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGK-VLVGIKYERLPDFCYICGMLGHNDGSCPE
         +W+Q H++PI   N + A+ L  +IGEV++I T+  D C G+FLRV+V++  ++PL R L L    +G  V+VG+KYERLP+FCY CG LGH    CP+
Subjt:  IYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGK-VLVGIKYERLPDFCYICGMLGHNDGSCPE

Query:  NKKSEK-------QFGSWLRA---ETYHMPRSPFTG-------GRSYDSGRRGRGR----FRYRSMGRQNWGSDDSEEE
         +  ++       +FG W+RA       +   P TG       G ++ + R G+ R    FR+++      G D + ++
Subjt:  NKKSEK-------QFGSWLRA---ETYHMPRSPFTG-------GRSYDSGRRGRGR----FRYRSMGRQNWGSDDSEEE

A0A6J1D765 uncharacterized protein LOC1110179021.3e-3631Show/hide
Query:  NLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVP
        N++  +  K+ T++ I+ +A R VM   W        + +G NI+++  +S   + R++    W FN+ L++L +P+  +QP ++ F+   +W+Q HN+P
Subjt:  NLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVP

Query:  IGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCPENKK-----SEK
            + ++A ILG ++G+V +I+ +  D   G F+RVRVK+  ++PL RG+ L       +   ++YE+LPDFCY CG +GH+   C +  K     S +
Subjt:  IGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCPENKK-----SEK

Query:  QFGSWLRA-----------ETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMGRQNWGSDDSEEEENEATLLE
        Q+G WLRA           E        F  G   + GR GRG +R R    +NW   D  E  +   + E
Subjt:  QFGSWLRA-----------ETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMGRQNWGSDDSEEEENEATLLE

A0A6J5TXP7 CCHC-type domain-containing protein1.1e-3532.7Show/hide
Query:  LRLSQKESCGISKEELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPS
        L L+ KE  G+  E       N  L   L   V T++  N +AF++ M +AW   R++ + ++ DN+FL    + + ++ +I    W F++ L+LL TP 
Subjt:  LRLSQKESCGISKEELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPS

Query:  TFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYIC
            P  +V   A +WVQ HN+P+        R +G R+G+ + +      EC G++L +RV+L   +PL R + L    + K ++  KYERLPDFCY C
Subjt:  TFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYIC

Query:  GMLGHNDGSCP-----ENKKSEKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMG
        G +GH    C      E +  EK +GSWL ++ + + R     GR+ + G++  GR   RS G
Subjt:  GMLGHNDGSCP-----ENKKSEKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMG

A0A7N2KYZ3 CCHC-type domain-containing protein2.0e-3436.82Show/hide
Query:  NRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGL
        +RLA +  T R +N D+  R     W    +L I ++GDNI L   +    R+R++E   W F++ +++    ST ++   + F  AI+WVQFHN+P   
Subjt:  NRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGL

Query:  RNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPL--CRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSC------PEN-KKSE
         N+     +G  IG+V+++   E D   G FLRVR+ +  ++PL  CR L   G   G   VG+KYERLP+FCY CG L H +  C       EN KK  
Subjt:  RNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPL--CRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSC------PEN-KKSE

Query:  KQFGSWLRAETYHMPRSPFT
        +QFG W+RA+     R   T
Subjt:  KQFGSWLRAETYHMPRSPFT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding5.9e-1029.82Show/hide
Query:  STFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYI
        S FD   + +  T + WV+  N+P    +  +   +   +G  LK+D N ++   GRF RV +++   +PL   + +NG         + YE L   C  
Subjt:  STFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYI

Query:  CGMLGHNDGSCPEN
        CG+ GH   SCP N
Subjt:  CGMLGHNDGSCPEN

AT3G31430.1 unknown protein1.7e-1229.08Show/hide
Query:  DRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNG
        + ++    W FN  +ILL       +P+  +F    +WVQ   +P    N  V   +G  +G+VL  D N        F RV +      PL        
Subjt:  DRIIEESLWCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNG

Query:  GPTGKVLVGIKYERLPDFCYICGMLGHNDGSCPENKKSEKQ
              L+  +YERL  FC +CGML H+ G+C      E+Q
Subjt:  GPTGKVLVGIKYERLPDFCYICGMLGHNDGSCPENKKSEKQ

AT3G42140.1 zinc ion binding;nucleic acid binding4.8e-0439.29Show/hide
Query:  RFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCP
        RFL  R+     E +   L  N G    VL   +YE+L +FC  CGML H+   CP
Subjt:  RFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCP

AT5G36228.1 nucleic acid binding;zinc ion binding7.4e-1328Show/hide
Query:  WCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLV
        W FN   I L     F   + + F     WV    +P+   +E+   I+   +GEV+ +D NE       F+RV+V++ F EPL     +      + ++
Subjt:  WCFNRCLILLATPSTFDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLV

Query:  GIKYERLPDFCYICGMLGHNDGSCP
        G +YE+L   C  C  + H    CP
Subjt:  GIKYERLPDFCYICGMLGHNDGSCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATTGTTGAGAAACTAAATTCACTCAGGCTCTCCCAAAAGGAGTCTTGTGGGATTAGTAAAGAAGAACTAGGCATACTGGGGAATAACTTGAATCTGGAAAATCG
TTTGGCTTGTAAAGTTGCTACGAACCGGATTATCAACACAGATGCGTTCCGAAGAGTAATGTGCAAGGCCTGGAATTTTTGTCGAGACTTAACTATTGATAACGTTGGTG
ATAATATCTTCCTTGTTAACCCGCAATCAGCTCAAGGGCGCGACAGAATTATCGAGGAAAGTCTTTGGTGTTTTAATAGATGTTTGATCCTCCTTGCAACACCATCAACA
TTTGACCAGCCGGAAGAAATTGTGTTCGATACGGCAATATACTGGGTTCAATTCCACAACGTTCCAATCGGACTGAGAAACGAAAAGGTGGCTCGAATTTTAGGTGGTCG
GATAGGCGAGGTCCTTAAAATCGACACCAACGAAGTTGATGAATGTTGCGGTCGTTTCCTCCGTGTGAGAGTGAAACTGCAATTCAACGAACCACTTTGTCGTGGCTTAA
CACTAAACGGCGGCCCAACAGGTAAAGTTTTAGTTGGAATAAAGTATGAACGATTACCCGACTTCTGCTATATTTGTGGAATGCTAGGGCACAATGATGGATCTTGTCCG
GAGAACAAAAAATCGGAGAAACAGTTCGGATCTTGGCTTCGAGCAGAAACTTACCACATGCCCAGAAGCCCATTCACCGGCGGCCGATCTTATGACTCAGGACGGAGAGG
ACGCGGACGATTCAGGTATCGTTCAATGGGCAGGCAGAATTGGGGATCTGATGATTCTGAGGAGGAAGAAAACGAAGCAACACTACTGGAAAGAACAGGCCAATCGGATG
TCAGAAATAAGCTCGCCGAAGAAGACCAGAACCGGCCGGAAAAAGAAGAAACTCAGAGATCACCGGAATCCCATCGACGGAAAAGGAATGAAACTGACCTTCAGCCACAA
TCAATGTCACCGACGCAATCTATGGAAACCGAATCACCCAAAAACGGTAAAAATTCATTTATTGCAGGCTTATTCCCAAAAACGGCTGAGCAAGATGCTGAAGGGGAGAA
ATCGATCCTCAAGGAAGCTCCTTTAATGGCTGAAGATTTATTCCCAAATATGGAGGAGAGAGAAGAGAACCCAATCATGAAGAATCTAATGGATCAAATTGAAGAGAGGG
CTCAACAACATGAATTTGGGCCTGGCATCCTTATGGGCCAAGTGGATTCTGACTTCCAGGAGAATGAAGGAGACAAAGCCCACACAAAAAATACTAATCACATTGGGCCC
TCCAACGAGTTTCACAACATGGAGGTGGATCCCAACTCTTGTCACTCACAGAATGGGGGCCAACGTCAGACTGAAGAAAGGGAAGCGAGCAAAGTCGTGGTGAGGAAAAA
TGCAAAGTCGTGGAAGCGAGCAAACAGAAAGGGAAGTCAATGGGTAACTAAAAATTCCAATTCACAAACTCTTACGAGCTCTATTGGAAAGAAAAGAGACTCAGATGAAC
TCCCTTTGAAAGATAGTGAAGGAAAGAAGATCAAATGTATCACATCAGAGATCGATACGATTATATCGGCGGAGCCTGTTAAACAGGCCCGCCGAGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATTGTTGAGAAACTAAATTCACTCAGGCTCTCCCAAAAGGAGTCTTGTGGGATTAGTAAAGAAGAACTAGGCATACTGGGGAATAACTTGAATCTGGAAAATCG
TTTGGCTTGTAAAGTTGCTACGAACCGGATTATCAACACAGATGCGTTCCGAAGAGTAATGTGCAAGGCCTGGAATTTTTGTCGAGACTTAACTATTGATAACGTTGGTG
ATAATATCTTCCTTGTTAACCCGCAATCAGCTCAAGGGCGCGACAGAATTATCGAGGAAAGTCTTTGGTGTTTTAATAGATGTTTGATCCTCCTTGCAACACCATCAACA
TTTGACCAGCCGGAAGAAATTGTGTTCGATACGGCAATATACTGGGTTCAATTCCACAACGTTCCAATCGGACTGAGAAACGAAAAGGTGGCTCGAATTTTAGGTGGTCG
GATAGGCGAGGTCCTTAAAATCGACACCAACGAAGTTGATGAATGTTGCGGTCGTTTCCTCCGTGTGAGAGTGAAACTGCAATTCAACGAACCACTTTGTCGTGGCTTAA
CACTAAACGGCGGCCCAACAGGTAAAGTTTTAGTTGGAATAAAGTATGAACGATTACCCGACTTCTGCTATATTTGTGGAATGCTAGGGCACAATGATGGATCTTGTCCG
GAGAACAAAAAATCGGAGAAACAGTTCGGATCTTGGCTTCGAGCAGAAACTTACCACATGCCCAGAAGCCCATTCACCGGCGGCCGATCTTATGACTCAGGACGGAGAGG
ACGCGGACGATTCAGGTATCGTTCAATGGGCAGGCAGAATTGGGGATCTGATGATTCTGAGGAGGAAGAAAACGAAGCAACACTACTGGAAAGAACAGGCCAATCGGATG
TCAGAAATAAGCTCGCCGAAGAAGACCAGAACCGGCCGGAAAAAGAAGAAACTCAGAGATCACCGGAATCCCATCGACGGAAAAGGAATGAAACTGACCTTCAGCCACAA
TCAATGTCACCGACGCAATCTATGGAAACCGAATCACCCAAAAACGGTAAAAATTCATTTATTGCAGGCTTATTCCCAAAAACGGCTGAGCAAGATGCTGAAGGGGAGAA
ATCGATCCTCAAGGAAGCTCCTTTAATGGCTGAAGATTTATTCCCAAATATGGAGGAGAGAGAAGAGAACCCAATCATGAAGAATCTAATGGATCAAATTGAAGAGAGGG
CTCAACAACATGAATTTGGGCCTGGCATCCTTATGGGCCAAGTGGATTCTGACTTCCAGGAGAATGAAGGAGACAAAGCCCACACAAAAAATACTAATCACATTGGGCCC
TCCAACGAGTTTCACAACATGGAGGTGGATCCCAACTCTTGTCACTCACAGAATGGGGGCCAACGTCAGACTGAAGAAAGGGAAGCGAGCAAAGTCGTGGTGAGGAAAAA
TGCAAAGTCGTGGAAGCGAGCAAACAGAAAGGGAAGTCAATGGGTAACTAAAAATTCCAATTCACAAACTCTTACGAGCTCTATTGGAAAGAAAAGAGACTCAGATGAAC
TCCCTTTGAAAGATAGTGAAGGAAAGAAGATCAAATGTATCACATCAGAGATCGATACGATTATATCGGCGGAGCCTGTTAAACAGGCCCGCCGAGAGCCATGA
Protein sequenceShow/hide protein sequence
MDIVEKLNSLRLSQKESCGISKEELGILGNNLNLENRLACKVATNRIINTDAFRRVMCKAWNFCRDLTIDNVGDNIFLVNPQSAQGRDRIIEESLWCFNRCLILLATPST
FDQPEEIVFDTAIYWVQFHNVPIGLRNEKVARILGGRIGEVLKIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLNGGPTGKVLVGIKYERLPDFCYICGMLGHNDGSCP
ENKKSEKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRSMGRQNWGSDDSEEEENEATLLERTGQSDVRNKLAEEDQNRPEKEETQRSPESHRRKRNETDLQPQ
SMSPTQSMETESPKNGKNSFIAGLFPKTAEQDAEGEKSILKEAPLMAEDLFPNMEEREENPIMKNLMDQIEERAQQHEFGPGILMGQVDSDFQENEGDKAHTKNTNHIGP
SNEFHNMEVDPNSCHSQNGGQRQTEEREASKVVVRKNAKSWKRANRKGSQWVTKNSNSQTLTSSIGKKRDSDELPLKDSEGKKIKCITSEIDTIISAEPVKQARREP