CuGenDBv2

Spg021630 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg021630
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold2:16605967..16607568
RNA-Seq Expression: Spg021630
Synteny: Spg021630
Gene Ontology terms: NA
InterPro domains:
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
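As a quick sanity check on the record above, the genome location string can be split into scaffold, start and end to get the length of the gene span. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for the `scaffold:start..end` notation, but not stated on this page); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    length = end - start + 1  # assumption: 1-based, inclusive coordinates
    return scaffold, start, end, length

print(parse_location("scaffold2:16605967..16607568"))
# ('scaffold2', 16605967, 16607568, 1602)
```

Under that assumption the span comes to 1,602 bp, the same length as the CDS listed under Sequences, which would be consistent with a single-exon gene model.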


Homology
GenBank top hits (e value, % identity, alignment)
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] (e value: 5.9e-49, % identity: 40.57)
Query:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSE-LTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDK
        ++E W+ F  T+EE+ I VD+D  A + T K L  SLI KLL+ R I+  V++   K AW +  +  +V+ +G N+F+F+     D+ RILR GPW FD+
Subjt:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSE-LTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDK

Query:  FVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRY
         ++++  P+ + KP  M+F+ V  W+HF DL +   N +MA RLGNAIG FE+ ++          L VRV  D+ KPL R IK+NLD   G CW  I+Y
Subjt:  FVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRY

Query:  EKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWMQFSG
        E+L D    CG ++H ++DCS   +  +  S+  QYG W++F G
Subjt:  EKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWMQFSG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] (e value: 3.1e-58, % identity: 46.31)
Query:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKF
        ++ +W+KF  T+EE+ I +DVD  A K   + L +SL+GKLLA RII+ DV+ R    AW +  +LTVE +G+NLF+F    E D  R+++ GPW FDK 
Subjt:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKF

Query:  VLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYE
        ++VL KP      + +EF  V FWIH  DLPM   N +MA RLGNAIG F + D    G +   SL +RV +DI+KPLRR IK+N+D   G CW  I+YE
Subjt:  VLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYE

Query:  KLFDLCSFCGIIEHNVRDCSSFYMADEPPSQ-RNQYGMWMQFSG
        +L D C FCG+I H+  DC + Y+A +  S+  ++YG W++F G
Subjt:  KLFDLCSFCGIIEHNVRDCSSFYMADEPPSQ-RNQYGMWMQFSG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 1.4e-42, % identity: 35.15)
Query:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSE-LTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDK
        ++E W+ F  T+EEE   +DVD  A   T   L   L+GKL   R I   VM+   + AW + +    V+ LG NLF+FS     D+ +I + GPW FD+
Subjt:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSE-LTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDK

Query:  FVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRY
         +++++KP+ ++ P+ ++F  +P W+ F DLP+      MA RLGNA+G FEE D          +L VRV LDISKPLRR IK+NLD   G  W  I+Y
Subjt:  FVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRY

Query:  EKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWMQFSG------------------KAPNVFRSPSTSPLGKTDMAIDVVDHSTPNA
        E+L D C  CG+                   +++QYG W+++ G                  K+ N   S STSP+G     +     + P A
Subjt:  EKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWMQFSG------------------KAPNVFRSPSTSPLGKTDMAIDVVDHSTPNA

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] (e value: 3.1e-34, % identity: 30.64)
Query:  NFTAEEEGIIVDVDRQAAKATSKSLGFS---LIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLS
        + T+EE+ ++    R   ++TS  +G S   L+GKLL  R    + M+    + W     + V  +G NLF+F      D+ R+L  GPW FDK +L+L 
Subjt:  NFTAEEEGIIVDVDRQAAKATSKSLGFS---LIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLS

Query:  KPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDL
        +  P V+P+ ++   V FW+H C+LP+   N  + E +GNA+G+F + D   GG+    ++ +RVALD+ KPLRR +K+ L  +    W   +YE+L   
Subjt:  KPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDL

Query:  CSFCGIIEHNVRDC-SSFYMADEPPSQRNQYGMWMQFSGKAPNVFRSPSTSPLGKTDMAIDVVDHSTPNAATRLPVGHRAGEGNTDNGSPMNVFDSR
        C FCG + H+ R+C      AD       QYG W++         +S  +   G  D  +      T      +P+     + N + G+P+    +R
Subjt:  CSFCGIIEHNVRDC-SSFYMADEPPSQRNQYGMWMQFSGKAPNVFRSPSTSPLGKTDMAIDVVDHSTPNAATRLPVGHRAGEGNTDNGSPMNVFDSR

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis] (e value: 7.0e-34, % identity: 30.54)
Query:  NFTAEEEGIIVDVDRQAAKATSKSLGFS---LIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLS
        + T+EE+ ++    R    +TS  +G S   L+GKLL  R    + M+    + W     + V  +G NLF+F      D+ R+L  GPW FDK +L+L 
Subjt:  NFTAEEEGIIVDVDRQAAKATSKSLGFS---LIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLS

Query:  KPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDL
        +  P V+P+ ++   V FW+H C+LP+   N  + + +GNA+G+F + D   GG+A   ++ +RVA+D+ KPLRR +K+ L  S    W   +YE+L   
Subjt:  KPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDL

Query:  CSFCGIIEHNVRDC-SSFYMADEPPSQRNQYGMWMQFSGKAPNVFRSPSTSPLGKTDMAIDVVDHSTPNAATRLPVGHRAGEGNTDNGSPMNVFDSRS
        C FCG + H+ R+C      AD       QYG W++         +S  +   G  D  +      T      +P+     + N + G+P+    +R+
Subjt:  CSFCGIIEHNVRDC-SSFYMADEPPSQRNQYGMWMQFSGKAPNVFRSPSTSPLGKTDMAIDVVDHSTPNAATRLPVGHRAGEGNTDNGSPMNVFDSRS

TrEMBL top hits (e value, % identity, alignment)
A0A5C7H9Y2 CCHC-type domain-containing protein (e value: 5.1e-30, % identity: 33.33)
Query:  EEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLSKPIPMVK
        ++ G I  +     +   +SL  SLIGK +  ++I  +  +      W   +E+T+E +G N+F F  +   D+ RIL  GPWLFDK +LVL +     K
Subjt:  EEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLSKPIPMVK

Query:  PTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDLCSFCGII
         T ++F+YVPFWI   +LP+   N  +   LG  +G+ +E D G  G    + + +RV +D+  PL+R ++V L +        I YE+L + C +CG I
Subjt:  PTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDLCSFCGII

Query:  EHNVRDCSSFYMADEPPSQRNQYGMWMQ
         H VRDC      +   S   ++G WM+
Subjt:  EHNVRDCSSFYMADEPPSQRNQYGMWMQ

A0A6J1BSZ1 uncharacterized protein LOC111005481 (e value: 2.9e-49, % identity: 40.57)
Query:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSE-LTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDK
        ++E W+ F  T+EE+ I VD+D  A + T K L  SLI KLL+ R I+  V++   K AW +  +  +V+ +G N+F+F+     D+ RILR GPW FD+
Subjt:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSE-LTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDK

Query:  FVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRY
         ++++  P+ + KP  M+F+ V  W+HF DL +   N +MA RLGNAIG FE+ ++          L VRV  D+ KPL R IK+NLD   G CW  I+Y
Subjt:  FVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRY

Query:  EKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWMQFSG
        E+L D    CG ++H ++DCS   +  +  S+  QYG W++F G
Subjt:  EKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWMQFSG

A0A6J1DU55 uncharacterized protein LOC111023135 (e value: 1.5e-58, % identity: 46.31)
Query:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKF
        ++ +W+KF  T+EE+ I +DVD  A K   + L +SL+GKLLA RII+ DV+ R    AW +  +LTVE +G+NLF+F    E D  R+++ GPW FDK 
Subjt:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKF

Query:  VLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYE
        ++VL KP      + +EF  V FWIH  DLPM   N +MA RLGNAIG F + D    G +   SL +RV +DI+KPLRR IK+N+D   G CW  I+YE
Subjt:  VLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYE

Query:  KLFDLCSFCGIIEHNVRDCSSFYMADEPPSQ-RNQYGMWMQFSG
        +L D C FCG+I H+  DC + Y+A +  S+  ++YG W++F G
Subjt:  KLFDLCSFCGIIEHNVRDCSSFYMADEPPSQ-RNQYGMWMQFSG

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 6.8e-43, % identity: 35.15)
Query:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSE-LTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDK
        ++E W+ F  T+EEE   +DVD  A   T   L   L+GKL   R I   VM+   + AW + +    V+ LG NLF+FS     D+ +I + GPW FD+
Subjt:  MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSE-LTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDK

Query:  FVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRY
         +++++KP+ ++ P+ ++F  +P W+ F DLP+      MA RLGNA+G FEE D          +L VRV LDISKPLRR IK+NLD   G  W  I+Y
Subjt:  FVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRY

Query:  EKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWMQFSG------------------KAPNVFRSPSTSPLGKTDMAIDVVDHSTPNA
        E+L D C  CG+                   +++QYG W+++ G                  K+ N   S STSP+G     +     + P A
Subjt:  EKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWMQFSG------------------KAPNVFRSPSTSPLGKTDMAIDVVDHSTPNA

A0A803LK12 Uncharacterized protein (e value: 1.2e-31, % identity: 36.78)
Query:  MMENWEKFNFTAEEEGII--VDVDRQAAKATSKSLGFSLIGKLLAPRIIAGD-VMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLF
        ++ +WE F  T EE  I+    ++   AK+T K L  SL+GK+L  +    D +M+R  K  W +  ++ V  +  NLF+F    EED++R+L   PW F
Subjt:  MMENWEKFNFTAEEEGII--VDVDRQAAKATSKSLGFSLIGKLLAPRIIAGD-VMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLF

Query:  DKFVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASI
        DK +L+L       +P+ + F+  PFW+   D+     N + A  +G+A+G F EYD     L   E + ++V LDI+KPLRR IKV    S  S W S 
Subjt:  DKFVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASI

Query:  RYEKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWM
        +YE+L D   FCG + H  +DC      DE  S   QYG +M
Subjt:  RYEKLFDLCSFCGIIEHNVRDCSSFYMADEPPSQRNQYGMWM

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G31430.1 unknown protein (e value: 1.9e-13, % identity: 29.66)
Query:  FIFSLKWEEDQVRILRQGPWLFDKFVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDIS
        FIF+L  EE    +LR+GPW F+ ++++L +     +P +  F ++PFW+    +P    N  + E +G A+G+  + D     +AR +   V +  DI+
Subjt:  FIFSLKWEEDQVRILRQGPWLFDKFVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDIS

Query:  KPLRRCIKVNLDESHG-SCWASIRYEKLFDLCSFCGIIEHNVRDC
         PLR   + +   + G +     RYE+L   C  CG++ H+   C
Subjt:  KPLRRCIKVNLDESHG-SCWASIRYEKLFDLCSFCGIIEHNVRDC

AT3G42140.1 zinc ion binding;nucleic acid binding (e value: 4.1e-08, % identity: 23.6)
Query:  PSELTVEKLGQNLFIFSLKW----EEDQVRILRQGPWLFDKFVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGG
        P  +  E +G+ L I  +++    EE    ILR+GPW F+ ++ V+ +   +      EFK +PFWI    +P+      +   +G  +G F        
Subjt:  PSELTVEKLGQNLFIFSLKW----EEDQVRILRQGPWLFDKFVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGG

Query:  GLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDLCSFCGIIEHNVRDC
               L   +  D+S                      +YEKL + C+ CG++ H+  +C
Subjt:  GLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDLCSFCGIIEHNVRDC

AT5G36228.1 nucleic acid binding;zinc ion binding (e value: 1.5e-10, % identity: 22.28)
Query:  SLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPY
        SL+G++L P+  + +         W + +++    L    F    + E D +  LR+ PW+F+++ + L +      PT     ++  W+H   +P+   
Subjt:  SLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLSKPIPMVKPTMMEFKYVPFWIHFCDLPMDPY

Query:  NLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDLCSFCGIIEHNVRDC
        +    E + + +G     D      ++   + V+V +D ++PLR   +V    S         YEKL  +C+ C  + H V  C
Subjt:  NLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDLCSFCGIIEHNVRDC


Sequences
CDS sequence
ATGATGGAAAATTGGGAAAAATTCAATTTTACGGCTGAGGAAGAAGGAATTATAGTCGACGTGGACAGACAAGCTGCAAAAGCAACAAGTAAGTCTTTGGGTTTTAGCTT
GATCGGGAAGTTACTGGCGCCACGCATCATTGCTGGAGATGTGATGAGGAGAAATTTCAAAGCTGCTTGGAACATCCCCAGTGAGCTCACGGTGGAAAAGCTTGGTCAGA
ATCTCTTTATTTTCTCTTTGAAGTGGGAGGAGGATCAAGTGCGCATTCTTCGACAAGGTCCATGGTTGTTTGACAAATTTGTGCTGGTATTGTCTAAGCCAATTCCTATG
GTCAAGCCAACGATGATGGAATTCAAATACGTGCCATTTTGGATCCATTTTTGTGATCTTCCTATGGATCCTTACAACCTTTCAATGGCAGAAAGATTGGGCAATGCAAT
TGGCCGCTTTGAGGAATATGATAATGGTGGCGGGGGTCTTGCTCGGAAAGAAAGCCTTCATGTGCGAGTTGCTCTTGATATATCTAAACCTCTTCGCAGATGTATCAAAG
TCAATTTGGATGAATCTCATGGGAGCTGTTGGGCATCAATTCGTTATGAAAAGCTATTCGACCTTTGCTCGTTTTGTGGCATAATTGAACATAATGTCAGAGATTGCAGC
TCGTTCTATATGGCCGACGAGCCTCCATCACAGAGAAACCAATACGGGATGTGGATGCAATTTTCTGGTAAGGCTCCAAATGTTTTTCGATCACCAAGCACAAGTCCTTT
AGGGAAAACTGATATGGCGATTGATGTTGTCGATCACTCTACTCCAAATGCTGCGACACGGCTACCGGTGGGTCATCGAGCCGGTGAAGGGAACACCGACAATGGTTCTC
CGATGAACGTATTCGATAGTAGGTCAATGGACGTCTCGTCAGCGGTGGCTGAAAACGTTACTCTTCCACAGACGGCAAATAATTTAGAATTGAATACTGAAGCTGGCAAA
ATTAATGTTGATGAAAATTCAAAAGTGAAAAAACGTTTGGATTATGATTCTTTTTCATTGGAAATTAATAAATTGCAAGAGAAAAATAAGGACGATTTAAAGATGAAGGA
GAAAATCACGCAGACGATGGTGGGTTTCAATTTTCAGCCGAATTTTGAGCAGATGCCTTCGAAACCACCCATGAATTCAGATATCTTTGGGGCATCACGTGCCAGCTATT
TTTCCAATGACCAACCGAATGGAATCAATTCATCTGGGTCGCCTTCAATGAACCAGAAGTTGCAGACTTACCCGGGAAACACGATTATGCAGGCCATTGAATTACATTTT
GGGTCGCACATCACGATTAAACCCGAGGGGTTGTTTATGCAAAATCCACCGAATACGGGCTTCATGAAGGAGGATCGATCTGGGCTAAAGGTATGGAAACGCAGAGCTCG
GGCTTCAATATCAGACCCCACTGGAGTCTCACAGATGGATAATCAAAAGAAAAGAGCTGGGAATGGAATTCTGGGTGGCAGTAGGAAGCGTGTGAGGATTGATGAGGACA
ACCAGAATGTGCAAATGGAACCATCGGCGGAGATTGCGGAGCAGTCCCGTCGGGAGCCATGA
mRNA sequence
ATGATGGAAAATTGGGAAAAATTCAATTTTACGGCTGAGGAAGAAGGAATTATAGTCGACGTGGACAGACAAGCTGCAAAAGCAACAAGTAAGTCTTTGGGTTTTAGCTT
GATCGGGAAGTTACTGGCGCCACGCATCATTGCTGGAGATGTGATGAGGAGAAATTTCAAAGCTGCTTGGAACATCCCCAGTGAGCTCACGGTGGAAAAGCTTGGTCAGA
ATCTCTTTATTTTCTCTTTGAAGTGGGAGGAGGATCAAGTGCGCATTCTTCGACAAGGTCCATGGTTGTTTGACAAATTTGTGCTGGTATTGTCTAAGCCAATTCCTATG
GTCAAGCCAACGATGATGGAATTCAAATACGTGCCATTTTGGATCCATTTTTGTGATCTTCCTATGGATCCTTACAACCTTTCAATGGCAGAAAGATTGGGCAATGCAAT
TGGCCGCTTTGAGGAATATGATAATGGTGGCGGGGGTCTTGCTCGGAAAGAAAGCCTTCATGTGCGAGTTGCTCTTGATATATCTAAACCTCTTCGCAGATGTATCAAAG
TCAATTTGGATGAATCTCATGGGAGCTGTTGGGCATCAATTCGTTATGAAAAGCTATTCGACCTTTGCTCGTTTTGTGGCATAATTGAACATAATGTCAGAGATTGCAGC
TCGTTCTATATGGCCGACGAGCCTCCATCACAGAGAAACCAATACGGGATGTGGATGCAATTTTCTGGTAAGGCTCCAAATGTTTTTCGATCACCAAGCACAAGTCCTTT
AGGGAAAACTGATATGGCGATTGATGTTGTCGATCACTCTACTCCAAATGCTGCGACACGGCTACCGGTGGGTCATCGAGCCGGTGAAGGGAACACCGACAATGGTTCTC
CGATGAACGTATTCGATAGTAGGTCAATGGACGTCTCGTCAGCGGTGGCTGAAAACGTTACTCTTCCACAGACGGCAAATAATTTAGAATTGAATACTGAAGCTGGCAAA
ATTAATGTTGATGAAAATTCAAAAGTGAAAAAACGTTTGGATTATGATTCTTTTTCATTGGAAATTAATAAATTGCAAGAGAAAAATAAGGACGATTTAAAGATGAAGGA
GAAAATCACGCAGACGATGGTGGGTTTCAATTTTCAGCCGAATTTTGAGCAGATGCCTTCGAAACCACCCATGAATTCAGATATCTTTGGGGCATCACGTGCCAGCTATT
TTTCCAATGACCAACCGAATGGAATCAATTCATCTGGGTCGCCTTCAATGAACCAGAAGTTGCAGACTTACCCGGGAAACACGATTATGCAGGCCATTGAATTACATTTT
GGGTCGCACATCACGATTAAACCCGAGGGGTTGTTTATGCAAAATCCACCGAATACGGGCTTCATGAAGGAGGATCGATCTGGGCTAAAGGTATGGAAACGCAGAGCTCG
GGCTTCAATATCAGACCCCACTGGAGTCTCACAGATGGATAATCAAAAGAAAAGAGCTGGGAATGGAATTCTGGGTGGCAGTAGGAAGCGTGTGAGGATTGATGAGGACA
ACCAGAATGTGCAAATGGAACCATCGGCGGAGATTGCGGAGCAGTCCCGTCGGGAGCCATGA
Protein sequence
MMENWEKFNFTAEEEGIIVDVDRQAAKATSKSLGFSLIGKLLAPRIIAGDVMRRNFKAAWNIPSELTVEKLGQNLFIFSLKWEEDQVRILRQGPWLFDKFVLVLSKPIPM
VKPTMMEFKYVPFWIHFCDLPMDPYNLSMAERLGNAIGRFEEYDNGGGGLARKESLHVRVALDISKPLRRCIKVNLDESHGSCWASIRYEKLFDLCSFCGIIEHNVRDCS
SFYMADEPPSQRNQYGMWMQFSGKAPNVFRSPSTSPLGKTDMAIDVVDHSTPNAATRLPVGHRAGEGNTDNGSPMNVFDSRSMDVSSAVAENVTLPQTANNLELNTEAGK
INVDENSKVKKRLDYDSFSLEINKLQEKNKDDLKMKEKITQTMVGFNFQPNFEQMPSKPPMNSDIFGASRASYFSNDQPNGINSSGSPSMNQKLQTYPGNTIMQAIELHF
GSHITIKPEGLFMQNPPNTGFMKEDRSGLKVWKRRARASISDPTGVSQMDNQKKRAGNGILGGSRKRVRIDEDNQNVQMEPSAEIAEQSRREP
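
The relationship between the CDS and protein sequences above can be spot-checked by translating a few codons. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` and the table-building code are illustrative helpers written for this example, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 30 and last 15 nucleotides of the CDS sequence listed above.
print(translate("ATGATGGAAAATTGGGAAAAATTCAATTTT"))  # MMENWEKFNF
print(translate("CGTCGGGAGCCATGA"))                 # RREP*
```

The two fragments agree with the start (`MMENWEKFNF`) and end (`RREP`, followed by the stop codon) of the protein sequence.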