; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012228 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012228
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr1:38841872..38843593
RNA-Seq ExpressionLag0012228
SyntenyLag0012228
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.5e-5342.17Show/hide
Query:  MALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPA-GLTVEKLGPNLFLFSLRSEEEQARIIRQGP
        MA  +++  W+NF LT+EE+   VD+D  A E T + L  SLI KLLS   I+  V++   K AW +     +V+ +G N+FLF+     ++ RI+R GP
Subjt:  MALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPA-GLTVEKLGPNLFLFSLRSEEEQARIIRQGP

Query:  WLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCW
        W F++ L+++  P+ + KP  M+F+ V+ WVHF++L +   N +MA RL NAIG FE+ ++      W   LRVRV  D+ KPL R IK+ LD PMG CW
Subjt:  WLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCW

Query:  SPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTG
         PI+YE+LPD   +CG + H ++DCS     V S S+  +YG WL++ G
Subjt:  SPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTG

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]3.8e-3836.68Show/hide
Query:  LDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLF
        +D++   WE+F  T +E  T V +DR    +T+ ++   ++ KL +   I+ + +R   KS W +      E LG N+++   +S  E++R++  GPW F
Subjt:  LDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLF

Query:  NKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWK-ESLRVRVTLDITKPLRRCIKVILDEPMGSCWSP
        NK LLVL+ P    +P  M F F AFW+  + +P E  +  MA  L   +G  EE +  G   GW    +RVRV +D++KPLRR IK + +      W P
Subjt:  NKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWK-ESLRVRVTLDITKPLRRCIKVILDEPMGSCWSP

Query:  IRYEKLPDLCDYCGTIGHGVRDCS--AYYLAVGSPSQCNRYGAWLQYTGRTTTLFRSLS
        +RYEKLPD C  CG IGH  R+C   +  +   SP Q   YG WL    R T L +S+S
Subjt:  IRYEKLPDLCDYCGTIGHGVRDCS--AYYLAVGSPSQCNRYGAWLQYTGRTTTLFRSLS

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]7.2e-6145.53Show/hide
Query:  DDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLFN
        ++++  W+ F LT+EE+   +DVD  A ++  + L +SL+GKLL+   I+ DV+ +    AW +   LTVE +G NLFLF    E +  R+++ GPW F+
Subjt:  DDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLFN

Query:  KFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCWSPIR
        K L+VL KP      + +EF  VAFW+H ++LPM   N +MA RL NAIG F + D   +   W  SLR+RV +DITKPLRR IK+ +D PMG CW PI+
Subjt:  KFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCWSPIR

Query:  YEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQC-NRYGAWLQYTG
        YE+LPD C +CG IGH   DC A YLA    S+  + YG WL++ G
Subjt:  YEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQC-NRYGAWLQYTG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.7e-4635.96Show/hide
Query:  MALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIP-AGLTVEKLGPNLFLFSLRSEEEQARIIRQGP
        MA  D++  W+NF LT+EEE T +DVD  A   T   L   L+GKL     I   VM+   ++AW +      V+ LG NLFLFS     ++ +I + GP
Subjt:  MALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIP-AGLTVEKLGPNLFLFSLRSEEEQARIIRQGP

Query:  WLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCW
        W F++ L++++KP+ ++ P+ ++F  +  WV F++LP+      MA RL NA+G FEE D       W  +LRVRV LDI+KPLRR IK+ LD P+G  W
Subjt:  WLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCW

Query:  SPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTGRT-TTLFRSLSTSPMSMNRMAVDHPSTLQSPPIAGNVGRSRLLGQISGSSC
         PI+YE+LPD C +CG               + S  + ++YG+WL+Y G    T+ +        +++   +  S+  SP  AG+ G        +G   
Subjt:  SPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTGRT-TTLFRSLSTSPMSMNRMAVDHPSTLQSPPIAGNVGRSRLLGQISGSSC

Query:  TKISSPRTDS---GAKP
          + SP T++   GA+P
Subjt:  TKISSPRTDS---GAKP

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]2.6e-3436Show/hide
Query:  DDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFS---LIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPW
        D ++      +LT+EE+     V R   E TS  +G S   L+GKLL+      + M+    S W    G+ V  +G NLF+F      ++ R++  GPW
Subjt:  DDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFS---LIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPW

Query:  LFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRF--EEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILD--EPMG
         F+K LL+L +  P V+P+ ++   V FWVH   LP+ L N  +   + NA+G+F   +Y++GG +  W  ++R+RV LD+ KPLRR +K+ L   EP+ 
Subjt:  LFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRF--EEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILD--EPMG

Query:  SCWSPIRYEKLPDLCDYCGTIGHGVRDC-SAYYLAVGSPSQCNRYGAWLQ
          W   +YE+LP  C +CG +GH  R+C      A GS     +YGAWL+
Subjt:  SCWSPIRYEKLPDLCDYCGTIGHGVRDC-SAYYLAVGSPSQCNRYGAWLQ

TrEMBL top hitse value%identityAlignment
A0A2N9GWE9 Uncharacterized protein3.6e-3433.2Show/hide
Query:  DDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAW-NIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLF
        D+++  W  F+LT E EG  V +   A E++       L+GKL +  +   + ++      W  +  G+T   +G NLF+F  R + E+ R++   PWLF
Subjt:  DDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAW-NIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLF

Query:  NKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCWSPI
        +  LL+L +       + ++F +  FWV FY +P+         ++ +  G+ EE D     +GW   LRVR+ LDITKP+ R  +V+    +G  W   
Subjt:  NKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCWSPI

Query:  RYEKLPDLCDYCGTIGHGVRDC-SAYYLAVGSPSQCNRYGAWLQ
        +YE+LP LC +CG IGH  RDC S       SP    +YG WL+
Subjt:  RYEKLPDLCDYCGTIGHGVRDC-SAYYLAVGSPSQCNRYGAWLQ

A0A6J1BSZ1 uncharacterized protein LOC1110054817.0e-5442.17Show/hide
Query:  MALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPA-GLTVEKLGPNLFLFSLRSEEEQARIIRQGP
        MA  +++  W+NF LT+EE+   VD+D  A E T + L  SLI KLLS   I+  V++   K AW +     +V+ +G N+FLF+     ++ RI+R GP
Subjt:  MALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPA-GLTVEKLGPNLFLFSLRSEEEQARIIRQGP

Query:  WLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCW
        W F++ L+++  P+ + KP  M+F+ V+ WVHF++L +   N +MA RL NAIG FE+ ++      W   LRVRV  D+ KPL R IK+ LD PMG CW
Subjt:  WLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCW

Query:  SPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTG
         PI+YE+LPD   +CG + H ++DCS     V S S+  +YG WL++ G
Subjt:  SPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTG

A0A6J1D765 uncharacterized protein LOC1110179021.9e-3836.68Show/hide
Query:  LDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLF
        +D++   WE+F  T +E  T V +DR    +T+ ++   ++ KL +   I+ + +R   KS W +      E LG N+++   +S  E++R++  GPW F
Subjt:  LDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLF

Query:  NKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWK-ESLRVRVTLDITKPLRRCIKVILDEPMGSCWSP
        NK LLVL+ P    +P  M F F AFW+  + +P E  +  MA  L   +G  EE +  G   GW    +RVRV +D++KPLRR IK + +      W P
Subjt:  NKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWK-ESLRVRVTLDITKPLRRCIKVILDEPMGSCWSP

Query:  IRYEKLPDLCDYCGTIGHGVRDCS--AYYLAVGSPSQCNRYGAWLQYTGRTTTLFRSLS
        +RYEKLPD C  CG IGH  R+C   +  +   SP Q   YG WL    R T L +S+S
Subjt:  IRYEKLPDLCDYCGTIGHGVRDCS--AYYLAVGSPSQCNRYGAWLQYTGRTTTLFRSLS

A0A6J1DU55 uncharacterized protein LOC1110231353.5e-6145.53Show/hide
Query:  DDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLFN
        ++++  W+ F LT+EE+   +DVD  A ++  + L +SL+GKLL+   I+ DV+ +    AW +   LTVE +G NLFLF    E +  R+++ GPW F+
Subjt:  DDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNLFLFSLRSEEEQARIIRQGPWLFN

Query:  KFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCWSPIR
        K L+VL KP      + +EF  VAFW+H ++LPM   N +MA RL NAIG F + D   +   W  SLR+RV +DITKPLRR IK+ +D PMG CW PI+
Subjt:  KFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCWSPIR

Query:  YEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQC-NRYGAWLQYTG
        YE+LPD C +CG IGH   DC A YLA    S+  + YG WL++ G
Subjt:  YEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQC-NRYGAWLQYTG

A0A6J1DX30 uncharacterized protein LOC1110248748.3e-4735.96Show/hide
Query:  MALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIP-AGLTVEKLGPNLFLFSLRSEEEQARIIRQGP
        MA  D++  W+NF LT+EEE T +DVD  A   T   L   L+GKL     I   VM+   ++AW +      V+ LG NLFLFS     ++ +I + GP
Subjt:  MALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIP-AGLTVEKLGPNLFLFSLRSEEEQARIIRQGP

Query:  WLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCW
        W F++ L++++KP+ ++ P+ ++F  +  WV F++LP+      MA RL NA+G FEE D       W  +LRVRV LDI+KPLRR IK+ LD P+G  W
Subjt:  WLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVILDEPMGSCW

Query:  SPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTGRT-TTLFRSLSTSPMSMNRMAVDHPSTLQSPPIAGNVGRSRLLGQISGSSC
         PI+YE+LPD C +CG               + S  + ++YG+WL+Y G    T+ +        +++   +  S+  SP  AG+ G        +G   
Subjt:  SPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTGRT-TTLFRSLSTSPMSMNRMAVDHPSTLQSPPIAGNVGRSRLLGQISGSSC

Query:  TKISSPRTDS---GAKP
          + SP T++   GA+P
Subjt:  TKISSPRTDS---GAKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein3.9e-1227.78Show/hide
Query:  FLFSLRSEEEQARIIRQGPWLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDIT
        F+F+L  EE    ++R+GPW FN ++++L +  P +      F F+ FWV    +P +  N  +   +  A+G+  + D    ++   +  RV +  DIT
Subjt:  FLFSLRSEEEQARIIRQGPWLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDIT

Query:  KPLRRCIKVILDEPMGSCWSPIRYEKLPDLCDYCGTIGHGVRDC
         PLR          + +     RYE+L   C+ CG + H    C
Subjt:  KPLRRCIKVILDEPMGSCWSPIRYEKLPDLCDYCGTIGHGVRDC

AT3G42140.1 zinc ion binding;nucleic acid binding3.7e-0723.94Show/hide
Query:  FSLRSEEEQARIIRQGPWLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKP
        F  +SEE    I+R+GPW FN ++ V+ +   +   +  EFK + FW+    +P+    A +   +   +G F E + G                     
Subjt:  FSLRSEEEQARIIRQGPWLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKP

Query:  LRRCIKVILDEPMGSCWSPIRYEKLPDLCDYCGTIGHGVRDC
          R + V+            +YEKL + C  CG + H   +C
Subjt:  LRRCIKVILDEPMGSCWSPIRYEKLPDLCDYCGTIGHGVRDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCAATCCATCGCAATCGGTGCAGTCGAAAATCTCTGTTGTTTTTTGCAGGTTTTGTTCACGTCAGAGTTCCCATTTTTGCCCCCATGGCCCTTGACGACATGAT
CGGTCATTGGGAAAATTTCAACCTAACTGCTGAGGAGGAAGGCACCGAGGTTGATGTAGATCGACAAGCAGCGGAGATCACAAGCAGATCGTTGGGGTTTAGTCTTATTG
GCAAACTCCTATCTCCTCATTTTATTGCAGGTGATGTCATGCGGAAAAATTTCAAATCCGCTTGGAATATACCAGCGGGTCTCACTGTGGAGAAATTAGGACCAAATCTG
TTTCTTTTTTCTCTACGGTCGGAAGAGGAGCAGGCTCGTATCATCCGTCAGGGACCCTGGCTCTTTAACAAGTTTTTACTTGTTCTTTCCAAACCAATCCCAATGGTTAA
ACCAACAGCCATGGAGTTCAAATTTGTGGCGTTCTGGGTACACTTTTATGAACTTCCGATGGAGCTGTTCAACGCATCAATGGCGGCACGTCTCGAAAACGCTATTGGAC
GTTTCGAGGAATATGACAATGGAGGGCGTATCCTGGGATGGAAGGAAAGTTTACGTGTCCGGGTTACTCTCGATATCACAAAACCCCTTCGAAGGTGTATTAAAGTTATT
CTCGACGAACCTATGGGGAGTTGCTGGAGTCCAATTCGATACGAAAAGCTCCCGGATCTATGCGATTATTGTGGGACTATTGGACATGGCGTGAGAGATTGTAGCGCTTA
CTATCTTGCGGTCGGATCTCCATCTCAGTGCAACCGCTACGGAGCGTGGCTTCAGTATACGGGTCGAACTACCACTCTGTTTCGATCCCTAAGCACAAGCCCGATGAGCA
TGAACAGAATGGCGGTGGATCATCCCTCTACACTCCAGTCGCCTCCAATCGCCGGGAATGTTGGCCGTTCTCGACTTCTTGGCCAAATCAGCGGATCCTCATGTACTAAA
ATTAGCTCCCCGCGGACGGATTCCGGCGCAAAGCCTATGGATATTTGGCCGGCGACGGTGGAGAATCAGACTGTGCCGTCGGTGGCATTAAATAGCGATTTTAATCGCGG
ATTGAATAAGGGCAAAGCGGAAGTTAATGCGGCTGAAGCAAAAAAGGTAAAAAAGAAACTTGAATATAATGATTTCTTTGTCGTTACTAACTCTCAGCCATTAATGGCGT
CAACTGTGGAGGATTTTGGGGCAATAAATCAGGGGCTGAATAACGGCTTACCTTCAAGCTTTCAACATCCACAAGTGAAGCTTCCAGATGAGTTGAGCCGGCCTGATCTC
AATGGGCCAATTAGTGGGCCGAATAAAGAGGAATTAAAGCAGCAAGCGGGCCTACTTCAATCGGCCAACCAATTTCAGCAACCAAATTCTAATGCAGATTTCAAATTTTC
GAATGGGCCCCAAATTCGCCAACCAGTCCAGCAGACCCAATCTTTTGCAGATCTGATTCCACAGCAACCGCAAGTTGTCCTGGGCTTACCAAATTCGAAAACATGGAAGC
GTATGGCCCGAACTAGTAATGTGGACTCTGGGACATCGTCAAGTGGAGATGGGATTAAGAAAAGAATGGGAGATGGCATGCTGAATGGTAATAAGAAACGTGCTCGAACT
GAGGAAGAGGATTCGTCTGATCATGAAACTCCAACGGTGGAGGCTGTTGAACAGCCCCGCCGAGAACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCAATCCATCGCAATCGGTGCAGTCGAAAATCTCTGTTGTTTTTTGCAGGTTTTGTTCACGTCAGAGTTCCCATTTTTGCCCCCATGGCCCTTGACGACATGAT
CGGTCATTGGGAAAATTTCAACCTAACTGCTGAGGAGGAAGGCACCGAGGTTGATGTAGATCGACAAGCAGCGGAGATCACAAGCAGATCGTTGGGGTTTAGTCTTATTG
GCAAACTCCTATCTCCTCATTTTATTGCAGGTGATGTCATGCGGAAAAATTTCAAATCCGCTTGGAATATACCAGCGGGTCTCACTGTGGAGAAATTAGGACCAAATCTG
TTTCTTTTTTCTCTACGGTCGGAAGAGGAGCAGGCTCGTATCATCCGTCAGGGACCCTGGCTCTTTAACAAGTTTTTACTTGTTCTTTCCAAACCAATCCCAATGGTTAA
ACCAACAGCCATGGAGTTCAAATTTGTGGCGTTCTGGGTACACTTTTATGAACTTCCGATGGAGCTGTTCAACGCATCAATGGCGGCACGTCTCGAAAACGCTATTGGAC
GTTTCGAGGAATATGACAATGGAGGGCGTATCCTGGGATGGAAGGAAAGTTTACGTGTCCGGGTTACTCTCGATATCACAAAACCCCTTCGAAGGTGTATTAAAGTTATT
CTCGACGAACCTATGGGGAGTTGCTGGAGTCCAATTCGATACGAAAAGCTCCCGGATCTATGCGATTATTGTGGGACTATTGGACATGGCGTGAGAGATTGTAGCGCTTA
CTATCTTGCGGTCGGATCTCCATCTCAGTGCAACCGCTACGGAGCGTGGCTTCAGTATACGGGTCGAACTACCACTCTGTTTCGATCCCTAAGCACAAGCCCGATGAGCA
TGAACAGAATGGCGGTGGATCATCCCTCTACACTCCAGTCGCCTCCAATCGCCGGGAATGTTGGCCGTTCTCGACTTCTTGGCCAAATCAGCGGATCCTCATGTACTAAA
ATTAGCTCCCCGCGGACGGATTCCGGCGCAAAGCCTATGGATATTTGGCCGGCGACGGTGGAGAATCAGACTGTGCCGTCGGTGGCATTAAATAGCGATTTTAATCGCGG
ATTGAATAAGGGCAAAGCGGAAGTTAATGCGGCTGAAGCAAAAAAGGTAAAAAAGAAACTTGAATATAATGATTTCTTTGTCGTTACTAACTCTCAGCCATTAATGGCGT
CAACTGTGGAGGATTTTGGGGCAATAAATCAGGGGCTGAATAACGGCTTACCTTCAAGCTTTCAACATCCACAAGTGAAGCTTCCAGATGAGTTGAGCCGGCCTGATCTC
AATGGGCCAATTAGTGGGCCGAATAAAGAGGAATTAAAGCAGCAAGCGGGCCTACTTCAATCGGCCAACCAATTTCAGCAACCAAATTCTAATGCAGATTTCAAATTTTC
GAATGGGCCCCAAATTCGCCAACCAGTCCAGCAGACCCAATCTTTTGCAGATCTGATTCCACAGCAACCGCAAGTTGTCCTGGGCTTACCAAATTCGAAAACATGGAAGC
GTATGGCCCGAACTAGTAATGTGGACTCTGGGACATCGTCAAGTGGAGATGGGATTAAGAAAAGAATGGGAGATGGCATGCTGAATGGTAATAAGAAACGTGCTCGAACT
GAGGAAGAGGATTCGTCTGATCATGAAACTCCAACGGTGGAGGCTGTTGAACAGCCCCGCCGAGAACCATGA
Protein sequenceShow/hide protein sequence
MVAIHRNRCSRKSLLFFAGFVHVRVPIFAPMALDDMIGHWENFNLTAEEEGTEVDVDRQAAEITSRSLGFSLIGKLLSPHFIAGDVMRKNFKSAWNIPAGLTVEKLGPNL
FLFSLRSEEEQARIIRQGPWLFNKFLLVLSKPIPMVKPTAMEFKFVAFWVHFYELPMELFNASMAARLENAIGRFEEYDNGGRILGWKESLRVRVTLDITKPLRRCIKVI
LDEPMGSCWSPIRYEKLPDLCDYCGTIGHGVRDCSAYYLAVGSPSQCNRYGAWLQYTGRTTTLFRSLSTSPMSMNRMAVDHPSTLQSPPIAGNVGRSRLLGQISGSSCTK
ISSPRTDSGAKPMDIWPATVENQTVPSVALNSDFNRGLNKGKAEVNAAEAKKVKKKLEYNDFFVVTNSQPLMASTVEDFGAINQGLNNGLPSSFQHPQVKLPDELSRPDL
NGPISGPNKEELKQQAGLLQSANQFQQPNSNADFKFSNGPQIRQPVQQTQSFADLIPQQPQVVLGLPNSKTWKRMARTSNVDSGTSSSGDGIKKRMGDGMLNGNKKRART
EEEDSSDHETPTVEAVEQPRREP