; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G004510 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G004510
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein OBERON 4-like
Genome locationCmo_Chr08:2783368..2789839
RNA-Seq ExpressionCmoCh08G004510
SyntenyCmoCh08G004510
Gene Ontology termsGO:0032502 - developmental process (biological process)
InterPro domainsIPR044254 - BRCT domain-containing protein At4g02110-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571720.1 hypothetical protein SDJN03_28448, partial [Cucurbita argyrosperma subsp. sororia]1.7e-7868.13Show/hide
Query:  DSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNPKSET
        D      H+     KSPRIVNCPS +HGGSG+DHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLE+E ALAS+PESN EPE +PKS+T
Subjt:  DSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNPKSET

Query:  GCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE--------
         CEAESFPESEDKL++E+HLESDNGQREVESE+L+QV++ S+V+EAE LDKS+DVAKE E  SDD GLSES D+SI LGN TKDERDVVADE        
Subjt:  GCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE--------

Query:  -------------------VQLDDECKENKGLDQEVNTKNFDVSDKEVEKE
                           VQLDDECKE+KG+D EV T++FDV DKE+ K+
Subjt:  -------------------VQLDDECKENKGLDQEVNTKNFDVSDKEVEKE

KAG6592293.1 hypothetical protein SDJN03_14639, partial [Cucurbita argyrosperma subsp. sororia]8.3e-10264.87Show/hide
Query:  RFGSFAVSCKQFPNEGEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGE
        R GS AVSCKQ PNEGEFLDYQSSELRTA+SECF THS GWDSKSPRIVNCPS                                               
Subjt:  RFGSFAVSCKQFPNEGEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGE

Query:  LEPDLEVETALASEPESNLEPEPNPKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDA-GLSE
            LE ETALAS PE NLEP+P+PKS+ GCEAESF ESEDKLVEE+HLESDNG+REVESE+LDQVQKDS VVEAELLDKS+D   E +  +D+   L +
Subjt:  LEPDLEVETALASEPESNLEPEPNPKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDA-GLSE

Query:  SWDISIDLGNCTKDERDVVADEVQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKKKSGRTAQEQDD
        S     + GN   D+++ +   VQLDDECKE+K LDQEV T++FDVSDKEVEKEMSDGETTKTTK  TQNF D+G RVAETSKNK+KKKKSGRTAQ++DD
Subjt:  SWDISIDLGNCTKDERDVVADEVQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKKKSGRTAQEQDD

Query:  LDKVLAELGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTE
        LDK+LAELGE PTISK ADPPLSSQ AKVENPPDLVTPP A  +KEAEEEST+
Subjt:  LDKVLAELGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTE

KAG6596896.1 hypothetical protein SDJN03_10076, partial [Cucurbita argyrosperma subsp. sororia]5.6e-8270.97Show/hide
Query:  DSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNPKSET
        DSE    H+      SPRIVNCPSP+HG SGSDHEF VELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVET  ASEPE N EPEP+PKS+T
Subjt:  DSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNPKSET

Query:  GCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE--------
        GCEAESFP+SEDKLVEE+HLESDNGQREVESE+LDQV++ S+V+EAE LDKS+DVAKE +A SDD GLSES D+SI LGN TKDERDVVADE        
Subjt:  GCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE--------

Query:  -------------------VQLDDECKENKGLDQEVNTKNFDVSDKEV
                           VQLDDECKE+KG+D EV T++FDVSDKE+
Subjt:  -------------------VQLDDECKENKGLDQEVNTKNFDVSDKEV

KAG6599050.1 Metalloendoproteinase 3-MMP, partial [Cucurbita argyrosperma subsp. sororia]3.4e-7972.96Show/hide
Query:  KSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESFPESEDKL
        KSPRIVNCPS +H GSGSDHE SVELDPKSTNPQLSSW LN+DEKKSKGPNKR+EGELEPDLEVETALASEPESN EPEP+PKS+TGCEAESFPESEDK 
Subjt:  KSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESFPESEDKL

Query:  VEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE----------------------
        VEE+HLESDNGQREV+SE+LDQVQ+ S+V+EAE L+KS+DVAKE EA S+D GLSES DISI LGN TKDERDVVADE                      
Subjt:  VEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE----------------------

Query:  -----VQLDDECKENKGLDQEVNTKNFDVSDKE
             VQLDDECKE+KG+D +V T++FD+SDKE
Subjt:  -----VQLDDECKENKGLDQEVNTKNFDVSDKE

XP_023003327.1 uncharacterized protein LOC111496965 [Cucurbita maxima]9.6e-8255.86Show/hide
Query:  GSFAVSCKQFPNEGEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELE
        GS AVSCKQ PNEGEFLDYQSSELRTAD ECF THS GWDSKSP IVNCPS                                                 
Subjt:  GSFAVSCKQFPNEGEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELE

Query:  PDLEVETALASEPESNLEPEPN--PKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSES
          LE ETALASEPE NLEPEP   PK ETGCEAESFP+SEDKLVEE+HLESDNGQREVESE+LD+VQK ++VVEAE LDKS+DVAKE EACSDDA LSES
Subjt:  PDLEVETALASEPESNLEPEPN--PKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSES

Query:  WDISIDLGNCTKDERDVVADEVQ------LDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKKKSGRTA
         D+SIDLGNC+KDERDVV DE        LDDECKE+K L+QEV TK+F   DKEVEKE+ +                                      
Subjt:  WDISIDLGNCTKDERDVVADEVQ------LDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKKKSGRTA

Query:  QEQDDLDKVLAELGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTETTVARKKKK
                          ISKPADPPLSSQ AKVENPPDLVTPP A  EKE EEESTE   A KKKK
Subjt:  QEQDDLDKVLAELGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTETTVARKKKK

TrEMBL top hitse value%identityAlignment
A0A0A0K8A2 Uncharacterized protein1.2e-4555.41Show/hide
Query:  EDEKKSKGPN-KREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLD
        ED +   G N + EEGELEPD E E A+  E E N+EPE  PKSE GCEAESFPESEDKL  E+HLE+DN QRE+ESE+  + QK SIV E ELLDK  D
Subjt:  EDEKKSKGPN-KREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLD

Query:  VAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE---------------------------VQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSD
        + K  E CSDDAGLSES ++S +  NCTKDE DVVADE                           VQLD  CKE+KG+D ++ TK+FDV  K+VEKE+SD
Subjt:  VAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE---------------------------VQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSD

Query:  GETTKTTKASTQNFVDEGIRVA
        GE TK ++A TQNF D+G  VA
Subjt:  GETTKTTKASTQNFVDEGIRVA

A0A6J1H741 protein OBERON 4-like7.2e-6760.97Show/hide
Query:  SVELDPKSTNPQLSSWALNEDEKKSKGP-----------------------------NKREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESF
        S E     T+P L+S  +   E KSK P                             ++ EEGELEPD E ETALASEPE NLEPEP PKSETGCEAESF
Subjt:  SVELDPKSTNPQLSSWALNEDEKKSKGP-----------------------------NKREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESF

Query:  PESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE---------------
        PESEDK+VEE+HLESDNGQ EVESE+LDQVQK S+VVEAELLDKS+DVAKE EACSDDAGLSES D+SIDLGNC K+ERDVVADE               
Subjt:  PESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE---------------

Query:  ------------VQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVA
                    VQLDDECKE+KG+DQEV T++FDVS+KEVEKEMSDGETTK T A TQNF D+G  VA
Subjt:  ------------VQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVA

A0A6J1HBU0 uncharacterized protein LOC1114620203.0e-6553.44Show/hide
Query:  RFGSFAVSCKQFPNEGEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPS------------------------------PKHGG----SGSDHEFS
        R  S AVSCK+FP+E    D    E  +  +    +  + + S   + +N PS                              P  G     S   H  S
Subjt:  RFGSFAVSCKQFPNEGEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPS------------------------------PKHGG----SGSDHEFS

Query:  VELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQV
        V  +  +T+PQLSS ALNEDEKKSK P K EEGELE D E E AL SEP+ NLEPE +PK +TG +AESFPE EDKLVEE+H ESDNGQREVESE+LDQV
Subjt:  VELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQV

Query:  QKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE---------------------------VQLDDECKENKGLDQEVN
        QK S+VVEAELLDKS++VAKE EACSDD GLSES D+SIDL NCTKDERDVVADE                           VQLDD+CK++KG+DQEV 
Subjt:  QKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE---------------------------VQLDDECKENKGLDQEVN

Query:  TKNFDVSDKEVEKEMSDGET
        TKNFDVS KEVEKEMSDGET
Subjt:  TKNFDVSDKEVEKEMSDGET

A0A6J1KM51 uncharacterized protein LOC1114969654.6e-8255.86Show/hide
Query:  GSFAVSCKQFPNEGEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELE
        GS AVSCKQ PNEGEFLDYQSSELRTAD ECF THS GWDSKSP IVNCPS                                                 
Subjt:  GSFAVSCKQFPNEGEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELE

Query:  PDLEVETALASEPESNLEPEPN--PKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSES
          LE ETALASEPE NLEPEP   PK ETGCEAESFP+SEDKLVEE+HLESDNGQREVESE+LD+VQK ++VVEAE LDKS+DVAKE EACSDDA LSES
Subjt:  PDLEVETALASEPESNLEPEPN--PKSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSES

Query:  WDISIDLGNCTKDERDVVADEVQ------LDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKKKSGRTA
         D+SIDLGNC+KDERDVV DE        LDDECKE+K L+QEV TK+F   DKEVEKE+ +                                      
Subjt:  WDISIDLGNCTKDERDVVADEVQ------LDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKKKSGRTA

Query:  QEQDDLDKVLAELGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTETTVARKKKK
                          ISKPADPPLSSQ AKVENPPDLVTPP A  EKE EEESTE   A KKKK
Subjt:  QEQDDLDKVLAELGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTETTVARKKKK

A0A6J1KQE8 protein OBERON 4-like3.8e-6861.71Show/hide
Query:  SVELDPKSTNPQLSSWALNEDEKKSKGP-----------------------------NKREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESF
        S E     T+P L+S  +   E KSK P                             ++ EEGELEPD E ETALASEPE NLEPEP+PKSETGCEAESF
Subjt:  SVELDPKSTNPQLSSWALNEDEKKSKGP-----------------------------NKREEGELEPDLEVETALASEPESNLEPEPNPKSETGCEAESF

Query:  PESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE---------------
        P+SEDKLVEE+HLESDNGQREVESE+LDQVQK S+VVEAELLDKS+DVAKE EACSDDAGLSES D+SIDLGNC K+ERDVVADE               
Subjt:  PESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADE---------------

Query:  ------------VQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVA
                    VQLD+ECKE+KG+DQEVNT++F VSDKEVEKEMSDGETTKTT A TQNF D+G  VA
Subjt:  ------------VQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVA

SwissProt top hitse value%identityAlignment
O04251 BRCT domain-containing protein At4g021107.2e-3263.27Show/hide
Query:  SSQAGKFLMEEPYEWHKNGLTEYSAINLKAPRKCRLLRERTGHGAFYGMRSIIYGESIAPPLDTLKHAVKAGDGTILATSPPYTKFLKSGVDLAVVSP
        S +AGK L EEPYEWH +GL+   AINL++P+K RL+RE+TGHGA YG+R ++YG+   P LDTLK AVKAGDGTILAT+PPYT+FL    D A++SP
Subjt:  SSQAGKFLMEEPYEWHKNGLTEYSAINLKAPRKCRLLRERTGHGAFYGMRSIIYGESIAPPLDTLKHAVKAGDGTILATSPPYTKFLKSGVDLAVVSP

Arabidopsis top hitse value%identityAlignment
AT1G76720.1 eukaryotic translation initiation factor 2 (eIF-2) family protein4.4e-0837.42Show/hide
Query:  KDERDVVADEVQLDD-ECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVD-EGIRVAETSKNKRKKK---KSGRTAQEQDDLDKVLAE
        KD+ +V  DE Q+      E    D+E     F    K      S G+ +   + S     D + + V ET K K+KKK   K  RT +E+DDLDK+LAE
Subjt:  KDERDVVADEVQLDD-ECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVD-EGIRVAETSKNKRKKK---KSGRTAQEQDDLDKVLAE

Query:  LGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTETTVARKKKK
        LGE P   +PA    + +  KV+  P  V P     EKE E+E+ ET  A+KKKK
Subjt:  LGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTETTVARKKKK

AT1G76810.1 eukaryotic translation initiation factor 2 (eIF-2) family protein4.8e-0735.37Show/hide
Query:  GNCTKDERDVVADEVQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKK--KSGRTAQEQDDLDKVLA
        G    + RD   DE  ++DE  E+  +      K+       V   + D      TK S      + + V ET K+K+KKK  KSGRT QE++DLDK+LA
Subjt:  GNCTKDERDVVADEVQLDDECKENKGLDQEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKK--KSGRTAQEQDDLDKVLA

Query:  ELGEWPTISKPA-DPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTETTVARKKKKHSSQAGK
         LGE P   +PA   P+  + A+ E        P A VE   E+E  E T A KKKK   +  K
Subjt:  ELGEWPTISKPA-DPPLSSQRAKVENPPDLVTPPGALVEKEAEEESTETTVARKKKKHSSQAGK

AT4G02110.1 transcription coactivators5.1e-3363.27Show/hide
Query:  SSQAGKFLMEEPYEWHKNGLTEYSAINLKAPRKCRLLRERTGHGAFYGMRSIIYGESIAPPLDTLKHAVKAGDGTILATSPPYTKFLKSGVDLAVVSP
        S +AGK L EEPYEWH +GL+   AINL++P+K RL+RE+TGHGA YG+R ++YG+   P LDTLK AVKAGDGTILAT+PPYT+FL    D A++SP
Subjt:  SSQAGKFLMEEPYEWHKNGLTEYSAINLKAPRKCRLLRERTGHGAFYGMRSIIYGESIAPPLDTLKHAVKAGDGTILATSPPYTKFLKSGVDLAVVSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATATCGAAACTTTTGAATCTGAGAAGCAGAAAGTGAAAAATGAGGGAGTGACAACTCCATCAAATGCAGTAAGGTCCACGCAGCTATGTGCTACCAGTTGCTTTAG
GTCTAGGAGAACTCCATTGAAGAGCGCCACTTTGCATAGATTAGGCATCCATGATGCCTCATGTAAAATGGCAGTAGGTGAAATAAAAGATAATTATAATATTGAGAAGA
CGGAAGAAATCTCAGTTGGTTGCTTCAGGGCCCTAATACTGAAGTCCATACTGCACCCGTTTGTAAGATTCGGGAGCTTTGCGGTTTCATGCAAGCAGTTCCCCAATGAA
GGTGAGTTTCTAGACTATCAATCTTCTGAATTAAGAACTGCAGATAGTGAGTGTTTTTCGACCCATTCTCGTGGATGGGACTCAAAGAGTCCGAGGATTGTAAATTGTCC
TTCCCCAAAGCATGGTGGCAGTGGCAGTGATCATGAATTCTCAGTCGAACTAGATCCGAAATCTACCAATCCCCAATTGAGTTCTTGGGCTCTAAATGAGGATGAAAAGA
AGAGTAAAGGGCCCAACAAAAGGGAGGAAGGAGAATTAGAACCTGACCTCGAGGTTGAAACTGCGCTTGCATCTGAACCTGAATCGAATCTTGAACCCGAACCAAATCCG
AAATCGGAAACTGGGTGTGAAGCTGAATCGTTTCCTGAGAGTGAAGATAAATTGGTAGAAGAAAGACATTTGGAGTCTGATAATGGTCAAAGGGAGGTTGAAAGTGAGAG
TCTAGACCAAGTTCAGAAAGACAGTATTGTGGTAGAGGCTGAGTTGTTGGACAAGAGCTTGGACGTGGCTAAAGAGGGAGAAGCTTGTAGTGATGATGCTGGTTTATCAG
AAAGTTGGGATATATCGATTGATTTAGGGAATTGTACTAAGGATGAACGTGATGTTGTGGCTGACGAAGTTCAGTTGGATGACGAGTGCAAGGAAAACAAGGGCCTCGAT
CAAGAAGTGAATACCAAGAATTTTGATGTATCAGATAAAGAGGTAGAAAAAGAAATGTCAGATGGAGAGACAACAAAAACTACCAAGGCATCGACTCAAAATTTTGTTGA
TGAAGGTATAAGGGTGGCAGAAACTTCAAAAAATAAAAGGAAAAAGAAGAAGAGTGGAAGGACCGCTCAAGAACAAGATGATTTAGATAAGGTTCTTGCTGAGCTAGGAG
AGTGGCCTACTATATCAAAACCAGCAGATCCTCCTCTTTCTTCACAAAGGGCCAAAGTTGAGAACCCACCTGACCTTGTAACTCCTCCTGGTGCATTAGTTGAAAAGGAA
GCTGAAGAAGAAAGCACAGAAACGACTGTCGCTAGGAAGAAGAAAAAGCATAGTAGTCAGGCTGGAAAATTCTTGATGGAGGAGCCTTATGAATGGCACAAAAACGGCCT
CACTGAATACAGTGCTATCAATTTGAAGGCTCCTAGGAAGTGTCGGTTATTGAGGGAGAGAACAGGCCACGGTGCCTTTTATGGGATGCGTAGTATCATATACGGGGAAT
CTATTGCTCCACCTCTGGATACTCTCAAGCATGCTGTTAAGGCTGGAGATGGGACAATTCTAGCCACATCTCCACCTTATACTAAATTCCTGAAGTCTGGAGTTGATCTT
GCTGTTGTCAGCCCTGCACGCCACGTGTTGATTTGTGGGTCCAAGAGTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATATCGAAACTTTTGAATCTGAGAAGCAGAAAGTGAAAAATGAGGGAGTGACAACTCCATCAAATGCAGTAAGGTCCACGCAGCTATGTGCTACCAGTTGCTTTAG
GTCTAGGAGAACTCCATTGAAGAGCGCCACTTTGCATAGATTAGGCATCCATGATGCCTCATGTAAAATGGCAGTAGGTGAAATAAAAGATAATTATAATATTGAGAAGA
CGGAAGAAATCTCAGTTGGTTGCTTCAGGGCCCTAATACTGAAGTCCATACTGCACCCGTTTGTAAGATTCGGGAGCTTTGCGGTTTCATGCAAGCAGTTCCCCAATGAA
GGTGAGTTTCTAGACTATCAATCTTCTGAATTAAGAACTGCAGATAGTGAGTGTTTTTCGACCCATTCTCGTGGATGGGACTCAAAGAGTCCGAGGATTGTAAATTGTCC
TTCCCCAAAGCATGGTGGCAGTGGCAGTGATCATGAATTCTCAGTCGAACTAGATCCGAAATCTACCAATCCCCAATTGAGTTCTTGGGCTCTAAATGAGGATGAAAAGA
AGAGTAAAGGGCCCAACAAAAGGGAGGAAGGAGAATTAGAACCTGACCTCGAGGTTGAAACTGCGCTTGCATCTGAACCTGAATCGAATCTTGAACCCGAACCAAATCCG
AAATCGGAAACTGGGTGTGAAGCTGAATCGTTTCCTGAGAGTGAAGATAAATTGGTAGAAGAAAGACATTTGGAGTCTGATAATGGTCAAAGGGAGGTTGAAAGTGAGAG
TCTAGACCAAGTTCAGAAAGACAGTATTGTGGTAGAGGCTGAGTTGTTGGACAAGAGCTTGGACGTGGCTAAAGAGGGAGAAGCTTGTAGTGATGATGCTGGTTTATCAG
AAAGTTGGGATATATCGATTGATTTAGGGAATTGTACTAAGGATGAACGTGATGTTGTGGCTGACGAAGTTCAGTTGGATGACGAGTGCAAGGAAAACAAGGGCCTCGAT
CAAGAAGTGAATACCAAGAATTTTGATGTATCAGATAAAGAGGTAGAAAAAGAAATGTCAGATGGAGAGACAACAAAAACTACCAAGGCATCGACTCAAAATTTTGTTGA
TGAAGGTATAAGGGTGGCAGAAACTTCAAAAAATAAAAGGAAAAAGAAGAAGAGTGGAAGGACCGCTCAAGAACAAGATGATTTAGATAAGGTTCTTGCTGAGCTAGGAG
AGTGGCCTACTATATCAAAACCAGCAGATCCTCCTCTTTCTTCACAAAGGGCCAAAGTTGAGAACCCACCTGACCTTGTAACTCCTCCTGGTGCATTAGTTGAAAAGGAA
GCTGAAGAAGAAAGCACAGAAACGACTGTCGCTAGGAAGAAGAAAAAGCATAGTAGTCAGGCTGGAAAATTCTTGATGGAGGAGCCTTATGAATGGCACAAAAACGGCCT
CACTGAATACAGTGCTATCAATTTGAAGGCTCCTAGGAAGTGTCGGTTATTGAGGGAGAGAACAGGCCACGGTGCCTTTTATGGGATGCGTAGTATCATATACGGGGAAT
CTATTGCTCCACCTCTGGATACTCTCAAGCATGCTGTTAAGGCTGGAGATGGGACAATTCTAGCCACATCTCCACCTTATACTAAATTCCTGAAGTCTGGAGTTGATCTT
GCTGTTGTCAGCCCTGCACGCCACGTGTTGATTTGTGGGTCCAAGAGTTCTTAAATGATGAGATACCCTGTGTAGCGGCTGATTACTTGGTTGAGTATGTTTGCAAACCT
GGTTATCCTCTTGATAAACATGTTCTGTATAATACTTAAGCATGGGCGGTAAAATCCTTCTGCAACCTTCAGAGGAGATCAGAAGAAGTTTCCATGACTCGATCCCATAG
GATGATTGTAGTAATAACGATATTGCCTGCCAAGAGTGCGGGTCTCGAGATAGAGGGGAGATGATGCTCATTTGTGGCCGTGAAGATGGTTCAAATGGTTGTGGAATTGG
CATGCATATAGATTGCTGCAATCCTCCATCAATGGATTGGTTTTGTTCAGATTGTATTAGCAGCAGAAACAGCAAGAACTCCCAAAATAAAAGGAAAGGAGTCTCAGGGA
AGCGGAAGTTATTAGAAATCATGACATCTACAATGGTATGATATTGTTCACTTTGAGCATAAGCTTTCATGGCTTTGCTTTTTGTTGGTTTTCCTAAAAAGCCTCAATGG
AGATGTATTCCTTACTTATAGACCCATGATCATCCCCTTAATTAGCCAATGTGATACTCCCTCCAAACAATCCTCCCCTCGAACAAAGTACACCATAGAGCCTTTCCTGA
AGCCTATGTAGCCCTCGAACAGTCTACCCTTAATCGAGATTTGACTCCTTTCTTTGGAGCCCTTGAACAAAATATACCCTTTTGTTGGACACCAGTCACTTTTGACTATA
CCTTTGAGGCTACCAACTTCTTTGTTCGACACTTGAGGATTCAATTAGCGTGGCTAAGTTAAGGGCATGACTCTGATACTATCTTAGGAACTACGACCTCCACAATGGTC
TTTTAGATGGTCTGGAGAGTTTTTCATCAGTTGATTATCTGCTGCGGTTTGACTCTCTAATCACTCAGGTTTGGAAGCAATTTAGTTTGCATGCTTCGTAATTAAAATGG
TAATACTGCTGTTTATTTGAAAGTATAGAGTTTGTGAATAGTGACATTTTCTTTTACTTTTGTATCATCTGCAACAAGTTTCTTTAAATGTCCAAGGCTAGACTAGGATA
GAAAGGCTAGGCTTTTCTTGCTTTTCTGTGGAAACTTCTTGTTCTTGGGTCCCATCATCAGCCTTTATGATCAACCAACTCTTCCAGCATATCTGCACACTTCCCCTTAC
ATAAAAGCAAATAAAGATAATAGGTTTTATAAGACTTTGGATGACACAACGGTTGAAACCTAAATGCCACGAAAATAAAATAATGTTCTAATACTAATTGTTGTAGCGAA
GTAAAATCGATG
Protein sequenceShow/hide protein sequence
MYIETFESEKQKVKNEGVTTPSNAVRSTQLCATSCFRSRRTPLKSATLHRLGIHDASCKMAVGEIKDNYNIEKTEEISVGCFRALILKSILHPFVRFGSFAVSCKQFPNE
GEFLDYQSSELRTADSECFSTHSRGWDSKSPRIVNCPSPKHGGSGSDHEFSVELDPKSTNPQLSSWALNEDEKKSKGPNKREEGELEPDLEVETALASEPESNLEPEPNP
KSETGCEAESFPESEDKLVEERHLESDNGQREVESESLDQVQKDSIVVEAELLDKSLDVAKEGEACSDDAGLSESWDISIDLGNCTKDERDVVADEVQLDDECKENKGLD
QEVNTKNFDVSDKEVEKEMSDGETTKTTKASTQNFVDEGIRVAETSKNKRKKKKSGRTAQEQDDLDKVLAELGEWPTISKPADPPLSSQRAKVENPPDLVTPPGALVEKE
AEEESTETTVARKKKKHSSQAGKFLMEEPYEWHKNGLTEYSAINLKAPRKCRLLRERTGHGAFYGMRSIIYGESIAPPLDTLKHAVKAGDGTILATSPPYTKFLKSGVDL
AVVSPARHVLICGSKSS