CuGenDBv2

CsGy2G020610 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy2G020610
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: zinc finger homeobox protein 4-like isoform X1
Genome location: Gy14Chr2:29512014..29513781
RNA-Seq Expression: CsGy2G020610
Synteny: CsGy2G020610
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value | % identity | alignment)
XP_008444812.1 PREDICTED: uncharacterized protein LOC103488048 [Cucumis melo] | e value: 1.40e-117 | % identity: 67.69
Query:  STHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRS
        STH+CSISF+SD+F+P+EHFVA IL++L LLIQ+S FSLGL PSWP+RRKRSAV SPPD SS++ QPP PP     SSER KESSPTTPLS + L   RS
Subjt:  STHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRS

Query:  ESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQE---ILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSS
        ESDEN    KVSK+KAP+DKK QYLETI+KLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   ILGG +N S  P+ GTS+S          KSS
Subjt:  ESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQE---ILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSS

Query:  DSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGA
         SNVENN  EC+PSMKNQT PVAEQSN  QN+QIP G IPL DP    MGIPDLNL++E  +  NYTKY+AAKARQNRI+IWKNK NNN+N  A
Subjt:  DSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGA

XP_008444813.1 PREDICTED: uncharacterized protein LOC103488049 [Cucumis melo] | e value: 2.79e-185 | % identity: 91.36
Query:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL
        MASTSSTHQCSISFDSDDFSPEE FVAQILQQLPLLIQ+S+FSLGLSPSWPIRRKRSAVDSPPDT SLITQPPLPP PC PSSEREKESSPTTPLSL+SL
Subjt:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL

Query:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK
        PLSRSESDEN TIAKVSKKKAPVDKKSQYLETI+KLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEIL GF N+SVNP+ GTS+SVAME+AKLTVK
Subjt:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK

Query:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ
        SS SNVENNHDECEPSMKNQTVP AEQ NS +NYQIPIGGIPLYDPSLGPMGIPDLNLSLEDI HK+YTKYLAA+ARQNRIQIWKNKNNNNN  GAPKLQ
Subjt:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ

Query:  S
        S
Subjt:  S

XP_011649663.1 uncharacterized protein LOC105434650 [Cucumis sativus] | e value: 1.83e-94 | % identity: 61.72
Query:  STHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRS
        ST   SIS +SDDF+PE+H VA IL++LPLLIQ+S FSLGL PSWPIRRKRSAV SP   S+++ QPP PP     SSE +KE+SPTTPLSL SL LSRS
Subjt:  STHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRS

Query:  ESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQE---ILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSS
        ESDEN    KVSK+KAP+ KK +  E+++KLTHQ QAL  + EA K+ F + KTINSELKAKKQE   ILGG +N S  P+ GTSTS          KSS
Subjt:  ESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQE---ILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSS

Query:  DSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNN
        D N+ENN  ECEPS KNQT P+AEQSN  QN+QIPI  IPL D     MGIPDLNL++E     NY K LAAKARQNR +I KNK N  N
Subjt:  DSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNN

XP_011649664.1 myocardin-related transcription factor A [Cucumis sativus] | e value: 8.39e-208 | % identity: 100
Query:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL
        MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL
Subjt:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL

Query:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK
        PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK
Subjt:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK

Query:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ
        SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ
Subjt:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ

Query:  S
        S
Subjt:  S

XP_022996985.1 zinc finger homeobox protein 4-like isoform X1 [Cucurbita maxima] | e value: 1.39e-91 | % identity: 55.41
Query:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL
        MAST+  HQC+   D D  +P+E    QIL + PLL+QQ  FSLGL P+WP+R KRSAV SPPD+ S++  P  PPPP  PSS ++KESSPTTP SL SL
Subjt:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL

Query:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK
        PLSR ESDE    A +  KK  +DKKSQYLET+ +LT Q QAL G ++ +KRH+  LKT NSELKAK+Q+++      S NP+   S+S A++  K TVK
Subjt:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK

Query:  SSDSNVENNHDECEPSMKNQT-VPVA----EQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNN-
              +++  +C+P +KNQT VP A    EQSNS QN +IP G I +YDPS GP GIPDLNLS ++I  +NYT+ +AA+ARQNRI+IWK+KNNNNNNN 
Subjt:  SSDSNVENNHDECEPSMKNQT-VPVA----EQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNN-

Query:  GAPKL
        GA +L
Subjt:  GAPKL

TrEMBL top hits (e value | % identity | alignment)
A0A0A0LRP1 Uncharacterized protein | e value: 4.06e-208 | % identity: 100
Query:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL
        MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL
Subjt:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL

Query:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK
        PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK
Subjt:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK

Query:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ
        SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ
Subjt:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ

Query:  S
        S
Subjt:  S

A0A1S3BAR4 uncharacterized protein LOC103488049 | e value: 1.35e-185 | % identity: 91.36
Query:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL
        MASTSSTHQCSISFDSDDFSPEE FVAQILQQLPLLIQ+S+FSLGLSPSWPIRRKRSAVDSPPDT SLITQPPLPP PC PSSEREKESSPTTPLSL+SL
Subjt:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL

Query:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK
        PLSRSESDEN TIAKVSKKKAPVDKKSQYLETI+KLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEIL GF N+SVNP+ GTS+SVAME+AKLTVK
Subjt:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK

Query:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ
        SS SNVENNHDECEPSMKNQTVP AEQ NS +NYQIPIGGIPLYDPSLGPMGIPDLNLSLEDI HK+YTKYLAA+ARQNRIQIWKNKNNNNN  GAPKLQ
Subjt:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ

Query:  S
        S
Subjt:  S

A0A1S3BC34 uncharacterized protein LOC103488048 | e value: 6.77e-118 | % identity: 67.69
Query:  STHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRS
        STH+CSISF+SD+F+P+EHFVA IL++L LLIQ+S FSLGL PSWP+RRKRSAV SPPD SS++ QPP PP     SSER KESSPTTPLS + L   RS
Subjt:  STHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRS

Query:  ESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQE---ILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSS
        ESDEN    KVSK+KAP+DKK QYLETI+KLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   ILGG +N S  P+ GTS+S          KSS
Subjt:  ESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQE---ILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSS

Query:  DSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGA
         SNVENN  EC+PSMKNQT PVAEQSN  QN+QIP G IPL DP    MGIPDLNL++E  +  NYTKY+AAKARQNRI+IWKNK NNN+N  A
Subjt:  DSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGA

A0A5A7VA15 Uncharacterized protein | e value: 6.77e-118 | % identity: 67.69
Query:  STHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRS
        STH+CSISF+SD+F+P+EHFVA IL++L LLIQ+S FSLGL PSWP+RRKRSAV SPPD SS++ QPP PP     SSER KESSPTTPLS + L   RS
Subjt:  STHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRS

Query:  ESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQE---ILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSS
        ESDEN    KVSK+KAP+DKK QYLETI+KLTHQ QAL  D+EAMK+HF++LKTINSELKAKKQE   ILGG +N S  P+ GTS+S          KSS
Subjt:  ESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQE---ILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSS

Query:  DSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGA
         SNVENN  EC+PSMKNQT PVAEQSN  QN+QIP G IPL DP    MGIPDLNL++E  +  NYTKY+AAKARQNRI+IWKNK NNN+N  A
Subjt:  DSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGA

A0A5A7VHE1 Uncharacterized protein | e value: 1.35e-185 | % identity: 91.36
Query:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL
        MASTSSTHQCSISFDSDDFSPEE FVAQILQQLPLLIQ+S+FSLGLSPSWPIRRKRSAVDSPPDT SLITQPPLPP PC PSSEREKESSPTTPLSL+SL
Subjt:  MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSL

Query:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK
        PLSRSESDEN TIAKVSKKKAPVDKKSQYLETI+KLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEIL GF N+SVNP+ GTS+SVAME+AKLTVK
Subjt:  PLSRSESDENTTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVK

Query:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ
        SS SNVENNHDECEPSMKNQTVP AEQ NS +NYQIPIGGIPLYDPSLGPMGIPDLNLSLEDI HK+YTKYLAA+ARQNRIQIWKNKNNNNN  GAPKLQ
Subjt:  SSDSNVENNHDECEPSMKNQTVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQ

Query:  S
        S
Subjt:  S

SwissProt top hits (e value | % identity | alignment)
No hits found
Arabidopsis top hits (e value | % identity | alignment)
No hits found

Sequences
CDS sequence
ATGGCTTCCACTTCTTCCACTCATCAATGCTCCATCTCCTTCGATTCCGACGACTTCTCCCCTGAAGAACACTTCGTCGCTCAAATCCTCCAACAATTACCTCTTCTCAT
TCAACAATCCCACTTCTCTCTTGGCCTATCACCTTCCTGGCCTATCCGACGCAAGAGATCCGCCGTCGATTCCCCGCCGGACACGTCCTCCCTCATCACTCAACCGCCGC
TTCCACCACCACCTTGTCTACCGTCGTCCGAGAGAGAAAAGGAGTCTAGTCCAACTACTCCGCTTTCACTCCACTCCTTGCCCTTGTCAAGGAGTGAATCTGATGAGAAT
ACTACTATTGCTAAGGTCTCCAAGAAGAAAGCCCCTGTCGATAAGAAATCTCAGTATTTGGAAACCATAGAGAAATTAACCCACCAGAAACAAGCTCTGGAAGGGGACAT
TGAAGCTATGAAGCGACATTTTATCAATCTGAAAACTATAAATTCGGAGTTGAAAGCCAAAAAGCAAGAGATTCTGGGTGGTTTCAGTAATCTATCAGTAAATCCAAAAT
TTGGGACCTCAACTTCGGTCGCCATGGAAATAGCTAAGTTAACAGTGAAATCCTCAGACTCAAATGTGGAGAATAATCACGATGAATGTGAACCGTCGATGAAGAATCAG
ACGGTTCCAGTGGCAGAACAGAGCAACAGTATTCAGAATTACCAAATTCCAATTGGGGGAATTCCTTTGTATGATCCTTCATTGGGCCCAATGGGGATTCCTGATTTGAA
CCTCTCTTTGGAAGATATTCTTCATAAGAATTACACAAAATATTTGGCTGCTAAAGCAAGACAAAACAGAATTCAGATCTGGAAAAACAAGAACAACAACAACAACAACA
ATGGAGCTCCCAAATTGCAATCCTAA
mRNA sequence
CACGTGTCACAAAGCCACATCCCCTTCTCCAAAATCCTTTCCTATTTTGTTTTGCGAAACCCAGTCCCCAAGTAACACCATTCTCCTCCTCTTCCTCTTCCTCTACCTCT
TCTTCTTCCTCCGGTGTTCTATTAATCCCTAAACCCCTTCTCTTCCATGGAATTTCTCTTCATCTGAACTTTCACTCACCAATTCTTTCGATCAAACCAAATAGATCGAT
TCTCCATGGCTTCCACTTCTTCCACTCATCAATGCTCCATCTCCTTCGATTCCGACGACTTCTCCCCTGAAGAACACTTCGTCGCTCAAATCCTCCAACAATTACCTCTT
CTCATTCAACAATCCCACTTCTCTCTTGGCCTATCACCTTCCTGGCCTATCCGACGCAAGAGATCCGCCGTCGATTCCCCGCCGGACACGTCCTCCCTCATCACTCAACC
GCCGCTTCCACCACCACCTTGTCTACCGTCGTCCGAGAGAGAAAAGGAGTCTAGTCCAACTACTCCGCTTTCACTCCACTCCTTGCCCTTGTCAAGGAGTGAATCTGATG
AGAATACTACTATTGCTAAGGTCTCCAAGAAGAAAGCCCCTGTCGATAAGAAATCTCAGTATTTGGAAACCATAGAGAAATTAACCCACCAGAAACAAGCTCTGGAAGGG
GACATTGAAGCTATGAAGCGACATTTTATCAATCTGAAAACTATAAATTCGGAGTTGAAAGCCAAAAAGCAAGAGATTCTGGGTGGTTTCAGTAATCTATCAGTAAATCC
AAAATTTGGGACCTCAACTTCGGTCGCCATGGAAATAGCTAAGTTAACAGTGAAATCCTCAGACTCAAATGTGGAGAATAATCACGATGAATGTGAACCGTCGATGAAGA
ATCAGACGGTTCCAGTGGCAGAACAGAGCAACAGTATTCAGAATTACCAAATTCCAATTGGGGGAATTCCTTTGTATGATCCTTCATTGGGCCCAATGGGGATTCCTGAT
TTGAACCTCTCTTTGGAAGATATTCTTCATAAGAATTACACAAAATATTTGGCTGCTAAAGCAAGACAAAACAGAATTCAGATCTGGAAAAACAAGAACAACAACAACAA
CAACAATGGAGCTCCCAAATTGCAATCCTAATTCCACTATTTTCCTCCTTTTTTTTTCTTTTTAATCTTAGGATTCAATTCATCATTCACCAATCCCTGTTTTGATTCAT
GAATTGGGGTAGTTTTTAATTTTATTTTTTCAATTGGGTTACTCAAATTGTAAAGGTACAGAATCTCAGAGTAAGAGTAAGAGCTGACCCTGCTCCCAAATATTTCTCTT
AATATATATCTTGTGTTTTTTTTTTTCCTCTTTTCGTATATTCTTTAATTAAAGTATAAATAATAAAAGCTACTTCTTTTTCCTTTTTGGT
Protein sequence
MASTSSTHQCSISFDSDDFSPEEHFVAQILQQLPLLIQQSHFSLGLSPSWPIRRKRSAVDSPPDTSSLITQPPLPPPPCLPSSEREKESSPTTPLSLHSLPLSRSESDEN
TTIAKVSKKKAPVDKKSQYLETIEKLTHQKQALEGDIEAMKRHFINLKTINSELKAKKQEILGGFSNLSVNPKFGTSTSVAMEIAKLTVKSSDSNVENNHDECEPSMKNQ
TVPVAEQSNSIQNYQIPIGGIPLYDPSLGPMGIPDLNLSLEDILHKNYTKYLAAKARQNRIQIWKNKNNNNNNNGAPKLQS
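
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the function name `translate` and the table construction are illustrative and are not part of CuGenDB. It is shown here on the first 33 bases of the CDS only.

```python
from itertools import product

# Standard genetic code in the conventional compact layout:
# codons enumerated with bases in the order T, C, A, G (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the multi-line CDS
    from the record can be pasted in as-is. Codons containing characters
    other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS sequence listed in this record.
print(translate("ATGGCTTCCACTTCTTCCACTCATCAATGCTCC"))  # MASTSSTHQCS
```

The output matches the first eleven residues of the protein sequence in the record. Passing the full CDS in place of the prefix should allow the same comparison over the whole protein.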