; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G08250 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G08250
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionHomeobox leucine zipper protein
Genome locationChr4:5987775..5988862
RNA-Seq ExpressionCSPI04G08250
SyntenyCSPI04G08250
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0047834 - D-threo-aldose 1-dehydrogenase activity (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576630.1 Homeobox-leucine zipper protein ATHB-7, partial [Cucurbita argyrosperma subsp. sororia]3.5e-4059.3Show/hide
Query:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--
        MNKRRFSDEQI++LE+I+Y ++SKL+SR+ ++LAT+LGLQP+QI IWFQN+RARWKSKE + NF+SLRA  D+LASQF+TLQEE NSL SQLQKL+ L  
Subjt:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--

Query:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSDRI
        +GP EG      + +T +ET EKP++  D L Q    Y NNNYNT  +    GEG+  L+MTH  EGRS+ I
Subjt:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSDRI

KAG7014681.1 Homeobox-leucine zipper protein ATHB-7, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-4059.3Show/hide
Query:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--
        MNKRRFSDEQI++LE+I+Y ++SKL+SR+ ++LAT+LGLQP+QI IWFQN+RARWKSKE + NF+SLRA  D+LASQF+TLQEE NSL SQLQKL+ L  
Subjt:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--

Query:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSDRI
        +GP EG      + +T +ET EKP++  D L Q    Y NNNYNT  +    GEG+  L+MTH  EGRS+ I
Subjt:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSDRI

XP_004137302.1 homeobox-leucine zipper protein ATHB-7-like [Cucumis sativus]3.1e-8196.93Show/hide
Query:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVLQG
        MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSL SQLQKLTVLQG
Subjt:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVLQG

Query:  PCEGTIIRMTLKKTTHETHEKPKNLPDALEQMYSNNNYNTKCLVIGEGELQMTHLLEGRSDRI
        PCEG IIRMTLKKTTHETHEKPK LP ALEQMYSN+NYNTKCLVIGEGELQMTHLLEGRSDRI
Subjt:  PCEGTIIRMTLKKTTHETHEKPKNLPDALEQMYSNNNYNTKCLVIGEGELQMTHLLEGRSDRI

XP_022923069.1 homeobox-leucine zipper protein ATHB-12-like isoform X2 [Cucurbita moschata]7.8e-4059.41Show/hide
Query:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--
        MNKRRFSDEQI++LE+I+Y ++SKL+SR+ ++LAT+LGLQP+QI IWFQN+RARWKSKE + NF+SLRA  D+LASQF+TLQEE NSL SQLQKL+ L  
Subjt:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--

Query:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSD
        +GP EG      + +T +ET EKP++  D L Q    Y NNNYNT  +    GEG+  L+MTH  EGRS+
Subjt:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSD

XP_038900057.1 homeobox-leucine zipper protein ATHB-12-like isoform X2 [Benincasa hispida]5.9e-4061.29Show/hide
Query:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTV-LQ
        MNKRR SDEQI++LE I+Y T+S+L+SR+ ++LAT+LGLQP+QI IWFQN+RARWKSKE + NF+SL+A  D+L SQF+TLQEENNS+ SQLQKL V LQ
Subjt:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTV-LQ

Query:  GPCEGTIIRMTLKKTTHETHEKPKNLPDALE---QMYSNNNYNT--KCLVIGEGE
        GPC G      + +TT ET E+PK+  D LE     Y NNNYNT  KC+  GEG+
Subjt:  GPCEGTIIRMTLKKTTHETHEKPKNLPDALE---QMYSNNNYNT--KCLVIGEGE

TrEMBL top hitse value%identityAlignment
A0A6J1E599 homeobox-leucine zipper protein ATHB-12-like isoform X23.8e-4059.41Show/hide
Query:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--
        MNKRRFSDEQI++LE+I+Y ++SKL+SR+ ++LAT+LGLQP+QI IWFQN+RARWKSKE + NF+SLRA  D+LASQF+TLQEE NSL SQLQKL+ L  
Subjt:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--

Query:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSD
        +GP EG      + +T +ET EKP++  D L Q    Y NNNYNT  +    GEG+  L+MTH  EGRS+
Subjt:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSD

A0A6J1E8K2 homeobox-leucine zipper protein ATHB-12-like isoform X13.8e-4059.41Show/hide
Query:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--
        MNKRRFSDEQI++LE+I+Y ++SKL+SR+ ++LAT+LGLQP+QI IWFQN+RARWKSKE + NF+SLRA  D+LASQF+TLQEE NSL SQLQKL+ L  
Subjt:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL--

Query:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSD
        +GP EG      + +T +ET EKP++  D L Q    Y NNNYNT  +    GEG+  L+MTH  EGRS+
Subjt:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSD

A0A6J1G2H1 homeobox-leucine zipper protein ATHB-12-like3.1e-3456.02Show/hide
Query:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKL--TVL
        MNKRRFSDEQI++LE+I+Y ++SKL+SR+ ++LA +LGLQP+QI+IWFQN+RARWKSK+ + +F+SLRA  D+LASQF TLQEENNSL SQLQKL   V 
Subjt:  MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKL--TVL

Query:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNTKCLVIGEGELQMTHLLEGRSD
        + P EG        K   E   K + L D  EQ    Y NNN  T+      G  Q THLLEGRS+
Subjt:  QGPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNTKCLVIGEGELQMTHLLEGRSD

A0A6J1J3H1 homeobox-leucine zipper protein ATHB-12-like isoform X22.1e-3857.89Show/hide
Query:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT--VLQ
        NKRRFSDEQI++LE+I+Y ++SKL+SR+ ++LAT+LGLQP+QI IWFQN+R RWKSKE + NF+SLRA  D+LASQF+TLQEE NSL SQLQKL+  V +
Subjt:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT--VLQ

Query:  GPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSDRI
        GP EG      + KT +ET EKP++  D L Q    Y +NNYNT  +    GEG+  L+MTH  EG S+ I
Subjt:  GPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSDRI

A0A6J1J6P1 homeobox-leucine zipper protein ATHB-12-like isoform X12.1e-3857.89Show/hide
Query:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT--VLQ
        NKRRFSDEQI++LE+I+Y ++SKL+SR+ ++LAT+LGLQP+QI IWFQN+R RWKSKE + NF+SLRA  D+LASQF+TLQEE NSL SQLQKL+  V +
Subjt:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT--VLQ

Query:  GPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSDRI
        GP EG      + KT +ET EKP++  D L Q    Y +NNYNT  +    GEG+  L+MTH  EG S+ I
Subjt:  GPCEGTIIRMTLKKTTHETHEKPKNLPDALEQ---MYSNNNYNT--KCLVIGEGE--LQMTHLLEGRSDRI

SwissProt top hitse value%identityAlignment
P46897 Homeobox-leucine zipper protein ATHB-75.8e-2240Show/hide
Query:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL---
        N+RRFSDEQIK+LE ++  +E++L  R+ ++LA +LGLQP+Q+ IWFQNKRARWKSK+ +  +  LR   D+LASQFE+L++E  +L S+LQ+L      
Subjt:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL---

Query:  -----QGPCEGTIIRMTLKKTTHET------HEKPKNLPDALEQMYSNNNYNTKC
             +  C G    + L  T HE+        KP+ +   +E      ++   C
Subjt:  -----QGPCEGTIIRMTLKKTTHET------HEKPKNLPDALEQMYSNNNYNTKC

Q01IK0 Homeobox-leucine zipper protein HOX223.9e-1848.39Show/hide
Query:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKL
        KRRF++EQI++LE++++   +KL  R+  +LA +LGLQP+Q+ IWFQNKRARW+SK+ + ++ +LR+K D L S+ E+L++E  +L  QL +L
Subjt:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKL

Q651Z5 Homeobox-leucine zipper protein HOX61.0e-1851Show/hide
Query:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQGP
        K+RFS+EQIK+LE++ + T++KL  RQ ++LA +LGLQP+Q+ IWFQNKRARWKSK+ +  + +LR   D L   +E+L++E  +L  QL+KL  +LQ P
Subjt:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQGP

Q9M276 Homeobox-leucine zipper protein ATHB-122.4e-2038.78Show/hide
Query:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQG
        N++RFS+EQIK+LE I+  +E++L  R+ +++A +LGLQP+Q+ IWFQNKRARWK+K+ ++ + +LRA  ++LASQFE +++E  SL S+LQ+L   +Q 
Subjt:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQG

Query:  P--------CEGTIIRMTLKKTTHETHEKPKNLPDALEQMYSNNNYN
        P        C    + ++    +H    +P+   D    + ++ +YN
Subjt:  P--------CEGTIIRMTLKKTTHETHEKPKNLPDALEQMYSNNNYN

Q9XH35 Homeobox-leucine zipper protein HOX61.0e-1851Show/hide
Query:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQGP
        K+RFS+EQIK+LE++ + T++KL  RQ ++LA +LGLQP+Q+ IWFQNKRARWKSK+ +  + +LR   D L   +E+L++E  +L  QL+KL  +LQ P
Subjt:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQGP

Arabidopsis top hitse value%identityAlignment
AT2G22430.1 homeobox protein 62.9e-1644.44Show/hide
Query:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKL-TVLQG
        KRR S  Q+K LE  + L E+KL   + +KLA +LGLQP+Q+ +WFQN+RARWK+K+ ++++  L+ + D L   F++L+ +N SL  ++ KL T L G
Subjt:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKL-TVLQG

AT2G46680.1 homeobox 74.1e-2340Show/hide
Query:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL---
        N+RRFSDEQIK+LE ++  +E++L  R+ ++LA +LGLQP+Q+ IWFQNKRARWKSK+ +  +  LR   D+LASQFE+L++E  +L S+LQ+L      
Subjt:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVL---

Query:  -----QGPCEGTIIRMTLKKTTHET------HEKPKNLPDALEQMYSNNNYNTKC
             +  C G    + L  T HE+        KP+ +   +E      ++   C
Subjt:  -----QGPCEGTIIRMTLKKTTHET------HEKPKNLPDALEQMYSNNNYNTKC

AT2G46680.2 homeobox 71.0e-2139.22Show/hide
Query:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQ------LQKL
        N+RRFSDEQIK+LE ++  +E++L  R+ ++LA +LGLQP+Q+ IWFQNKRARWKSK+ +  +  LR   D+LASQFE+L++E  +L S+       +K 
Subjt:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQ------LQKL

Query:  TVLQGPCEGTIIRMTLKKTTHET------HEKPKNLPDALEQMYSNNNYNTKC
           +  C G    + L  T HE+        KP+ +   +E      ++   C
Subjt:  TVLQGPCEGTIIRMTLKKTTHET------HEKPKNLPDALEQMYSNNNYNTKC

AT3G01470.1 homeobox 18.3e-1643.14Show/hide
Query:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQGP
        KRR + EQ+  LE  +  TE+KL   +  +LA KLGLQP+Q+ +WFQN+RARWK+K+ + ++  L++  D L S ++++  +N+ LRS++  LT  LQG 
Subjt:  KRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQGP

Query:  CE
         E
Subjt:  CE

AT3G61890.1 homeobox 121.7e-2138.78Show/hide
Query:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQG
        N++RFS+EQIK+LE I+  +E++L  R+ +++A +LGLQP+Q+ IWFQNKRARWK+K+ ++ + +LRA  ++LASQFE +++E  SL S+LQ+L   +Q 
Subjt:  NKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLT-VLQG

Query:  P--------CEGTIIRMTLKKTTHETHEKPKNLPDALEQMYSNNNYN
        P        C    + ++    +H    +P+   D    + ++ +YN
Subjt:  P--------CEGTIIRMTLKKTTHETHEKPKNLPDALEQMYSNNNYN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAAAGGAGGTTTAGTGATGAACAAATTAAAACACTGGAGGCAATATATTATTTGACTGAATCGAAGTTGAATTCGAGGCAGGTGATCAAGCTAGCAACTAAGTT
GGGACTGCAACCTCAACAGATTACTATATGGTTTCAGAACAAAAGAGCAAGATGGAAGTCCAAAGAGAAGCAGGAGAATTTCAAAAGCCTAAGAGCCAAGTGTGATGATC
TAGCATCTCAGTTTGAAACGTTACAGGAAGAGAACAACTCCTTGCGCTCACAGTTGCAGAAGCTAACCGTTCTTCAAGGACCTTGTGAGGGCACTATCATTCGAATGACA
TTGAAAAAAACAACTCATGAAACTCATGAAAAGCCAAAAAACTTACCAGATGCATTGGAACAGATGTATTCGAACAATAACTATAACACAAAATGTTTAGTGATTGGAGA
AGGTGAACTTCAAATGACCCATTTGTTGGAAGGCAGAAGCGACCGCATTATGCCACATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAAAAGGAGGTTTAGTGATGAACAAATTAAAACACTGGAGGCAATATATTATTTGACTGAATCGAAGTTGAATTCGAGGCAGGTGATCAAGCTAGCAACTAAGTT
GGGACTGCAACCTCAACAGATTACTATATGGTTTCAGAACAAAAGAGCAAGATGGAAGTCCAAAGAGAAGCAGGAGAATTTCAAAAGCCTAAGAGCCAAGTGTGATGATC
TAGCATCTCAGTTTGAAACGTTACAGGAAGAGAACAACTCCTTGCGCTCACAGTTGCAGAAGCTAACCGTTCTTCAAGGACCTTGTGAGGGCACTATCATTCGAATGACA
TTGAAAAAAACAACTCATGAAACTCATGAAAAGCCAAAAAACTTACCAGATGCATTGGAACAGATGTATTCGAACAATAACTATAACACAAAATGTTTAGTGATTGGAGA
AGGTGAACTTCAAATGACCCATTTGTTGGAAGGCAGAAGCGACCGCATTATGCCACATTGAGAAGCAACTTTTGAACCCAAATGGATCCTTTGAAACATCTGACACATGG
TGCATTTTTTATGCCGATGGTGGTGTCTATGAATATCAGTCGTGTAGCAGTTTGCAGGGACAGAGTTTCTGGGGATAAAGAAAGATAAGATGGCCCATAATTTCATGACT
TATTACTACCAGCTGAAAAATTGCCATTTAGACAACAGATATGAACAAAACTGAGGAACTGCCAACTGCTGCTAGCTGTTTTAGCAAGTACATTCCTTCCTTATCTTTGT
GCAATTTACGAAAAGAAGAGACATTGGAACTGTATTACTTTGAAATGCATTAGTAGCATCCAAGAAGTAACGCAGACAGCTCAATCAATCGACCTTTTTAGGTGATGCAC
ACACAATTGCTGAACTTCATTTCTTTTCTGGGGGGGCCTCTTGTTATGACTTCTGAGCAACATCTGTTAACTAAGATCAGTAGTTGATCAACAAGTTAAAAAAAAGTATA
TTTTATATGATCTTT
Protein sequenceShow/hide protein sequence
MNKRRFSDEQIKTLEAIYYLTESKLNSRQVIKLATKLGLQPQQITIWFQNKRARWKSKEKQENFKSLRAKCDDLASQFETLQEENNSLRSQLQKLTVLQGPCEGTIIRMT
LKKTTHETHEKPKNLPDALEQMYSNNNYNTKCLVIGEGELQMTHLLEGRSDRIMPH