CuGenDBv2

CSPI01G34800 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G34800
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: LOB domain-containing protein
Genome location: Chr1:29916912..29920402
RNA-Seq Expression: CSPI01G34800
Synteny: CSPI01G34800
Gene Ontology terms: GO:0010087 - phloem or xylem histogenesis (biological process)
                     GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domains: IPR004883 - Lateral organ boundaries, LOB


Homology
GenBank top hits (e value; % identity; alignment)
KAG6583812.1 LOB domain-containing protein 4, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.9e-81; % identity 93.64
Query:  MKTENG-RKQ-GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ
        MKTENG RKQ GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELP+HQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ
Subjt:  MKTENG-RKQ-GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ

Query:  HQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMFEMDMVVDQAMEESMW
         QID LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSP NSGSPSSRLMGSQ SR MFEMDMVVDQAM +S+W
Subjt:  HQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMFEMDMVVDQAMEESMW

XP_004145462.1 LOB domain-containing protein 4 [Cucumis sativus]; e value 5.9e-90; % identity 100
Query:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
        MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
Subjt:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ

Query:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW
        IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW
Subjt:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW

XP_008459081.1 PREDICTED: LOB domain-containing protein 4 [Cucumis melo]; e value 1.6e-87; % identity 97.06
Query:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
        MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
Subjt:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ

Query:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW
        IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRL+GS SR MFEMDMVVDQA+E+SMW
Subjt:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW

XP_022927419.1 LOB domain-containing protein 4-like [Cucurbita moschata]; e value 1.7e-81; % identity 93.64
Query:  MKTENG-RKQ-GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ
        MKTENG RKQ GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELP+HQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ
Subjt:  MKTENG-RKQ-GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ

Query:  HQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMFEMDMVVDQAMEESMW
         QID+LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSP NSGSPSSRLMGSQ SR MFEMDMVVDQAM +S+W
Subjt:  HQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMFEMDMVVDQAMEESMW

XP_038895183.1 LOB domain-containing protein 4-like [Benincasa hispida]; e value 8.9e-86; % identity 95.88
Query:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
        MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ Q
Subjt:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ

Query:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW
        ID LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSR MFEMDM VDQA+ +SMW
Subjt:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW

TrEMBL top hits (e value; % identity; alignment)
A0A061GE84 LOB domain-containing protein 4 isoform 1; e value 2.1e-72; % identity 83.93
Query:  ENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDA
        E+GRKQ G A+PCAACKLLRRRC ++CVFAPYFPAD+P KFANVHKVFGASNVNKMLQELP+HQRGDAVSS+VYEANARVRDPVYGCVGAISSLQ QIDA
Subjt:  ENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDA

Query:  LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQA-MEESMW
        LQTQLALAQAEVVHLRVRQTASFS++G  P SPSNSGSPSS+LMGSQ++ MF++DMVVD A + ESMW
Subjt:  LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQA-MEESMW

A0A0A0LYG0 LOB domain-containing protein; e value 2.9e-90; % identity 100
Query:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
        MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
Subjt:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ

Query:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW
        IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW
Subjt:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW

A0A1S3CAK2 LOB domain-containing protein 4; e value 7.8e-88; % identity 97.06
Query:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
        MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ
Subjt:  MKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQ

Query:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW
        IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRL+GS SR MFEMDMVVDQA+E+SMW
Subjt:  IDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW

A0A6J1EHY2 LOB domain-containing protein 4-like; e value 8.4e-82; % identity 93.64
Query:  MKTENG-RKQ-GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ
        MKTENG RKQ GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELP+HQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ
Subjt:  MKTENG-RKQ-GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ

Query:  HQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMFEMDMVVDQAMEESMW
         QID+LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSP NSGSPSSRLMGSQ SR MFEMDMVVDQAM +S+W
Subjt:  HQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMFEMDMVVDQAMEESMW

A0A6J1KP63 LOB domain-containing protein 4-like; e value 8.4e-82; % identity 93.64
Query:  MKTENG-RKQ-GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ
        MKTENG RKQ GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELP+HQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ
Subjt:  MKTENG-RKQ-GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQ

Query:  HQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMFEMDMVVDQAMEESMW
         QID+LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSP NSGSPSSRLMGSQ SR MFEMDMVVDQAM +S+W
Subjt:  HQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMFEMDMVVDQAMEESMW

SwissProt top hits (e value; % identity; alignment)
Q8L5T5 LOB domain-containing protein 15; e value 3.0e-36; % identity 49.4
Query:  PCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQAE
        PCAACKLLRRRC +EC F+PYF   +PHKFA+VHKVFGASNV+KML E+P  QR DA +SLVYEAN R+RDPVYGC+GAIS+LQ Q+ ALQ +L   ++E
Subjt:  PCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQAE

Query:  VVHLRVR------------QTASFSNYG------LSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVD
        ++  + R            Q A F N G        P  P+    P++    S S  +F      D
Subjt:  VVHLRVR------------QTASFSNYG------LSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVD

Q8LBW3 LOB domain-containing protein 12; e value 1.3e-44; % identity 68.33
Query:  GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLA
        G G++PCA+CKLLRRRC ++C+FAPYFP D PHKFA VHKVFGASNV+KMLQELP+HQR DAV+SLV+EANARVRDPVYGCVGAIS LQ+Q+  LQ QLA
Subjt:  GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLA

Query:  LAQAEVVHLRVRQTASFSNY
        +AQAE++ ++++   +  ++
Subjt:  LAQAEVVHLRVRQTASFSNY

Q9FML4 Protein LATERAL ORGAN BOUNDARIES; e value 2.3e-36; % identity 59.38
Query:  APCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQA
        +PCAACK LRR+C   C+FAPYFP ++PHKFANVHK+FGASNV K+L EL  HQR DAV+SL YEA ARVRDPVYGCVGAIS LQ Q+  LQ +L  A A
Subjt:  APCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQA

Query:  EVVHLRVRQTASFSNYGLSPTSPSNSGS
        ++ H           YGLS ++    G+
Subjt:  EVVHLRVRQTASFSNYGLSPTSPSNSGS

Q9SA51 LOB domain-containing protein 3; e value 1.5e-40; % identity 66.67
Query:  ENGRKQGGGAAPCAACKLLRRRC-GEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQID
        + G + G   +PCA CKLLRR+C  + CVFAPYFPA +P+KFA VHK+FGASNVNKMLQEL  + R DAV S+VYEANAR++DPVYGCVG ISSL  Q++
Subjt:  ENGRKQGGGAAPCAACKLLRRRC-GEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQID

Query:  ALQTQLALAQAEVVHLR
         LQTQLA AQAE++H+R
Subjt:  ALQTQLALAQAEVVHLR

Q9SHE9 LOB domain-containing protein 4; e value 2.0e-64; % identity 74.56
Query:  ENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDA
        E+ RKQ G A+PCAACKLLRRRC ++CVF+PYFPAD+P KFANVH+VFGASNVNKMLQELPIHQRGDAVSS+VYEANARVRDPVYGCVGAISSLQ QID 
Subjt:  ENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDA

Query:  LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMF-EMDMVVDQAMEESMW
        LQ QLALAQAEVVHLRVRQ+ +F  +GL P SPS+SGSPSS+ +  Q ++ MF  MD+V + ++ ESMW
Subjt:  LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMF-EMDMVVDQAMEESMW

Arabidopsis top hits (e value; % identity; alignment)
AT1G16530.1 ASYMMETRIC LEAVES 2-like 9; e value 1.1e-41; % identity 66.67
Query:  ENGRKQGGGAAPCAACKLLRRRC-GEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQID
        + G + G   +PCA CKLLRR+C  + CVFAPYFPA +P+KFA VHK+FGASNVNKMLQEL  + R DAV S+VYEANAR++DPVYGCVG ISSL  Q++
Subjt:  ENGRKQGGGAAPCAACKLLRRRC-GEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQID

Query:  ALQTQLALAQAEVVHLR
         LQTQLA AQAE++H+R
Subjt:  ALQTQLALAQAEVVHLR

AT1G31320.1 LOB domain-containing protein 4; e value 1.4e-65; % identity 74.56
Query:  ENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDA
        E+ RKQ G A+PCAACKLLRRRC ++CVF+PYFPAD+P KFANVH+VFGASNVNKMLQELPIHQRGDAVSS+VYEANARVRDPVYGCVGAISSLQ QID 
Subjt:  ENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDA

Query:  LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMF-EMDMVVDQAMEESMW
        LQ QLALAQAEVVHLRVRQ+ +F  +GL P SPS+SGSPSS+ +  Q ++ MF  MD+V + ++ ESMW
Subjt:  LQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQ-SRTMF-EMDMVVDQAMEESMW

AT2G30130.1 Lateral organ boundaries (LOB) domain family protein; e value 9.6e-46; % identity 68.33
Query:  GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLA
        G G++PCA+CKLLRRRC ++C+FAPYFP D PHKFA VHKVFGASNV+KMLQELP+HQR DAV+SLV+EANARVRDPVYGCVGAIS LQ+Q+  LQ QLA
Subjt:  GGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLA

Query:  LAQAEVVHLRVRQTASFSNY
        +AQAE++ ++++   +  ++
Subjt:  LAQAEVVHLRVRQTASFSNY

AT5G63090.2 Lateral organ boundaries (LOB) domain family protein; e value 1.6e-37; % identity 59.38
Query:  APCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQA
        +PCAACK LRR+C   C+FAPYFP ++PHKFANVHK+FGASNV K+L EL  HQR DAV+SL YEA ARVRDPVYGCVGAIS LQ Q+  LQ +L  A A
Subjt:  APCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQA

Query:  EVVHLRVRQTASFSNYGLSPTSPSNSGS
        ++ H           YGLS ++    G+
Subjt:  EVVHLRVRQTASFSNYGLSPTSPSNSGS

AT5G63090.3 Lateral organ boundaries (LOB) domain family protein; e value 1.6e-37; % identity 59.38
Query:  APCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQA
        +PCAACK LRR+C   C+FAPYFP ++PHKFANVHK+FGASNV K+L EL  HQR DAV+SL YEA ARVRDPVYGCVGAIS LQ Q+  LQ +L  A A
Subjt:  APCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSSLVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQA

Query:  EVVHLRVRQTASFSNYGLSPTSPSNSGS
        ++ H           YGLS ++    G+
Subjt:  EVVHLRVRQTASFSNYGLSPTSPSNSGS


Sequences
CDS sequence
ATGAGTGAGAGGGGTCAGTGTCCAAACTCCAAGAGGCAGAGGAGAGTGGCAGAGAATAACAAAGGAGGAGGGGGGGATACGTGTGTCATAAGACTAGAGACGGACGATAT
GAAGACCGAAAATGGGAGGAAACAAGGGGGAGGAGCAGCCCCATGCGCAGCATGCAAGCTTCTCAGAAGACGGTGTGGGGAAGAATGTGTGTTTGCTCCATACTTTCCCG
CCGACCAACCTCATAAGTTCGCCAACGTTCATAAGGTTTTCGGCGCTAGCAATGTTAACAAAATGCTTCAGGAGCTGCCAATTCATCAACGAGGAGATGCAGTGAGCAGT
TTGGTGTACGAAGCAAATGCTCGAGTGCGCGACCCGGTGTATGGGTGTGTTGGAGCAATATCATCTCTACAGCATCAAATAGACGCCCTCCAAACTCAACTGGCTTTAGC
TCAAGCCGAGGTGGTGCACCTCAGGGTGCGTCAGACAGCATCATTTTCAAACTATGGGCTAAGCCCAACAAGCCCAAGTAATAGTGGGTCACCCTCATCAAGGCTCATGG
GCTCACAATCTAGGACCATGTTTGAGATGGATATGGTTGTGGACCAAGCCATGGAGGAATCAATGTGGTGA
mRNA sequence
ATGAGTGAGAGGGGTCAGTGTCCAAACTCCAAGAGGCAGAGGAGAGTGGCAGAGAATAACAAAGGAGGAGGGGGGGATACGTGTGTCATAAGACTAGAGACGGACGATAT
GAAGACCGAAAATGGGAGGAAACAAGGGGGAGGAGCAGCCCCATGCGCAGCATGCAAGCTTCTCAGAAGACGGTGTGGGGAAGAATGTGTGTTTGCTCCATACTTTCCCG
CCGACCAACCTCATAAGTTCGCCAACGTTCATAAGGTTTTCGGCGCTAGCAATGTTAACAAAATGCTTCAGGAGCTGCCAATTCATCAACGAGGAGATGCAGTGAGCAGT
TTGGTGTACGAAGCAAATGCTCGAGTGCGCGACCCGGTGTATGGGTGTGTTGGAGCAATATCATCTCTACAGCATCAAATAGACGCCCTCCAAACTCAACTGGCTTTAGC
TCAAGCCGAGGTGGTGCACCTCAGGGTGCGTCAGACAGCATCATTTTCAAACTATGGGCTAAGCCCAACAAGCCCAAGTAATAGTGGGTCACCCTCATCAAGGCTCATGG
GCTCACAATCTAGGACCATGTTTGAGATGGATATGGTTGTGGACCAAGCCATGGAGGAATCAATGTGGTGATATTATCATACTATTATTATTTTCCTAAAATCCCATTTC
CATGGAGATTTAGTTAGTTTTGCCATTTTTTCTTTCTTTTTTCTACCCTATTTTGTCAAATTGGCTTTCACCTGCTTGTTCGCCCATTTGCTATCTTGATTTCCTACTCA
TATACTACTTACTTTCCTATATGCCGACCTGAGCATAGCTCCAGCAGTCAGGTAATATTTTCTTTTGTTCAAGTTGGAAAGATTTGGATCATCTTTCCTAATGATAGTGT
AATACTTTTTAAGAAACTCGTATATAAATATATATATCTAAACATATTATTGAGATATTATGTTCTGG
Protein sequence
MSERGQCPNSKRQRRVAENNKGGGGDTCVIRLETDDMKTENGRKQGGGAAPCAACKLLRRRCGEECVFAPYFPADQPHKFANVHKVFGASNVNKMLQELPIHQRGDAVSS
LVYEANARVRDPVYGCVGAISSLQHQIDALQTQLALAQAEVVHLRVRQTASFSNYGLSPTSPSNSGSPSSRLMGSQSRTMFEMDMVVDQAMEESMW
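
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch in plain Python, assuming the standard nuclear genetic code (NCBI translation table 1). The `translate` helper and the constant names are hypothetical and not part of CuGenDB. Only the first 48 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Walk the sequence codon by codon; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the CDS listed above for CSPI01G34800.
CDS_START = "ATGAGTGAGAGGGGTCAGTGTCCAAACTCCAAGAGGCAGAGGAGAGTG"
CDS_END = "TCAATGTGGTGA"

if __name__ == "__main__":
    print(translate(CDS_START))  # expected to match the start of the protein sequence
    print(translate(CDS_END))    # expected to match its last residues, before the stop
```

To check the whole record, paste the full CDS into `translate` and compare the result with the protein sequence as a single string with line breaks removed.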