; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G012860 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G012860
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationCmo_Chr09:11422299..11428815
RNA-Seq ExpressionCmoCh09G012860
SyntenyCmoCh09G012860
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022925442.1 UPF0481 protein At3g47200-like [Cucurbita moschata]1.2e-5863.49Show/hide
Query:  EVKRVMELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSEL
        E+ R+     L   + Q+L SHL N+ DDLE K NGASIYRIP+HIKKV+P AFKP+ +SFGPYHH + HL P+EK KHLAL  F+RRCGL  EDIV+EL
Subjt:  EVKRVMELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSEL

Query:  FIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLL
        + MLEDLQRSYDKLDD+WK  P KFLE+MILDGC M+ VL  D     FT+TDV+RDMLLLENQLP+ LLDKLY M   E+N+   SL+
Subjt:  FIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLL

XP_022925443.1 UPF0481 protein At3g47200-like [Cucurbita moschata]3.3e-6466.18Show/hide
Query:  KDEVKRVMELSPLYYRVV---QSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTED
        KD V  V+ELSPL   VV   ++L+  L ++ D  + K NGASIYRIPEHIKKVNPNAFKP+ +SFGPYHH + HLLP+EKKK LA+ +F+RRCGLSTED
Subjt:  KDEVKRVMELSPLYYRVV---QSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTED

Query:  IVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLLSSSPFL
        +V+EL+ MLEDLQ SYDKLDDKWKT P KFLELM+LDGCFMVLVL      + F   D +RDML+LENQLP+ LLDKLYSML+QENNQ+ L LL+SSP L
Subjt:  IVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLLSSSPFL

Query:  FELV
         ++V
Subjt:  FELV

XP_022925445.1 putative UPF0481 protein At3g02645 [Cucurbita moschata]3.6e-9599.44Show/hide
Query:  MELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLE
        MELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLE
Subjt:  MELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLE

Query:  DLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQV
        DLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQ+
Subjt:  DLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQV

XP_023535324.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]1.2e-5863.49Show/hide
Query:  EVKRVMELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSEL
        E+ R+     L   + Q+L SHL N+ DDLE K NGASIYRIP+HIKKV+P AFKP+ +SFGPYHH + HL P+EK KHLAL  F+RRCGL  EDIV+EL
Subjt:  EVKRVMELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSEL

Query:  FIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLL
        + MLEDLQRSYDKLDD+WK  P KFLE+MILDGC M+ VL  D     FT+TDV+RDMLLLENQLP+ LLDKLY M   E+N+   SL+
Subjt:  FIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLL

XP_023535502.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]3.6e-5876.33Show/hide
Query:  MLLLENQLPMKLLDQLYSMLMKENIQ-VKSLVWKSMDLPT--KSMLMEEDYSHLLDMYRAELRFHKGNEQNLLQISYSGMGHEIQLARRFHKVGIKLKKG
        MLLLENQLPMKLLD+L  ML  +NI+ +KSLVW+S +LPT  K  LM  DY HLLDMYR EL FH GNE  LLQ S+ GM HEIQLARRF K GIKLKKG
Subjt:  MLLLENQLPMKLLDQLYSMLMKENIQ-VKSLVWKSMDLPT--KSMLMEEDYSHLLDMYRAELRFHKGNEQNLLQISYSGMGHEIQLARRFHKVGIKLKKG

Query:  CNLGDVDFDENKGVLSLPFIEMNANIESGLLNVMTFEKLVEIDNVVGSFVILMGNLLEKDEVKRVMELS
        CN+ DVDFDENKGVLSLPFIEMNANIESGLLNVMTFEKLV IDN+VGSFVILMGNLLEKDEV    +L+
Subjt:  CNLGDVDFDENKGVLSLPFIEMNANIESGLLNVMTFEKLVEIDNVVGSFVILMGNLLEKDEVKRVMELS

TrEMBL top hitse value%identityAlignment
A0A6J1EBQ1 UPF0481 protein At3g47200-like5.9e-5963.49Show/hide
Query:  EVKRVMELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSEL
        E+ R+     L   + Q+L SHL N+ DDLE K NGASIYRIP+HIKKV+P AFKP+ +SFGPYHH + HL P+EK KHLAL  F+RRCGL  EDIV+EL
Subjt:  EVKRVMELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSEL

Query:  FIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLL
        + MLEDLQRSYDKLDD+WK  P KFLE+MILDGC M+ VL  D     FT+TDV+RDMLLLENQLP+ LLDKLY M   E+N+   SL+
Subjt:  FIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLL

A0A6J1EC69 UPF0481 protein At3g47200-like1.0e-5568.07Show/hide
Query:  VVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLEDLQRSYDKL
        V Q+++SHL+N+ DDLESK N ASIYRIPEHIKKVNPNAFKP+LISFGPYHH + HL+P EK KHLA   F++RCGLS ED+V+E++ MLEDLQRSYDKL
Subjt:  VVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLEDLQRSYDKL

Query:  DDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQE
        DDKWKT+P KFLE+MILDGCF++ VL   +        DV+RD+LLLENQLP+ LL KL SML  E
Subjt:  DDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQE

A0A6J1EF72 UPF0481 protein At3g47200-like1.6e-6466.18Show/hide
Query:  KDEVKRVMELSPLYYRVV---QSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTED
        KD V  V+ELSPL   VV   ++L+  L ++ D  + K NGASIYRIPEHIKKVNPNAFKP+ +SFGPYHH + HLLP+EKKK LA+ +F+RRCGLSTED
Subjt:  KDEVKRVMELSPLYYRVV---QSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTED

Query:  IVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLLSSSPFL
        +V+EL+ MLEDLQ SYDKLDDKWKT P KFLELM+LDGCFMVLVL      + F   D +RDML+LENQLP+ LLDKLYSML+QENNQ+ L LL+SSP L
Subjt:  IVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLLSSSPFL

Query:  FELV
         ++V
Subjt:  FELV

A0A6J1EI01 putative UPF0481 protein At3g026451.8e-9599.44Show/hide
Query:  MELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLE
        MELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLE
Subjt:  MELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLE

Query:  DLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQV
        DLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQ+
Subjt:  DLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQV

A0A6J1EI35 UPF0481 protein At3g47200-like4.9e-5360.87Show/hide
Query:  VVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC-GLSTEDIVSELFIMLEDLQRSYDK
        ++Q+LESHL+N+   +ESK  GASIYRIPEHI  VNPNAFKP+L+SFGPYHH + HL+P+EKKKH AL+ FK+ C GL+TE IVS L+ ML DLQ SYDK
Subjt:  VVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC-GLSTEDIVSELFIMLEDLQRSYDK

Query:  LDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLLSSSPFLFE
        LDDKWK +P KFLELMILDGC ++ +   D     F + DV RDMLLLENQLP+MLLDKLYS+L+   N+  +  L+    ++E
Subjt:  LDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKLYSMLRQENNQVCLSLLSSSPFLFE

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026453.3e-0625.45Show/hide
Query:  VVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC-GLSTEDIVSELFIMLEDLQRSYDK
        V +SL++ L   + DLE  +   SI+ +P+ +   +P+++ P  +S GPYH     L  +E+ K +     + +       D+V +L  M   ++  Y K
Subjt:  VVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC-GLSTEDIVSELFIMLEDLQRSYDK

Query:  LDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFT------STDVVRDMLLLENQLPLMLLDK
               + +  L +M +D  F++  L   + R++ T        +++RD++++ENQ+PL +L K
Subjt:  LDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFT------STDVVRDMLLLENQLPLMLLDK

Q9SD53 UPF0481 protein At3g472003.6e-1333.56Show/hide
Query:  IYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLE--KKKHLALYNFKRRCGLSTEDIVSELFIMLED-LQRSYDKLDDKWKTDPDKFLELMILDGCF
        I+R+PE    +NP A+KP+++S GPYH+ + HL  ++  K + L L+  + +     E+++ +  + LED +++SY    ++ KT  D  + +M+LDGCF
Subjt:  IYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLE--KKKHLALYNFKRRCGLSTEDIVSELFIMLED-LQRSYDKLDDKWKTDPDKFLELMILDGCF

Query:  MVLV---------LSNDACREI-FTSTDVVRDMLLLENQLPLMLLDKLY
        +++V         LS D    I +  + +  D+LLLENQ+P  +L  LY
Subjt:  MVLV---------LSNDACREI-FTSTDVVRDMLLLENQLPLMLLDKLY

Arabidopsis top hitse value%identityAlignment
AT3G47210.1 Plant protein of unknown function (DUF247)2.3e-1529.73Show/hide
Query:  IYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVL
        I+R+P+   ++NP A+KP+++S GPYHH   HL  +++ K   L+ F R   +  + + + +    +++++SY    +  +  P + + +MILDGCF+++
Subjt:  IYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVL

Query:  VLSNDACR-EIFTSTD-----------VVRDMLLLENQLPLMLLDKLY
        +L   + + E++ S D           +  D+LLLENQ+P  +L  L+
Subjt:  VLSNDACR-EIFTSTD-----------VVRDMLLLENQLPLMLLDKLY

AT3G47250.1 Plant protein of unknown function (DUF247)1.5e-1430Show/hide
Query:  LSPLYYRVVQSLESHLS-NVDDD------LESKSNGA-SIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC---GLSTEDI
        LSPL  R  +S+ SH+  N+         LES    +  I+RIP+ + +VNP A+KP+++S GPYH+ + HL  +++ K   L  F  R    G+    +
Subjt:  LSPLYYRVVQSLESHLS-NVDDD------LESKSNGA-SIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC---GLSTEDI

Query:  VSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVL-------SNDACRE-IFT----STDVVRDMLLLENQLPLMLLDKLY
         + +  +   ++ SY    ++ + +  + + +MILDGCF++++L         D  ++ IFT       +  D+LLLENQ+P  +L  ++
Subjt:  VSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVL-------SNDACRE-IFT----STDVVRDMLLLENQLPLMLLDKLY

AT3G47250.2 Plant protein of unknown function (DUF247)1.5e-1430Show/hide
Query:  LSPLYYRVVQSLESHLS-NVDDD------LESKSNGA-SIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC---GLSTEDI
        LSPL  R  +S+ SH+  N+         LES    +  I+RIP+ + +VNP A+KP+++S GPYH+ + HL  +++ K   L  F  R    G+    +
Subjt:  LSPLYYRVVQSLESHLS-NVDDD------LESKSNGA-SIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC---GLSTEDI

Query:  VSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVL-------SNDACRE-IFT----STDVVRDMLLLENQLPLMLLDKLY
         + +  +   ++ SY    ++ + +  + + +MILDGCF++++L         D  ++ IFT       +  D+LLLENQ+P  +L  ++
Subjt:  VSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVL-------SNDACRE-IFT----STDVVRDMLLLENQLPLMLLDKLY

AT3G47250.3 Plant protein of unknown function (DUF247)1.5e-1430Show/hide
Query:  LSPLYYRVVQSLESHLS-NVDDD------LESKSNGA-SIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC---GLSTEDI
        LSPL  R  +S+ SH+  N+         LES    +  I+RIP+ + +VNP A+KP+++S GPYH+ + HL  +++ K   L  F  R    G+    +
Subjt:  LSPLYYRVVQSLESHLS-NVDDD------LESKSNGA-SIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRC---GLSTEDI

Query:  VSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVL-------SNDACRE-IFT----STDVVRDMLLLENQLPLMLLDKLY
         + +  +   ++ SY    ++ + +  + + +MILDGCF++++L         D  ++ IFT       +  D+LLLENQ+P  +L  ++
Subjt:  VSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVL-------SNDACRE-IFT----STDVVRDMLLLENQLPLMLLDKLY

AT4G31980.1 unknown protein5.6e-1733.33Show/hide
Query:  LESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELM
        L S S    IY++P  ++++NP+A+ PRL+SFGP H     L  +E +K+  L +F  R   S ED+V       ++ +  Y    +  K   D+F+E++
Subjt:  LESKSNGASIYRIPEHIKKVNPNAFKPRLISFGPYHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELM

Query:  ILDGCFMVLVL----------SNDACREIFTS----TDVVRDMLLLENQLPLMLLDKLYSML
        ++DG F+V +L           ND    IF +    TDV RDM+L+ENQLP  ++ +++ +L
Subjt:  ILDGCFMVLVL----------SNDACREIFTS----TDVVRDMLLLENQLPLMLLDKLYSML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCTGCTTGAGAATCAGCTGCCCATGAAGCTTCTTGACCAGCTGTATTCCATGTTAATGAAAGAGAACATCCAAGTCAAATCGCTTGTTTGGAAGTCCATGGACTT
ACCCACAAAGAGCATGTTAATGGAGGAAGACTACTCGCACCTTCTGGATATGTATAGGGCAGAACTGCGGTTTCACAAGGGGAATGAACAGAATCTACTTCAAATCAGCT
ACTCGGGAATGGGTCATGAGATTCAGCTAGCCAGACGCTTCCATAAAGTTGGGATCAAACTCAAGAAAGGGTGTAACCTTGGGGACGTGGATTTTGATGAAAACAAAGGT
GTGTTGAGCCTCCCGTTCATCGAAATGAATGCTAACATTGAATCAGGCTTATTGAACGTGATGACATTTGAGAAGCTTGTTGAGATTGATAACGTAGTAGGCTCTTTCGT
CATCTTGATGGGTAATCTACTGGAGAAAGATGAGGTCAAAAGAGTAATGGAGCTGTCACCACTCTATTACCGAGTGGTCCAATCTCTAGAATCCCATTTGTCAAATGTTG
ACGATGATCTTGAAAGCAAAAGCAATGGAGCTTCAATCTATAGAATACCCGAGCATATAAAGAAGGTTAATCCAAACGCATTCAAACCGCGGCTGATATCGTTCGGGCCA
TACCACCATAGGGATGCGCATTTGTTGCCATTGGAGAAGAAAAAACATTTAGCACTTTACAACTTTAAAAGGCGCTGCGGATTGTCCACTGAAGACATTGTGAGTGAGCT
GTTCATCATGTTGGAAGATCTGCAGAGATCATATGATAAACTTGATGATAAATGGAAAACAGACCCAGACAAATTTTTGGAGCTCATGATCTTGGATGGTTGCTTCATGG
TGCTTGTCTTGTCGAATGATGCCTGTAGAGAGATATTTACGAGTACGGATGTAGTGCGAGATATGTTGCTGCTTGAGAATCAGCTGCCCTTGATGCTTCTTGACAAGCTG
TATTCCATGTTAAGGCAAGAGAATAACCAAGTATGTCTATCTCTACTTAGCTCAAGCCCATTCCTTTTTGAGCTGGTTAGGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGCTGCTTGAGAATCAGCTGCCCATGAAGCTTCTTGACCAGCTGTATTCCATGTTAATGAAAGAGAACATCCAAGTCAAATCGCTTGTTTGGAAGTCCATGGACTT
ACCCACAAAGAGCATGTTAATGGAGGAAGACTACTCGCACCTTCTGGATATGTATAGGGCAGAACTGCGGTTTCACAAGGGGAATGAACAGAATCTACTTCAAATCAGCT
ACTCGGGAATGGGTCATGAGATTCAGCTAGCCAGACGCTTCCATAAAGTTGGGATCAAACTCAAGAAAGGGTGTAACCTTGGGGACGTGGATTTTGATGAAAACAAAGGT
GTGTTGAGCCTCCCGTTCATCGAAATGAATGCTAACATTGAATCAGGCTTATTGAACGTGATGACATTTGAGAAGCTTGTTGAGATTGATAACGTAGTAGGCTCTTTCGT
CATCTTGATGGGTAATCTACTGGAGAAAGATGAGGTCAAAAGAGTAATGGAGCTGTCACCACTCTATTACCGAGTGGTCCAATCTCTAGAATCCCATTTGTCAAATGTTG
ACGATGATCTTGAAAGCAAAAGCAATGGAGCTTCAATCTATAGAATACCCGAGCATATAAAGAAGGTTAATCCAAACGCATTCAAACCGCGGCTGATATCGTTCGGGCCA
TACCACCATAGGGATGCGCATTTGTTGCCATTGGAGAAGAAAAAACATTTAGCACTTTACAACTTTAAAAGGCGCTGCGGATTGTCCACTGAAGACATTGTGAGTGAGCT
GTTCATCATGTTGGAAGATCTGCAGAGATCATATGATAAACTTGATGATAAATGGAAAACAGACCCAGACAAATTTTTGGAGCTCATGATCTTGGATGGTTGCTTCATGG
TGCTTGTCTTGTCGAATGATGCCTGTAGAGAGATATTTACGAGTACGGATGTAGTGCGAGATATGTTGCTGCTTGAGAATCAGCTGCCCTTGATGCTTCTTGACAAGCTG
TATTCCATGTTAAGGCAAGAGAATAACCAAGTATGTCTATCTCTACTTAGCTCAAGCCCATTCCTTTTTGAGCTGGTTAGGAAGTAG
Protein sequenceShow/hide protein sequence
MLLLENQLPMKLLDQLYSMLMKENIQVKSLVWKSMDLPTKSMLMEEDYSHLLDMYRAELRFHKGNEQNLLQISYSGMGHEIQLARRFHKVGIKLKKGCNLGDVDFDENKG
VLSLPFIEMNANIESGLLNVMTFEKLVEIDNVVGSFVILMGNLLEKDEVKRVMELSPLYYRVVQSLESHLSNVDDDLESKSNGASIYRIPEHIKKVNPNAFKPRLISFGP
YHHRDAHLLPLEKKKHLALYNFKRRCGLSTEDIVSELFIMLEDLQRSYDKLDDKWKTDPDKFLELMILDGCFMVLVLSNDACREIFTSTDVVRDMLLLENQLPLMLLDKL
YSMLRQENNQVCLSLLSSSPFLFELVRK