; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0024454 (gene) of Chayote v1 genome

Gene IDSed0024454
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG07:9102954..9106695
RNA-Seq ExpressionSed0024454
SyntenySed0024454
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576843.1 Protein MODIFYING WALL LIGNIN-2, partial [Cucurbita argyrosperma subsp. sororia]1.9e-7668.09Show/hide
Query:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG
        ME+KAL +CSVV+FL LL+VATGFAAE TRVK                 Q  QV    + T C YP+SPA  LGLTAALSLLL Q++INVSTGCICC RG
Subjt:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG

Query:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN
        PRP ASKWRTA +CF+VSWF  VIAFLLLLTGAALN+ RNEQ+ +  YY CYVL+PGVFAVAT++  ASL LGL Y+LILNSAKND  VWGNPSIPP AN
Subjt:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN

Query:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT
        IAM QPQFPPPPP    TA+P FV EDTY RRQFT
Subjt:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT

KAG7014863.1 hypothetical protein SDJN02_22493 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-7668.09Show/hide
Query:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG
        ME+KAL +CSVV+FL LL+VATGFAAE TRVK                 Q  QV    + T C YP+SPA  LGLTAALSLLL Q++INVSTGCICC RG
Subjt:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG

Query:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN
        PRP ASKWRTA +CF+VSWF  VIAFLLLLTGAALN+ RNEQ+ +  YY CYVL+PGVFAVAT++  ASL LGL Y+LILNSAKND  VWGNPSIPP AN
Subjt:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN

Query:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT
        IAM QPQFPPPPP    TA+P FV EDTY RRQFT
Subjt:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT

XP_022140830.1 uncharacterized protein LOC111011403 [Momordica charantia]1.7e-8068.51Show/hide
Query:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG
        ME+KA+ +CSVV+FL LLVVATGFAAE TR+K+                  SQV  V  +T C YPRSPA+ LGL AALSLL+ Q+ INVSTGCICC+RG
Subjt:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG

Query:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN
        PRP ASKWRT  +CF++SWF  +IAFLLLLTGAALN+RR E++Y+ GYYECYVL+PGVFAVATILA AS+VLGL Y+LILNSAKN+ TVWGNPS+PPQAN
Subjt:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN

Query:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT
        IAMGQPQFPPPPP  Q + +P FV EDTYMRRQ+T
Subjt:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT

XP_022922463.1 uncharacterized protein LOC111430466 [Cucurbita moschata]2.5e-7668.09Show/hide
Query:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG
        ME+KAL +CSVV+FL LL+VATGFAAE TRVK                 Q  QV    + T C YP+SPA  LGLTAALSLLL Q++INVSTGCICC RG
Subjt:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG

Query:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN
        PRP ASKWRTA +CF+VSWF  VIAFLLLLTGAALN+ RNEQ+ +  YY CYVL+PGVFAVAT++  ASL LGL Y+LILNSAKND  VWGNPSIPP AN
Subjt:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN

Query:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT
        IAM QPQFPPPPP    TA+P FV EDTY RRQFT
Subjt:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT

XP_023551924.1 uncharacterized protein LOC111809750 [Cucurbita pepo subsp. pepo]3.6e-7567.66Show/hide
Query:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG
        ME+KAL +CSVV+ L LL+VATGFAAE TRVK                 Q  QV    + T C YP+SPA  LGLTAALSLLL Q++INVSTGCICC RG
Subjt:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG

Query:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN
        PRP ASKWRTA +CF+VSWF  VIAFLLLLTGAALN  RNEQ+ +  YY CYVL+PGVFAVAT++  ASL LGL Y+LILNSAKND  VWGNPSIPP AN
Subjt:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN

Query:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT
        IAM QPQFPPPPP    TA+P FV EDTY RRQFT
Subjt:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT

TrEMBL top hitse value%identityAlignment
A0A0A0KU80 Uncharacterized protein4.3e-6661.54Show/hide
Query:  KKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPR
        K AL++  VV+ L ++++ATGFAAEATR K                   +QV  V     C YPRSPA+ LGLTAALSLL  Q+ I  STGC+CC+RGPR
Subjt:  KKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPR

Query:  PTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKND-RTVWGNPSIPPQANI
        P ASKWRTA ICF +SW   VIAFLL LTGAALN  R EQ  +   Y+CYVL+PGVF+ ATI+ +ASL LG+ YFLILNSAKND  TVWG+PS+PPQ NI
Subjt:  PTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKND-RTVWGNPSIPPQANI

Query:  AMGQPQF--PPPPPQGTANPAFVREDTYMRRQFT
        AM QPQF  PPPPPQ TA+P FV EDTYMRRQFT
Subjt:  AMGQPQF--PPPPPQGTANPAFVREDTYMRRQFT

A0A1S3BX72 uncharacterized protein LOC1034944359.5e-6661.92Show/hide
Query:  MEKKALLLCS-VVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLR
        ME+KA L+   VV+FL ++++ATGFAAEATR K K                  QV  V     C YPRSPAM LG TAALSLL  Q+ I  STGC+CC+R
Subjt:  MEKKALLLCS-VVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLR

Query:  GPRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKND-RTVWGNPSIPPQ
        GPRP   KWRTA ICF VSW   VIAFLL LTGAALN  R++Q  + G YECYVL+PGVF+ ATI+ VASL LG+ YFLILNSAKND  TVWG+PS+PPQ
Subjt:  GPRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKND-RTVWGNPSIPPQ

Query:  ANIAMGQPQF----PPPPPQGTANPAFVREDTYMRRQFT
         NIAM QPQF    PPPPPQ T +P FV EDTYMRRQFT
Subjt:  ANIAMGQPQF----PPPPPQGTANPAFVREDTYMRRQFT

A0A5D3D069 DUF1218 domain-containing protein9.5e-6661.92Show/hide
Query:  MEKKALLLCS-VVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLR
        ME+KA L+   VV+FL ++++ATGFAAEATR K K                  QV  V     C YPRSPAM LG TAALSLL  Q+ I  STGC+CC+R
Subjt:  MEKKALLLCS-VVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLR

Query:  GPRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKND-RTVWGNPSIPPQ
        GPRP   KWRTA ICF VSW   VIAFLL LTGAALN  R++Q  + G YECYVL+PGVF+ ATI+ VASL LG+ YFLILNSAKND  TVWG+PS+PPQ
Subjt:  GPRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKND-RTVWGNPSIPPQ

Query:  ANIAMGQPQF----PPPPPQGTANPAFVREDTYMRRQFT
         NIAM QPQF    PPPPPQ T +P FV EDTYMRRQFT
Subjt:  ANIAMGQPQF----PPPPPQGTANPAFVREDTYMRRQFT

A0A6J1CG96 uncharacterized protein LOC1110114038.0e-8168.51Show/hide
Query:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG
        ME+KA+ +CSVV+FL LLVVATGFAAE TR+K+                  SQV  V  +T C YPRSPA+ LGL AALSLL+ Q+ INVSTGCICC+RG
Subjt:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG

Query:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN
        PRP ASKWRT  +CF++SWF  +IAFLLLLTGAALN+RR E++Y+ GYYECYVL+PGVFAVATILA AS+VLGL Y+LILNSAKN+ TVWGNPS+PPQAN
Subjt:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN

Query:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT
        IAMGQPQFPPPPP  Q + +P FV EDTYMRRQ+T
Subjt:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT

A0A6J1E6P1 uncharacterized protein LOC1114304661.2e-7668.09Show/hide
Query:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG
        ME+KAL +CSVV+FL LL+VATGFAAE TRVK                 Q  QV    + T C YP+SPA  LGLTAALSLLL Q++INVSTGCICC RG
Subjt:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG

Query:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN
        PRP ASKWRTA +CF+VSWF  VIAFLLLLTGAALN+ RNEQ+ +  YY CYVL+PGVFAVAT++  ASL LGL Y+LILNSAKND  VWGNPSIPP AN
Subjt:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN

Query:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT
        IAM QPQFPPPPP    TA+P FV EDTY RRQFT
Subjt:  IAMGQPQFPPPPP--QGTANPAFVREDTYMRRQFT

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-15.8e-0433.87Show/hide
Query:  CNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASKWRTAAICFIV---SWFLDVIAFLLLLTGAALNERRNEQNYFGGYY--ECYVLRPG
        C  P + A  LG+ A + + + Q++ NV    IC  RG   T  K RT   C I+   SW    +A  L+  GA++N    EQ Y  G+   ECY+++ G
Subjt:  CNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASKWRTAAICFIV---SWFLDVIAFLLLLTGAALNERRNEQNYFGGYY--ECYVLRPG

Query:  VFAVATILAVASL--VLGLVYFLI
        VFA +  L+V ++  +LG   F +
Subjt:  VFAVATILAVASL--VLGLVYFLI

Arabidopsis top hitse value%identityAlignment
AT1G61065.1 Protein of unknown function (DUF1218)4.7e-0933.33Show/hide
Query:  CNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASK-WRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQN-YFGGYY-ECYVLRPGVF
        C Y +  A  LG+ + L LL  Q++I V++ C+CC R   P+ S+ W  A   FI +W    IA + LL G+  N    +   YFG     C  LR GVF
Subjt:  CNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASK-WRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQN-YFGGYY-ECYVLRPGVF

Query:  AVATILAVASLVLGLVYFLILNSAKN
               V + ++  +Y++ L+ AK+
Subjt:  AVATILAVASLVLGLVYFLILNSAKN

AT1G68220.1 Protein of unknown function (DUF1218)2.6e-0729.21Show/hide
Query:  LCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASK
        + +VV+ L LL     F AE  R        S+ +     Y++          T C Y    + V G++A   LL+ Q ++N  T C+C  +G     S 
Subjt:  LCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASK

Query:  WRTAAICFIVSWFLDVIAFLLLLTGAALN--ERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSA
           A + F+VSW   + A   LL G+A N    ++E  Y G    C VL  GVFA      + SL+  ++Y+L  + A
Subjt:  WRTAAICFIVSWFLDVIAFLLLLTGAALN--ERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSA

AT4G27435.1 Protein of unknown function (DUF1218)4.9e-0629.27Show/hide
Query:  CNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQN--YFGGYYECYVLRPGVFA
        C Y    A   G+ A L  +  Q++I + + C CC +  +P  S+   A I FIVSW   +IA + LL G+  N    +    +     +C  LR GVFA
Subjt:  CNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQN--YFGGYYECYVLRPGVFA

Query:  VATILAVASLVLGLVYFLILNSA
                + ++   Y+    SA
Subjt:  VATILAVASLVLGLVYFLILNSA

AT5G17210.1 Protein of unknown function (DUF1218)8.8e-4843.35Show/hide
Query:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG
        ME++ +++C V+  L LL   T F AEATR+K                     V +  S T+C YPRSPA  LG T+AL L++ Q++++VS+GC CC +G
Subjt:  MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRG

Query:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN
        P P+ S W  + ICF+VSWF  VIAFL+LL+GAALN+   E++   G Y CY+++PGVF+   +L++ ++ LG+VY+L L S K         +      
Subjt:  PRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQAN

Query:  IAMGQPQFPPPPPQGTANPAFVREDTYMRRQFT
        IAMGQPQ     P+   +P FV EDTYMRRQFT
Subjt:  IAMGQPQFPPPPPQGTANPAFVREDTYMRRQFT

AT5G17210.2 Protein of unknown function (DUF1218)1.3e-4347.78Show/hide
Query:  VNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYV
        V +  S T+C YPRSPA  LG T+AL L++ Q++++VS+GC CC +GP P+ S W  + ICF+VSWF  VIAFL+LL+GAALN+   E++   G Y CY+
Subjt:  VNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASKWRTAAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYV

Query:  LRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQANIAMGQPQFPPPPPQGTANPAFVREDTYMRRQFT
        ++PGVF+   +L++ ++ LG+VY+L L S K         +      IAMGQPQ     P+   +P FV EDTYMRRQFT
Subjt:  LRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQANIAMGQPQFPPPPPQGTANPAFVREDTYMRRQFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGAAGGCTCTGCTTCTGTGCTCTGTGGTTTCCTTCCTGATTCTTCTCGTCGTCGCCACTGGCTTCGCCGCCGAGGCCACAAGAGTTAAGGTGAAAATCCATCT
TACAAGTAGTATTATTGAAAACAGTTGTGATTATGAACAGCCTAGTCAAGTTAATATAGTCGGTTCTTCTACTAGGTGCAATTATCCCCGAAGTCCTGCGATGGTCCTTG
GTTTGACTGCAGCTCTATCACTTTTGCTTGGTCAGATGATGATAAATGTTTCAACCGGATGCATTTGCTGCTTGAGAGGCCCTCGGCCAACAGCTTCTAAATGGCGAACC
GCCGCGATCTGCTTCATTGTTTCCTGGTTTTTAGATGTGATAGCATTCCTCCTGTTGCTCACTGGTGCTGCACTGAACGAACGGCGCAACGAACAAAACTATTTTGGCGG
TTACTACGAGTGCTATGTTCTGAGACCGGGAGTTTTTGCTGTTGCTACCATTCTGGCAGTTGCAAGTTTAGTGCTTGGATTGGTCTATTTCCTCATATTGAACTCTGCAA
AGAATGATCGTACTGTGTGGGGCAATCCTTCCATTCCTCCTCAAGCAAACATTGCCATGGGGCAGCCCCAATTCCCTCCCCCTCCTCCACAGGGAACTGCAAACCCCGCA
TTCGTTCGCGAAGACACGTATATGAGACGACAATTCACATGA
mRNA sequenceShow/hide mRNA sequence
ATTGTCATTTCCTCTCAGCAACACGGTTTCCTCAATCCAAATCTCGCCGGAGAAGCCATCGGCGGCGGCGGACATGGAGAAGAAGGCTCTGCTTCTGTGCTCTGTGGTTT
CCTTCCTGATTCTTCTCGTCGTCGCCACTGGCTTCGCCGCCGAGGCCACAAGAGTTAAGGTGAAAATCCATCTTACAAGTAGTATTATTGAAAACAGTTGTGATTATGAA
CAGCCTAGTCAAGTTAATATAGTCGGTTCTTCTACTAGGTGCAATTATCCCCGAAGTCCTGCGATGGTCCTTGGTTTGACTGCAGCTCTATCACTTTTGCTTGGTCAGAT
GATGATAAATGTTTCAACCGGATGCATTTGCTGCTTGAGAGGCCCTCGGCCAACAGCTTCTAAATGGCGAACCGCCGCGATCTGCTTCATTGTTTCCTGGTTTTTAGATG
TGATAGCATTCCTCCTGTTGCTCACTGGTGCTGCACTGAACGAACGGCGCAACGAACAAAACTATTTTGGCGGTTACTACGAGTGCTATGTTCTGAGACCGGGAGTTTTT
GCTGTTGCTACCATTCTGGCAGTTGCAAGTTTAGTGCTTGGATTGGTCTATTTCCTCATATTGAACTCTGCAAAGAATGATCGTACTGTGTGGGGCAATCCTTCCATTCC
TCCTCAAGCAAACATTGCCATGGGGCAGCCCCAATTCCCTCCCCCTCCTCCACAGGGAACTGCAAACCCCGCATTCGTTCGCGAAGACACGTATATGAGACGACAATTCA
CATGATTGGTAACTGTAGGTCGATACCGAATGCGTTTAAGAAAACCATATAACTCATACCACATGATTGATGTATCTTTTGAAACCAGGCAATAAGTTGAACAAATATGT
TTGCTCCGTAGAAATATATGAAATAGTGATTGTAGCTAAAATGATTTGTTTGTAGGGTATATACCAAATAATGATTCTAGCTAAAACTAGATTTCTAAAGAAAGGTGGCT
CATAGGAAACCTTTATATTCTCTTTTCAAATTACACTCTTATCATCATTATTATTATTATTATTTTTTGAGTACCTCTTATTATTTTTTAGTGAACACAATATTTGGTAT
TGAGATTCAAA
Protein sequenceShow/hide protein sequence
MEKKALLLCSVVSFLILLVVATGFAAEATRVKVKIHLTSSIIENSCDYEQPSQVNIVGSSTRCNYPRSPAMVLGLTAALSLLLGQMMINVSTGCICCLRGPRPTASKWRT
AAICFIVSWFLDVIAFLLLLTGAALNERRNEQNYFGGYYECYVLRPGVFAVATILAVASLVLGLVYFLILNSAKNDRTVWGNPSIPPQANIAMGQPQFPPPPPQGTANPA
FVREDTYMRRQFT