; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0019983 (gene) of Chayote v1 genome

Gene IDSed0019983
OrganismSechium edule (Chayote v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG07:9109179..9112623
RNA-Seq ExpressionSed0019983
SyntenySed0019983
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576843.1 Protein MODIFYING WALL LIGNIN-2, partial [Cucurbita argyrosperma subsp. sororia]1.1e-8677.88Show/hide
Query:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS
        ME+KAL VCSVV+FL LL++ATGFAAEGTRVK +QV +V+P T C YP+SPA ALG TAALSLLLAQI+INVSTGCICC RGPRP ASKWRTA++CF+VS
Subjt:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS

Query:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T
        W  +VIAFLLLL GAALND RNEQ  YF YY CYVLKPGVFAVAT++  ASL LGL Y+L LNSAKNDP VWGNPSIPP ANIAM QPQFPPPPP +  T
Subjt:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T

Query:  ADPVFVHEDTYMRRQFT
        ADPVFVHEDTY RRQFT
Subjt:  ADPVFVHEDTYMRRQFT

KAG7014863.1 hypothetical protein SDJN02_22493 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-8677.88Show/hide
Query:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS
        ME+KAL VCSVV+FL LL++ATGFAAEGTRVK +QV +V+P T C YP+SPA ALG TAALSLLLAQI+INVSTGCICC RGPRP ASKWRTA++CF+VS
Subjt:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS

Query:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T
        W  +VIAFLLLL GAALND RNEQ  YF YY CYVLKPGVFAVAT++  ASL LGL Y+L LNSAKNDP VWGNPSIPP ANIAM QPQFPPPPP +  T
Subjt:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T

Query:  ADPVFVHEDTYMRRQFT
        ADPVFVHEDTY RRQFT
Subjt:  ADPVFVHEDTYMRRQFT

XP_022140830.1 uncharacterized protein LOC111011403 [Momordica charantia]3.3e-9177.88Show/hide
Query:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS
        ME+KA+ VCSVV+FL LLV+ATGFAAEGTR+KLSQV +V+P T C YPRSPA+ LG  AALSLL+AQ+ INVSTGCICC+RGPRP ASKWRT ++CF++S
Subjt:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS

Query:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPP--QRT
        W  ++IAFLLLL GAALNDRR E+ YYFGYYECYVLKPGVFAVATILA AS+VLGL Y+L LNSAKN+P VWGNPS+PPQANIAMGQPQFPPPPP  QR+
Subjt:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPP--QRT

Query:  ADPVFVHEDTYMRRQFT
         DPVFVHEDTYMRRQ+T
Subjt:  ADPVFVHEDTYMRRQFT

XP_022922463.1 uncharacterized protein LOC111430466 [Cucurbita moschata]1.4e-8677.88Show/hide
Query:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS
        ME+KAL VCSVV+FL LL++ATGFAAEGTRVK +QV +V+P T C YP+SPA ALG TAALSLLLAQI+INVSTGCICC RGPRP ASKWRTA++CF+VS
Subjt:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS

Query:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T
        W  +VIAFLLLL GAALND RNEQ  YF YY CYVLKPGVFAVAT++  ASL LGL Y+L LNSAKNDP VWGNPSIPP ANIAM QPQFPPPPP +  T
Subjt:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T

Query:  ADPVFVHEDTYMRRQFT
        ADPVFVHEDTY RRQFT
Subjt:  ADPVFVHEDTYMRRQFT

XP_023551924.1 uncharacterized protein LOC111809750 [Cucurbita pepo subsp. pepo]4.6e-8576.96Show/hide
Query:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS
        ME+KAL VCSVV+ L LL++ATGFAAEGTRVK +QV +V+P T C YP+SPA ALG TAALSLLLAQI+INVSTGCICC RGPRP ASKWRTA++CF+VS
Subjt:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS

Query:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T
        W  +VIAFLLLL GAALN  RNEQ  YF YY CYVLKPGVFAVAT++  ASL LGL Y+L LNSAKNDP VWGNPSIPP ANIAM QPQFPPPPP +  T
Subjt:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T

Query:  ADPVFVHEDTYMRRQFT
        ADPVFVHEDTY RRQFT
Subjt:  ADPVFVHEDTYMRRQFT

TrEMBL top hitse value%identityAlignment
A0A0A0KU80 Uncharacterized protein3.4e-7871.76Show/hide
Query:  KKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSWI
        K AL+V  VV+ L +++IATGFAAE TR K +QVT+V+P   C YPRSPA+ LG TAALSLL AQI I  STGC+CC+RGPRP ASKWRTA+ICF +SW+
Subjt:  KKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSWI

Query:  LYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPA-VWGNPSIPPQANIAMGQPQF--PPPPPQRTA
         YVIAFLL L GAALN+ R EQR YF  Y+CYVLKPGVF+ ATI+ +ASL LG+ YFL LNSAKNDP+ VWG+PS+PPQ NIAM QPQF  PPPPPQRTA
Subjt:  LYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPA-VWGNPSIPPQANIAMGQPQF--PPPPPQRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQFT
Subjt:  DPVFVHEDTYMRRQFT

A0A1S3BX72 uncharacterized protein LOC1034944354.5e-7872.85Show/hide
Query:  MEKKALLVCS-VVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIV
        ME+KA LV   VV+FL +++IATGFAAE TR K  QVTRV+P   C YPRSPAM LGFTAALSLL AQI I  STGC+CC+RGPRP   KWRTA+ICF V
Subjt:  MEKKALLVCS-VVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIV

Query:  SWILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPA-VWGNPSIPPQANIAMGQPQF----PPPP
        SWI YVIAFLL L GAALN+ R++QR Y G YECYVLKPGVF+ ATI+ VASL LG+ YFL LNSAKNDP+ VWG+PS+PPQ NIAM QPQF    PPPP
Subjt:  SWILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPA-VWGNPSIPPQANIAMGQPQF----PPPP

Query:  PQRTADPVFVHEDTYMRRQFT
        PQRT DPVFVHEDTYMRRQFT
Subjt:  PQRTADPVFVHEDTYMRRQFT

A0A5D3D069 DUF1218 domain-containing protein4.5e-7872.85Show/hide
Query:  MEKKALLVCS-VVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIV
        ME+KA LV   VV+FL +++IATGFAAE TR K  QVTRV+P   C YPRSPAM LGFTAALSLL AQI I  STGC+CC+RGPRP   KWRTA+ICF V
Subjt:  MEKKALLVCS-VVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIV

Query:  SWILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPA-VWGNPSIPPQANIAMGQPQF----PPPP
        SWI YVIAFLL L GAALN+ R++QR Y G YECYVLKPGVF+ ATI+ VASL LG+ YFL LNSAKNDP+ VWG+PS+PPQ NIAM QPQF    PPPP
Subjt:  SWILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPA-VWGNPSIPPQANIAMGQPQF----PPPP

Query:  PQRTADPVFVHEDTYMRRQFT
        PQRT DPVFVHEDTYMRRQFT
Subjt:  PQRTADPVFVHEDTYMRRQFT

A0A6J1CG96 uncharacterized protein LOC1110114031.6e-9177.88Show/hide
Query:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS
        ME+KA+ VCSVV+FL LLV+ATGFAAEGTR+KLSQV +V+P T C YPRSPA+ LG  AALSLL+AQ+ INVSTGCICC+RGPRP ASKWRT ++CF++S
Subjt:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS

Query:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPP--QRT
        W  ++IAFLLLL GAALNDRR E+ YYFGYYECYVLKPGVFAVATILA AS+VLGL Y+L LNSAKN+P VWGNPS+PPQANIAMGQPQFPPPPP  QR+
Subjt:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPP--QRT

Query:  ADPVFVHEDTYMRRQFT
         DPVFVHEDTYMRRQ+T
Subjt:  ADPVFVHEDTYMRRQFT

A0A6J1E6P1 uncharacterized protein LOC1114304666.9e-8777.88Show/hide
Query:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS
        ME+KAL VCSVV+FL LL++ATGFAAEGTRVK +QV +V+P T C YP+SPA ALG TAALSLLLAQI+INVSTGCICC RGPRP ASKWRTA++CF+VS
Subjt:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVS

Query:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T
        W  +VIAFLLLL GAALND RNEQ  YF YY CYVLKPGVFAVAT++  ASL LGL Y+L LNSAKNDP VWGNPSIPP ANIAM QPQFPPPPP +  T
Subjt:  WILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQR--T

Query:  ADPVFVHEDTYMRRQFT
        ADPVFVHEDTY RRQFT
Subjt:  ADPVFVHEDTYMRRQFT

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-19.8e-0636.07Show/hide
Query:  CNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIV---SWILYVIAFLLLLAGAALNDRRNEQRYYFGYY--ECYVLKPG
        C  P + A  LG  A + + +AQI+ NV    IC  RG      K RT + C I+   SW+ + +A  L+  GA++N    EQ Y  G+   ECY++K G
Subjt:  CNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIV---SWILYVIAFLLLLAGAALNDRRNEQRYYFGYY--ECYVLKPG

Query:  VFAVATILAVASL--VLGLVYF
        VFA +  L+V ++  +LG   F
Subjt:  VFAVATILAVASL--VLGLVYF

Arabidopsis top hitse value%identityAlignment
AT1G52910.1 Protein of unknown function (DUF1218)3.3e-0926.63Show/hide
Query:  LVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPT-----TRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSW
        LV  +V  L L+ +    AAE  R     V +V P        C Y    A + G  A + L ++Q++I V++ C CC +  +P  S+    ++ F++ W
Subjt:  LVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPT-----TRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSW

Query:  ILYVIAFLLLLAGAALNDRRNEQRYYFGYY---ECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKN
        + ++IA + LLAG+  N      R  +       C V++ GVFA     A+ + ++   Y+++ + A++
Subjt:  ILYVIAFLLLLAGAALNDRRNEQRYYFGYY---ECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKN

AT1G61065.1 Protein of unknown function (DUF1218)1.1e-1232.93Show/hide
Query:  ALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVS-PTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASK-WRTAMICFIVSWI
        ++L+  +V    L+      AAE  R    Q++R S   + C Y +  A  LG  + L LL +Q++I V++ C+CC R   PS S+ W  A+  FI +W+
Subjt:  ALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVS-PTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASK-WRTAMICFIVSWI

Query:  LYVIAFLLLLAGAALNDRRNEQRYYFGYY--ECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKN
         + IA + LLAG+  N    + R YFG     C  L+ GVF       V + ++  +Y++TL+ AK+
Subjt:  LYVIAFLLLLAGAALNDRRNEQRYYFGYY--ECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKN

AT1G68220.1 Protein of unknown function (DUF1218)3.3e-0930.36Show/hide
Query:  VCSVVSFLILLVIATGFAAEGTRVKLSQV-TRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSWILYVI
        + +VV+ L LL     F AE  R     V  +    T C Y    +   G +A   LL++Q ++N  T C+C  +G     S    A++ F+VSW+ ++ 
Subjt:  VCSVVSFLILLVIATGFAAEGTRVKLSQV-TRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSWILYVI

Query:  AFLLLLAGAALN--DRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVW
        A   LL G+A N    ++E  Y      C VL  GVFA      + SL+  ++Y+L    +K D   W
Subjt:  AFLLLLAGAALN--DRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVW

AT5G17210.1 Protein of unknown function (DUF1218)1.4e-5552.53Show/hide
Query:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVT-RVSPT-TRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFI
        ME++ +++C V+  L LL   T F AE TR+K SQVT  VS + T+C YPRSPA  LGFT+AL L++AQI+++VS+GC CC +GP PS S W  ++ICF+
Subjt:  MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVT-RVSPT-TRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFI

Query:  VSWILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQRT
        VSW  +VIAFL+LL+GAALND   E+    G Y CY++KPGVF+   +L++ ++ LG+VY+L L S K   A     +      IAMGQPQ     P+R 
Subjt:  VSWILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQRT

Query:  ADPVFVHEDTYMRRQFT
         DPVFVHEDTYMRRQFT
Subjt:  ADPVFVHEDTYMRRQFT

AT5G17210.2 Protein of unknown function (DUF1218)1.3e-4853.33Show/hide
Query:  VTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSWILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYV
        VT     T+C YPRSPA  LGFT+AL L++AQI+++VS+GC CC +GP PS S W  ++ICF+VSW  +VIAFL+LL+GAALND   E+    G Y CY+
Subjt:  VTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSWILYVIAFLLLLAGAALNDRRNEQRYYFGYYECYV

Query:  LKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQRTADPVFVHEDTYMRRQFT
        +KPGVF+   +L++ ++ LG+VY+L L S K   A     +      IAMGQPQ     P+R  DPVFVHEDTYMRRQFT
Subjt:  LKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQRTADPVFVHEDTYMRRQFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGAAGGCTCTGCTTGTGTGCTCCGTGGTGTCTTTTCTGATTCTTCTGGTGATCGCCACTGGCTTCGCCGCCGAGGGCACAAGAGTTAAGCTTAGTCAGGTTAC
TAGAGTCAGTCCTACTACTAGATGCAATTATCCGCGAAGTCCTGCGATGGCCCTGGGTTTCACTGCAGCATTATCACTTTTGCTTGCTCAAATAATGATAAATGTTTCGA
CCGGATGCATTTGTTGCTTGAGAGGCCCTCGGCCATCTGCTTCCAAATGGCGAACCGCCATGATCTGCTTTATTGTTTCGTGGATTTTATATGTGATAGCATTCCTCCTG
TTGCTTGCTGGTGCTGCACTGAACGATCGACGCAACGAACAGCGCTACTACTTCGGTTACTACGAGTGCTATGTTCTGAAACCGGGAGTTTTTGCTGTTGCTACCATTCT
GGCAGTTGCAAGTTTAGTGCTTGGATTGGTCTATTTCCTCACATTGAACTCTGCAAAGAATGATCCTGCTGTGTGGGGCAATCCTTCCATTCCTCCTCAAGCAAACATTG
CAATGGGGCAGCCCCAATTCCCTCCCCCTCCTCCACAGAGAACTGCAGACCCCGTATTCGTTCACGAAGACACGTACATGAGACGACAATTCACGTGA
mRNA sequenceShow/hide mRNA sequence
AAAATTCCACCGCCACGTCCCCATTAAAGTCCGAATTCGGAAGCCCCTCTTTTCATCGGACGCAACGTCGAAAGTTCATATCATTTTCCTCTCAACAACTTTTACTATAC
GTTTCTCTGCCTGCAACGGTTTCCTCGCCCATAGAAACCTAAACCAAAATCCGAATCTCCACCAAATCTCGCCGGTTTTTTCTCTCCGGCGGCGGCGGAGATGGAGAAGA
AGGCTCTGCTTGTGTGCTCCGTGGTGTCTTTTCTGATTCTTCTGGTGATCGCCACTGGCTTCGCCGCCGAGGGCACAAGAGTTAAGCTTAGTCAGGTTACTAGAGTCAGT
CCTACTACTAGATGCAATTATCCGCGAAGTCCTGCGATGGCCCTGGGTTTCACTGCAGCATTATCACTTTTGCTTGCTCAAATAATGATAAATGTTTCGACCGGATGCAT
TTGTTGCTTGAGAGGCCCTCGGCCATCTGCTTCCAAATGGCGAACCGCCATGATCTGCTTTATTGTTTCGTGGATTTTATATGTGATAGCATTCCTCCTGTTGCTTGCTG
GTGCTGCACTGAACGATCGACGCAACGAACAGCGCTACTACTTCGGTTACTACGAGTGCTATGTTCTGAAACCGGGAGTTTTTGCTGTTGCTACCATTCTGGCAGTTGCA
AGTTTAGTGCTTGGATTGGTCTATTTCCTCACATTGAACTCTGCAAAGAATGATCCTGCTGTGTGGGGCAATCCTTCCATTCCTCCTCAAGCAAACATTGCAATGGGGCA
GCCCCAATTCCCTCCCCCTCCTCCACAGAGAACTGCAGACCCCGTATTCGTTCACGAAGACACGTACATGAGACGACAATTCACGTGATCGGTAACTGTCGGTTGATACC
GAATGCGTTTAAGTAAACCATATAACTCATACCACAGGATTAATGTATCTTTTGAAACCAGGCAATAAGTTAAAAAAATATGTTGGCTCTTTAGAAATATATGAAATAGT
GATTGTAACTAGAATGATTTATTTATAGGGGATACCAAATAATGATTCTAGCTCAAATTAGATTTCAAATGAAATTTTCTAC
Protein sequenceShow/hide protein sequence
MEKKALLVCSVVSFLILLVIATGFAAEGTRVKLSQVTRVSPTTRCNYPRSPAMALGFTAALSLLLAQIMINVSTGCICCLRGPRPSASKWRTAMICFIVSWILYVIAFLL
LLAGAALNDRRNEQRYYFGYYECYVLKPGVFAVATILAVASLVLGLVYFLTLNSAKNDPAVWGNPSIPPQANIAMGQPQFPPPPPQRTADPVFVHEDTYMRRQFT