CuGenDBv2

Lag0019817 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019817
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF1218)
Genome location: chr5:45777494..45780711
RNA-Seq Expression: Lag0019817
Synteny: Lag0019817
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
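The genome location above can be split into its chromosome and coordinates to get the length of the gene span. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr5:45777494..45780711")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```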


Homology
GenBank top hits (e value, %identity, alignment)
KAG6576843.1 Protein MODIFYING WALL LIGNIN-2, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.0e-84 | %identity: 75
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        MERKAL VCSVV FLGLL+ ATGFAAE TRVK NQV+ VTPT CKYP+SPA  LGLTAALSLLLA + INVSTGCICC RGPRPPAS+WRTA++CF  SW
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA
         TFVIA L+LL GAALN+ + E +S+F Y  CYVLKPGVF +AT++  ASL LGL Y LILNSAKNDP V+GNPSIPP ANIAM QPQFPPP P +  TA
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

KAG7014863.1 hypothetical protein SDJN02_22493 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.0e-84 | %identity: 75
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        MERKAL VCSVV FLGLL+ ATGFAAE TRVK NQV+ VTPT CKYP+SPA  LGLTAALSLLLA + INVSTGCICC RGPRPPAS+WRTA++CF  SW
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA
         TFVIA L+LL GAALN+ + E +S+F Y  CYVLKPGVF +AT++  ASL LGL Y LILNSAKNDP V+GNPSIPP ANIAM QPQFPPP P +  TA
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

XP_022140830.1 uncharacterized protein LOC111011403 [Momordica charantia] | e value: 4.1e-86 | %identity: 75
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        MERKA+ VCSVV FLGLLV ATGFAAE TR+K +QVI VTP TC YPRSPA+GLGL AALSLL+A +TINVSTGCICC RGPRPPAS+WRT ++CF  SW
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAP--QRTA
         TF+IA L+LL GAALN+R+ E + +FGY  CYVLKPGVF +ATILA AS+ LGL Y LILNSAKN+P V+GNPS+PPQANIAMGQPQFPPP P  QR+ 
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAP--QRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQ+T
Subjt:  DPVFVHEDTYMRRQFT

XP_022922463.1 uncharacterized protein LOC111430466 [Cucurbita moschata] | e value: 3.9e-84 | %identity: 75
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        MERKAL VCSVV FLGLL+ ATGFAAE TRVK NQV+ VTPT CKYP+SPA  LGLTAALSLLLA + INVSTGCICC RGPRPPAS+WRTA++CF  SW
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA
         TFVIA L+LL GAALN+ + E +S+F Y  CYVLKPGVF +AT++  ASL LGL Y LILNSAKNDP V+GNPSIPP ANIAM QPQFPPP P +  TA
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

XP_023551924.1 uncharacterized protein LOC111809750 [Cucurbita pepo subsp. pepo] | e value: 5.6e-83 | %identity: 74.54
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        MERKAL VCSVV  LGLL+ ATGFAAE TRVK NQV+ VTPT CKYP+SPA  LGLTAALSLLLA + INVSTGCICC RGPRPPAS+WRTA++CF  SW
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA
         TFVIA L+LL GAALN  + E +S+F Y  CYVLKPGVF +AT++  ASL LGL Y LILNSAKNDP V+GNPSIPP ANIAM QPQFPPP P +  TA
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KU80 Uncharacterized protein | e value: 2.9e-77 | %identity: 71.1
Query:  MERK-ALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATS
        MERK AL+V  VV  LG+++ ATGFAAEATR K NQV  V P  CKYPRSPA+GLGLTAALSLL A +TI  STGC+CC RGPRPPAS+WRTA+ICF  S
Subjt:  MERK-ALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATS

Query:  WVTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPA-VFGNPSIPPQANIAMGQPQF--PPPAPQR
        WVT+VIA L+ L GAALN  + E  ++F   +CYVLKPGVF+ ATI+ +ASLTLG+SY LILNSAKNDP+ V+G+PS+PPQ NIAM QPQF  PPP PQR
Subjt:  WVTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPA-VFGNPSIPPQANIAMGQPQF--PPPAPQR

Query:  TADPVFVHEDTYMRRQFT
        TADPVFVHEDTYMRRQFT
Subjt:  TADPVFVHEDTYMRRQFT

A0A5D3D069 DUF1218 domain-containing protein | e value: 1.0e-74 | %identity: 69.09
Query:  MERK-ALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATS
        MERK ALVV  VV FLG+++ ATGFAAEATR K  QV  V P  CKYPRSPAMGLG TAALSLL A +TI  STGC+CC RGPRPP  +WRTA+ICF  S
Subjt:  MERK-ALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATS

Query:  WVTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPA-VFGNPSIPPQANIAMGQPQF----PPPAP
        W+T+VIA L+ L GAALN  + +  ++ G   CYVLKPGVF+ ATI+ VASLTLG+SY LILNSAKNDP+ V+G+PS+PPQ NIAM QPQF    PPP P
Subjt:  WVTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPA-VFGNPSIPPQANIAMGQPQF----PPPAP

Query:  QRTADPVFVHEDTYMRRQFT
        QRT DPVFVHEDTYMRRQFT
Subjt:  QRTADPVFVHEDTYMRRQFT

A0A6J1CG96 uncharacterized protein LOC111011403 | e value: 2.0e-86 | %identity: 75
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        MERKA+ VCSVV FLGLLV ATGFAAE TR+K +QVI VTP TC YPRSPA+GLGL AALSLL+A +TINVSTGCICC RGPRPPAS+WRT ++CF  SW
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAP--QRTA
         TF+IA L+LL GAALN+R+ E + +FGY  CYVLKPGVF +ATILA AS+ LGL Y LILNSAKN+P V+GNPS+PPQANIAMGQPQFPPP P  QR+ 
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAP--QRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQ+T
Subjt:  DPVFVHEDTYMRRQFT

A0A6J1E6P1 uncharacterized protein LOC111430466 | e value: 1.9e-84 | %identity: 75
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        MERKAL VCSVV FLGLL+ ATGFAAE TRVK NQV+ VTPT CKYP+SPA  LGLTAALSLLLA + INVSTGCICC RGPRPPAS+WRTA++CF  SW
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA
         TFVIA L+LL GAALN+ + E +S+F Y  CYVLKPGVF +AT++  ASL LGL Y LILNSAKNDP V+GNPSIPP ANIAM QPQFPPP P +  TA
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

A0A6J1J634 uncharacterized protein LOC111482918 | e value: 7.9e-75 | %identity: 67.76
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        ME+KALVV +VVVFLGLLV ATGFAAE T+VK N VI V  TTCKYP+SPA+GLGL AALSLLLAH+T+NV+TGC CC   PR   S+WR A+IC+  SW
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQRTADP
        +TF  A ++LL GAALN+++ E +   G+  CYVL+ G FT+ATI+A  S+ LGL+Y ++LNSA+N+P+VFGNP IPPQANIAMGQPQFPPP P R+ADP
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQRTADP

Query:  VFVHEDTYMRRQFT
        VFVHEDTYMRRQFT
Subjt:  VFVHEDTYMRRQFT

SwissProt top hits (e value, %identity, alignment)
A2RVU1 Protein MODIFYING WALL LIGNIN-1 | e value: 1.4e-04 | %identity: 30.7
Query:  TCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFAT---SWVTFVIASLVLLGGAALNERQREPNSHFGYSN--CYVLKP
        +C  P + A GLG+ A + + +A +  NV    + CR   +    + RT + C      SWV F +A  ++  GA++N   RE     G+ N  CY++K 
Subjt:  TCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFAT---SWVTFVIASLVLLGGAALNERQREPNSHFGYSN--CYVLKP

Query:  GVFTIATILAVASL
        GVF  +  L+V ++
Subjt:  GVFTIATILAVASL

Arabidopsis top hits (e value, %identity, alignment)
AT1G13380.1 Protein of unknown function (DUF1218) | e value: 3.3e-09 | %identity: 29.17
Query:  RKALVVCSVVVFLGLLVAATGFAAEATRV--KPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW
        + + +V  +VV L L+      AAE  R   K  Q      T C Y    A G G+ A L LL +   +   T C+C  R P  P S    ++I F +SW
Subjt:  RKALVVCSVVVFLGLLVAATGFAAEATRV--KPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSW

Query:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDP
        +TF++A   ++ GA  N    +  S   +S C  L+ G+F    +  VA++ L + Y +    + + P
Subjt:  VTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDP

AT1G61065.1 Protein of unknown function (DUF1218) | e value: 6.1e-11 | %identity: 34.13
Query:  CKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASR-WRTAMICFATSWVTFVIASLVLLGGAALNERQREPNSHFGYS--NCYVLKPGVF
        C Y +  A GLG+ + L LL + + I V++ C+CC R   P  SR W  A+  F T+WV F IA + LL G+  N    +   +FG +  +C  L+ GVF
Subjt:  CKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASR-WRTAMICFATSWVTFVIASLVLLGGAALNERQREPNSHFGYS--NCYVLKPGVF

Query:  TIATILAVASLTLGLSYCLILNSAKN
               V +  +   Y + L+ AK+
Subjt:  TIATILAVASLTLGLSYCLILNSAKN

AT1G68220.1 Protein of unknown function (DUF1218) | e value: 1.3e-08 | %identity: 30.43
Query:  VCSVVVFLGLLVAATGFAAEATR--VKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSWVTFVI
        + +VV  L LL     F AE  R    P    Y   T CKY    +   G++A   LL++   +N  T C+C  +G     S    A++ F  SWV+F+ 
Subjt:  VCSVVVFLGLLVAATGFAAEATR--VKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSWVTFVI

Query:  ASLVLLGGAALNERQREPNSHFGYS--NCYVLKPGVFTIATILAVASLTLGLSYCLILNSA
        A   LLGG+A N    +    +     +C VL  GVF       + SL   + Y L  + A
Subjt:  ASLVLLGGAALNERQREPNSHFGYS--NCYVLKPGVFTIATILAVASLTLGLSYCLILNSA

AT5G17210.1 Protein of unknown function (DUF1218) | e value: 7.6e-54 | %identity: 50.69
Query:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTP---TTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFA
        MER+ +V+C V+  LGLL A T F AEATR+K +QV        T C YPRSPA  LG T+AL L++A + ++VS+GC CCR+GP P  S W  ++ICF 
Subjt:  MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTP---TTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFA

Query:  TSWVTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQRT
         SW TFVIA LVLL GAALN+   E + + G   CY++KPGVF+   +L++ ++ LG+ Y L L S K   A     +      IAMGQPQ     P+R 
Subjt:  TSWVTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQRT

Query:  ADPVFVHEDTYMRRQFT
         DPVFVHEDTYMRRQFT
Subjt:  ADPVFVHEDTYMRRQFT

AT5G17210.2 Protein of unknown function (DUF1218) | e value: 1.4e-44 | %identity: 51.45
Query:  TTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSWVTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFT
        T C YPRSPA  LG T+AL L++A + ++VS+GC CCR+GP P  S W  ++ICF  SW TFVIA LVLL GAALN+   E + + G   CY++KPGVF+
Subjt:  TTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSWVTFVIASLVLLGGAALNERQREPNSHFGYSNCYVLKPGVFT

Query:  IATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQRTADPVFVHEDTYMRRQFT
           +L++ ++ LG+ Y L L S K   A     +      IAMGQPQ     P+R  DPVFVHEDTYMRRQFT
Subjt:  IATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQRTADPVFVHEDTYMRRQFT


Sequences
CDS sequence
ATGGAGAGGAAGGCTTTGGTGGTGTGCTCTGTTGTCGTCTTTTTGGGGCTTTTGGTGGCCGCCACTGGCTTCGCCGCTGAGGCCACCAGAGTTAAGCCTAATCAAGTTAT
TTATGTCACTCCTACTACGTGCAAATATCCCCGAAGTCCAGCGATGGGCCTTGGTTTGACTGCAGCTCTATCACTTTTGCTTGCTCATGTAACGATAAATGTTTCGACGG
GATGCATTTGCTGCAGGAGGGGTCCTCGGCCTCCTGCTTCTAGATGGAGAACAGCCATGATCTGCTTCGCCACTTCCTGGGTTACGTTTGTGATAGCGTCCCTCGTGTTG
CTTGGCGGAGCCGCACTGAACGAAAGACAGAGGGAACCGAACTCCCATTTCGGTTACAGCAATTGCTACGTTCTGAAACCGGGAGTTTTCACCATCGCTACCATTTTGGC
CGTTGCTAGTTTGACGCTGGGATTGTCCTATTGCCTCATATTGAACTCTGCAAAGAATGACCCTGCTGTGTTTGGCAATCCTTCCATTCCCCCTCAAGCAAACATTGCAA
TGGGGCAGCCCCAATTCCCTCCCCCTGCTCCACAGAGAACCGCAGACCCTGTTTTCGTTCATGAAGACACATACATGAGACGACAATTCACGTGA
mRNA sequence
ATGGAGAGGAAGGCTTTGGTGGTGTGCTCTGTTGTCGTCTTTTTGGGGCTTTTGGTGGCCGCCACTGGCTTCGCCGCTGAGGCCACCAGAGTTAAGCCTAATCAAGTTAT
TTATGTCACTCCTACTACGTGCAAATATCCCCGAAGTCCAGCGATGGGCCTTGGTTTGACTGCAGCTCTATCACTTTTGCTTGCTCATGTAACGATAAATGTTTCGACGG
GATGCATTTGCTGCAGGAGGGGTCCTCGGCCTCCTGCTTCTAGATGGAGAACAGCCATGATCTGCTTCGCCACTTCCTGGGTTACGTTTGTGATAGCGTCCCTCGTGTTG
CTTGGCGGAGCCGCACTGAACGAAAGACAGAGGGAACCGAACTCCCATTTCGGTTACAGCAATTGCTACGTTCTGAAACCGGGAGTTTTCACCATCGCTACCATTTTGGC
CGTTGCTAGTTTGACGCTGGGATTGTCCTATTGCCTCATATTGAACTCTGCAAAGAATGACCCTGCTGTGTTTGGCAATCCTTCCATTCCCCCTCAAGCAAACATTGCAA
TGGGGCAGCCCCAATTCCCTCCCCCTGCTCCACAGAGAACCGCAGACCCTGTTTTCGTTCATGAAGACACATACATGAGACGACAATTCACGTGA
Protein sequence
MERKALVVCSVVVFLGLLVAATGFAAEATRVKPNQVIYVTPTTCKYPRSPAMGLGLTAALSLLLAHVTINVSTGCICCRRGPRPPASRWRTAMICFATSWVTFVIASLVL
LGGAALNERQREPNSHFGYSNCYVLKPGVFTIATILAVASLTLGLSYCLILNSAKNDPAVFGNPSIPPQANIAMGQPQFPPPAPQRTADPVFVHEDTYMRRQFT
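
The CDS and protein sequences above can be checked against each other by translating the CDS codon by codon. The sketch below is an illustration only, not part of the database record: it assumes the standard genetic code (NCBI translation table 1), the helper name `translate` is hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with unknown bases
        for i in range(0, usable, 3)
    )


# First 30 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGGAGAGGAAGGCTTTGGTGGTGTGCTCT"
cds_end = "AGACGACAATTCACGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Under that assumption, the two fragments should correspond to the first ten residues and the last five residues of the protein sequence, followed by the stop codon.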