CuGenDBv2

CmoCh02G001160 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh02G001160
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF789)
Genome location: Cmo_Chr02:606151..607836
RNA-Seq Expression: CmoCh02G001160
Synteny: CmoCh02G001160
Gene Ontology terms:
  GO:0009808 - lignin metabolic process (biological process)
  GO:0005506 - iron ion binding (molecular function)
  GO:0016710 - trans-cinnamate 4-monooxygenase activity (molecular function)
  GO:0020037 - heme binding (molecular function)
InterPro domains: IPR008507 - Protein of unknown function DUF789


Homology

GenBank top hits (e-value, % identity, alignment)
KAG7010693.1 hypothetical protein SDJN02_27489 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 1.0e-108 | identity: 92.09%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDK S +QSNLGCFLHCTTP+VNSQFLPKSEIRNLNRLWHPWERE+VEYFTL DLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDEC-GDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLS
        +NGFRDEC GDSE RDSFSDSCSDESESEK WRWDGSSSEE GGLLEQDSPL LSDRLGYLYFQYFERSTPYGRVPLMDKIN LARR+PGLMTLRSVDLS
Subjt:  INGFRDEC-GDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLS

Query:  PASWMAVAWSVFVSL
        PASWMAVAWS+ +SL
Subjt:  PASWMAVAWSVFVSL

KGN46762.2 hypothetical protein Csa_020792 [Cucumis sativus] | e-value: 2.7e-109 | identity: 93.3%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDK S +QSNLGCFLHCTTP+VNSQFLPKSEIRNLNRLWHPWERE+VEYFTL DLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        +NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEE GG LEQ+SPL LSDRLGYLYFQYFERSTPYGRVPLMDKIN LARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAWS
        ASWMAV+W+
Subjt:  ASWMAVAWS

XP_004139934.1 uncharacterized protein LOC101222318 isoform X1 [Cucumis sativus] | e-value: 3.5e-109 | identity: 93.75%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDK S +QSNLGCFLHCTTP+VNSQFLPKSEIRNLNRLWHPWERE+VEYFTL DLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        +NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEE GG LEQ+SPL LSDRLGYLYFQYFERSTPYGRVPLMDKIN LARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAV+W
Subjt:  ASWMAVAW

XP_022947406.1 uncharacterized protein LOC111451281 [Cucurbita moschata] | e-value: 2.5e-118 | identity: 100%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAVAW
Subjt:  ASWMAVAW

XP_038900400.1 uncharacterized protein LOC120087632 [Benincasa hispida] | e-value: 5.0e-111 | identity: 95.19%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDKGS +QSNLGCFLHCTTP+VNSQFLPKSEIRNLNRLWHPWERE+VEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        +NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEE GG LEQ+SPL LSDRLGYLYFQYFERSTPYGRVPLMDKIN LARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAVAW
Subjt:  ASWMAVAW

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KG73 Uncharacterized protein | e-value: 1.7e-109 | identity: 93.75%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDK S +QSNLGCFLHCTTP+VNSQFLPKSEIRNLNRLWHPWERE+VEYFTL DLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        +NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEE GG LEQ+SPL LSDRLGYLYFQYFERSTPYGRVPLMDKIN LARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAV+W
Subjt:  ASWMAVAW

A0A1S3BIR2 uncharacterized protein LOC103490514 | e-value: 3.2e-108 | identity: 92.31%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDKGS +QSNLGCFLHCTTP+VNSQFL KSEIRNLNRLWHPWERE+VEYFTL DLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        ++GFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEE GG LEQ+SP+ LSDRLGYLYFQYFERSTPYGRVPLMDKI+ LARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAV+W
Subjt:  ASWMAVAW

A0A5A7V865 DUF789 domain-containing protein | e-value: 3.2e-108 | identity: 92.31%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDKGS +QSNLGCFLHCTTP+VNSQFL KSEIRNLNRLWHPWERE+VEYFTL DLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        ++GFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEE GG LEQ+SP+ LSDRLGYLYFQYFERSTPYGRVPLMDKI+ LARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAV+W
Subjt:  ASWMAVAW

A0A6J1G6C1 uncharacterized protein LOC111451281 | e-value: 1.2e-118 | identity: 100%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAVAW
Subjt:  ASWMAVAW

A0A6J1I6G3 uncharacterized protein LOC111469617 | e-value: 1.2e-118 | identity: 100%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAVAW
Subjt:  ASWMAVAW

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G03610.1 Protein of unknown function (DUF789) | e-value: 1.0e-74 | identity: 68.57%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        M+  KGS  +SNL  FLHC TPLV  Q LPK+EIR LNRLWHPWER++VE+F L+DLW+CYDEWSAYGA VPI V NGE+LVQYYVPYLSAIQIFTS+S+
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDEC--GDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDL
        +   R+E   G+ E RD FSDS SDE           S SEEG   LE ++ L  SDRLGYLY QYFERS PY RVPLMDKINELA+RYPGLM+LRSVDL
Subjt:  INGFRDEC--GDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDL

Query:  SPASWMAVAW
        SPASWM+VAW
Subjt:  SPASWMAVAW

AT1G17830.1 Protein of unknown function (DUF789) | e-value: 8.6e-45 | identity: 50%
Query:  LQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST-INGFRDE
        L+SNL  FL   TP   S  L +S   +LN LW    ++ +EYF L+DLW+C+DE SAYG G  + +NNGE+++QYYVPYLSAIQI+T+ ST I+    +
Subjt:  LQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST-INGFRDE

Query:  CGDSETRDSFSDSCSDESESEKLWR---------WDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDL
          D E     S+  SD+SE EKL R         WD S S++ G  ++  S L + D+LG + FQYFE   P+ RVPL  K+NELA +YPGL TLRSVDL
Subjt:  CGDSETRDSFSDSCSDESESEKLWR---------WDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDL

Query:  SPASWMAVAW
        SPASW+A+AW
Subjt:  SPASWMAVAW

AT4G03420.1 Protein of unknown function (DUF789) | e-value: 4.1e-79 | identity: 70.67%
Query:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        M+  KG    SNL  FLHCTTP+V  Q L K+EIR+LNR+WHPWER++VE+F L+DLW+CYDEWSAYGAGVPI ++NGE+LVQYYVPYLSAIQIFTS S+
Subjt:  MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP
        +   RD+  D E+RDSFSDS SDESES+KL R    +S+EG   LE D+ L  +DRLGYLY QYFERS PY RVPLMDKINELA+RYPGLM+LRSVDLSP
Subjt:  INGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSP

Query:  ASWMAVAW
        ASWMAVAW
Subjt:  ASWMAVAW

AT4G28150.1 Protein of unknown function (DUF789) | e-value: 1.0e-66 | identity: 64.32%
Query:  QSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTINGFRDECG
        +SNL  FL CTTP+V +  LPK++I+NLN LW+P E + VEYF L D W+C+DEWSAYGAGVPI    GETLVQYYVPYLSAIQIFTS+S IN  R+E  
Subjt:  QSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTINGFRDECG

Query:  DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSPASWMAVAW
         +E+ DS S+SCS+E      WRW+G SS E G   +   PL   DRLGY Y QYFER TPY RVPLMDKI EL  RY GL +LRSVDLSPASWMAVAW
Subjt:  DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSPASWMAVAW

AT4G28150.2 Protein of unknown function (DUF789) | e-value: 9.8e-65 | identity: 64.32%
Query:  QSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTINGFRDECG
        +SNL  FL CTTP+V +  LPK  I+NLN LW+P E + VEYF L D W+C+DEWSAYGAGVPI    GETLVQYYVPYLSAIQIFTS+S IN  R+E  
Subjt:  QSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTINGFRDECG

Query:  DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSPASWMAVAW
         +E+ DS S+SCS+E      WRW+G SS E G   +   PL   DRLGY Y QYFER TPY RVPLMDKI EL  RY GL +LRSVDLSPASWMAVAW
Subjt:  DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSPASWMAVAW


Sequences

CDS sequence
ATGATTCTTGACAAAGGGTCAGCACTGCAATCCAATTTGGGTTGCTTTCTTCATTGCACCACGCCCCTTGTCAATTCCCAATTCTTGCCCAAGAGTGAGATTAGGAATCT
GAATCGTCTGTGGCATCCATGGGAGAGAGAGAGGGTTGAATATTTCACCCTCGCCGATCTCTGGAATTGTTACGACGAATGGAGCGCCTACGGCGCTGGCGTTCCCATCG
CCGTCAACAACGGCGAGACCCTTGTTCAATACTACGTTCCTTATCTCTCTGCAATCCAAATCTTCACCAGCAACTCCACCATCAATGGTTTCAGAGATGAGTGTGGTGAC
AGTGAAACAAGGGATTCGTTCAGTGATTCATGCAGCGATGAGAGTGAAAGTGAAAAACTATGGAGATGGGATGGAAGCTCATCGGAAGAGGGAGGTGGATTATTAGAACA
AGACAGCCCTCTATCTCTCAGTGACAGATTGGGATACCTTTACTTTCAGTATTTCGAGAGATCAACTCCATACGGAAGAGTCCCACTAATGGATAAGATCAATGAATTGG
CTCGAAGATACCCTGGCCTGATGACATTGAGAAGCGTGGATCTTTCTCCTGCTAGTTGGATGGCTGTTGCCTGGTCTGTGTTTGTCTCTCTGTGCGTGTGTCTAAATTAG

mRNA sequence
TTTCCGGATACGCATTCTGGAAATCCCATTAAATCCCATAAATGAAAATCTGGAATTTGTGATGCCTTTATTTATATCTTCTTCTTCTTCTCTGCTTTGTGTGTGTGTGT
GTGTGTGTGTTGTTCGTGGTGTTTCTGGGGAAATCTGAGGTGATCTTGATTGAAGTGTTGGGGAAGAGTGGGCGTGTTTTGTTTAGGGTTAGTTAAAAGCAGAGGAGGGA
GAACCCTGCACTTTACTTGCGGGAGTTGTTTCGATTCTCTGTTCGCTCGACGGGAACCAGAACGGCTTTGCATTTTGACGCCATTGACAGCCAAGACTCTCAAACAGAGA
GAGTAGAGATTCTAACTTTCCCGTGAAATTTGGTGGTTTCCGTTTCTAATCTTAGAAATTCTTTCACACGTTTGTGGGTTTATCTCCATTTTGGTTCCTATTTTCTTCGC
AATCGGAAGAGCCTTTTGAAAGGAAAAAAAAGACTCCGATTATTCCTCTTCCTCTTCCTCTTCCCTTTGTCTCTCTACTCTTTCCCGGCGCTCTGTTCTTCGGTTTGCTT
TCCGGGAAGCCTCTCTGCTTTGCAGGGTTGGGGGGAAGGCTCTGAATATTTCATCTCTCTCCCTCTCTTCCAAGATTATGATTCTTGACAAAGGGTCAGCACTGCAATCC
AATTTGGGTTGCTTTCTTCATTGCACCACGCCCCTTGTCAATTCCCAATTCTTGCCCAAGAGTGAGATTAGGAATCTGAATCGTCTGTGGCATCCATGGGAGAGAGAGAG
GGTTGAATATTTCACCCTCGCCGATCTCTGGAATTGTTACGACGAATGGAGCGCCTACGGCGCTGGCGTTCCCATCGCCGTCAACAACGGCGAGACCCTTGTTCAATACT
ACGTTCCTTATCTCTCTGCAATCCAAATCTTCACCAGCAACTCCACCATCAATGGTTTCAGAGATGAGTGTGGTGACAGTGAAACAAGGGATTCGTTCAGTGATTCATGC
AGCGATGAGAGTGAAAGTGAAAAACTATGGAGATGGGATGGAAGCTCATCGGAAGAGGGAGGTGGATTATTAGAACAAGACAGCCCTCTATCTCTCAGTGACAGATTGGG
ATACCTTTACTTTCAGTATTTCGAGAGATCAACTCCATACGGAAGAGTCCCACTAATGGATAAGATCAATGAATTGGCTCGAAGATACCCTGGCCTGATGACATTGAGAA
GCGTGGATCTTTCTCCTGCTAGTTGGATGGCTGTTGCCTGGTCTGTGTTTGTCTCTCTGTGCGTGTGTCTAAATTAG

Protein sequence
MILDKGSALQSNLGCFLHCTTPLVNSQFLPKSEIRNLNRLWHPWERERVEYFTLADLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTINGFRDECGD
SETRDSFSDSCSDESESEKLWRWDGSSSEEGGGLLEQDSPLSLSDRLGYLYFQYFERSTPYGRVPLMDKINELARRYPGLMTLRSVDLSPASWMAVAWSVFVSLCVCLN