; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019607 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019607
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioncinnamoyl-CoA reductase-like SNL6
Genome locationChr04:23721765..23736290
RNA-Seq ExpressionHG10019607
SyntenyHG10019607
Gene Ontology termsGO:0006694 - steroid biosynthetic process (biological process)
GO:0003854 - 3-beta-hydroxy-delta5-steroid dehydrogenase activity (molecular function)
InterPro domainsIPR002225 - 3-beta hydroxysteroid dehydrogenase/isomerase
IPR036291 - NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053663.1 cinnamoyl-CoA reductase 1-like isoform X1 [Cucumis melo var. makuwa]3.3e-14779.25Show/hide
Query:  MGIVGAE-ECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNP---EDREKL
        MGI GA+ +CR V+LE+LRRISER  M    RKDSD+F+GRRV SNSSSD+DE+PGDELVCVTSGVSLLGLA+VNQLLQRGFSVRILLDNP   ED EK+
Subjt:  MGIVGAE-ECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNP---EDREKL

Query:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT
        REM  KSEAGG      ++STLSA+LTE  SL NA EGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT
Subjt:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT

Query:  RAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLAT
        RAELPP+VD DCWSDRSLC DK            KLWYALGKLKAEKAAWRIA+E+D+KLVTIC A V   P+L   NST TIAYLKGA+EMYEQGLLAT
Subjt:  RAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLAT

Query:  VSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        VSV+RLAEA VNVYEAMGENEAHGRYICFDQIIKT AEAEALAREIC+PITKICQSQ    EEEASTSTS+
Subjt:  VSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

KAG6581036.1 Cinnamoyl-CoA reductase-like SNL6, partial [Cucurbita argyrosperma subsp. sororia]8.5e-16785.75Show/hide
Query:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK
        MGI GAEECRRVELEELRRISERACMLA RRK+SDEFHGRRV SNSSSD+DE+PGDELVCVTSGVS LGLAVVNQLLQRGFSVRI+LDNPEDREKL EM 
Subjt:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK

Query:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP
        SEAGGIRIND KVST+SANLT+CDSLANA EGCRGVFHTSSFIDP+GLTGYSKAM EVEK VSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP
Subjt:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP

Query:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLC-MRNSTATIAYLKGAQEMYEQGLLATVSVQRL
        +VD DCWSD+SLCVDK            KLWYALGKL+AEKAAWRIA+E+ LKLVTIC A V APDLC MRNSTATIAYLKGAQEMYEQGLLATVSV+RL
Subjt:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLC-MRNSTATIAYLKGAQEMYEQGLLATVSVQRL

Query:  AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREI VP++KIC+SQ      EASTSTS+
Subjt:  AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

KAG7017766.1 Cinnamoyl-CoA reductase-like SNL6 [Cucurbita argyrosperma subsp. argyrosperma]2.6e-16886.03Show/hide
Query:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK
        MGI GAEECRRVELEELRRISERACMLA RRK+SDEFHGRRV SNSSSD+DE+PGDELVCVTSGVS LGLAVVNQLLQRGFSVRI+LDNPEDREKL EM 
Subjt:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK

Query:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP
        SEAGGIRIND KVST+SANLT+CDSLANA EGCRGVFHTSSFIDP+GLTGYSKAM EVEK VSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP
Subjt:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP

Query:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLC-MRNSTATIAYLKGAQEMYEQGLLATVSVQRL
        +VD DCWSD+SLCVDK            KLWYALGKL+AEKAAWRIA+E+ LKLVTIC A V APDLC MRNSTATIAYLKGAQEMYEQGLLATVSV+RL
Subjt:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLC-MRNSTATIAYLKGAQEMYEQGLLATVSVQRL

Query:  AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVP++KIC+SQ      EASTSTS+
Subjt:  AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

XP_022983179.1 cinnamoyl-CoA reductase-like SNL6 [Cucurbita maxima]7.9e-16584.66Show/hide
Query:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK
        MGIVGAEECRRVELEELRRISERACMLA+RRK+SDEFHGRRV SNS SD+DE+PGDE VCVTSGVS LGLAVVNQLLQRGFSVRI+LDNPED EKL EM 
Subjt:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK

Query:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP
        SEAGGI+IND KVST+SANL ECDSLANA EGCRGVFHTSSFIDP+GLTGYSKAM EVEK  SENVMEACARTSSVRYCVFTSSLLACIWRDGTRAEL P
Subjt:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP

Query:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLC-MRNSTATIAYLKGAQEMYEQGLLATVSVQRL
        +VD DCWSD+SLCVDK            KLWYALGKL+AEKAAWRIA+E+ LKLVTIC A V APDLC MRNSTATIAYLKGA+EMYEQGLLATVSV+RL
Subjt:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLC-MRNSTATIAYLKGAQEMYEQGLLATVSVQRL

Query:  AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVP+TKIC+SQ      EASTSTS+
Subjt:  AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

XP_038903680.1 cinnamoyl-CoA reductase-like SNL6 [Benincasa hispida]1.0e-17287.36Show/hide
Query:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK
        MGI GAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSS+SSSD+DEDPGDELVCVTSGVSLLGLAVVNQLLQ GFSVRILLDNPEDRE++REMK
Subjt:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK

Query:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP
        SE GG RINDCKVSTLSANL ECDSLANA EGCRGVFHTSSFIDPSGLTGYSKAM EVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT+AELPP
Subjt:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP

Query:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQRLA
        +VD +CWSD SLCVDK            KLWYALGKLKAE AAWRIA+++D+KLVTIC   V APDLCMRNSTAT+AYLKGAQEMYEQGLLAT+S++RLA
Subjt:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQRLA

Query:  EAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        EAHV+VYEAMGENEAHGRYICFDQIIKTQAEAEALAREI VP+TKICQSQEAEA EEASTSTS+
Subjt:  EAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

TrEMBL top hitse value%identityAlignment
A0A0A0LFI0 Cinnamoyl-CoA reductase 116.6e-14176.82Show/hide
Query:  MGIVGAE--ECRRVELEELRRISERACMLASRRKD-SDEFHGRRVSSN-SSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKL
        MG+ G E  + RR +LE+LRRI E   MLAS RKD SD+F+GRRV SN SSSD+DE+  DELVCVTSGVSLLGLA+VNQLL RGFSVRIL+D+PEDREK+
Subjt:  MGIVGAE--ECRRVELEELRRISERACMLASRRKD-SDEFHGRRVSSN-SSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKL

Query:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT
         EM  K+EAGG      K+ TL  +L E  SLANA EGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT
Subjt:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT

Query:  RAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLAT
        RAELPP+VD DCWSD SLC +K            KLWYALGKLKAEKAAWRIA+E+D+KLVTIC A +   P L   NST TIAYLKGAQEMY+QGLLAT
Subjt:  RAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLAT

Query:  VSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        VSV+ LAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALARE+CVPITKICQSQE EA E+ASTSTST
Subjt:  VSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

A0A1S4DUE4 cinnamoyl-CoA reductase 1-like isoform X14.9e-13675.54Show/hide
Query:  MGIVGAE-ECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNP---EDREKL
        MGI GA+ +CR V+LE+LRRISER  M    RKDSD+F+GRRV SNSSSD+DE+PGDELVCVTSGVSLLGLA+VNQLLQRGFSVRILLDNP   ED EK+
Subjt:  MGIVGAE-ECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNP---EDREKL

Query:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLL-ACIWRDG
        REM  KSEAGG      ++STLSA+LTE  SL NA EGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSL         
Subjt:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLL-ACIWRDG

Query:  TRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLA
             PP+VD DCWSDRSLC DK            KLWYALGKLKAEKAAWRIA+E+D+KLVTIC A V   P+L   NST TIAYLKGA+EMYEQGLLA
Subjt:  TRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLA

Query:  TVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        TVSV+RLAEA VNVYEAMGENEAHGRYICFDQIIKT AEAEALAREIC+PITKICQSQ    EEEASTSTS+
Subjt:  TVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

A0A1S4DUG5 cinnamoyl-CoA reductase 1-like isoform X22.3e-13375Show/hide
Query:  MGIVGAE-ECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNP---EDREKL
        MGI GA+ +CR V+LE+LRRISER  M    RKDSD+F+GRRV SNSSSD+DE+PGDELVCVTSGVSLLGLA+VNQLLQRGFSVRILLDNP   ED EK+
Subjt:  MGIVGAE-ECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNP---EDREKL

Query:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLL-ACIWRDG
        REM  KSEAGG      ++STLSA+LTE  SL NA EGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSL         
Subjt:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLL-ACIWRDG

Query:  TRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLA
             PP+VD DCWSDRSLC DK            KLWYALGKLKAEKAAWRIA+E+D+KLVTIC A V   P+L   NST TIAYLK  +EMYEQGLLA
Subjt:  TRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLA

Query:  TVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        TVSV+RLAEA VNVYEAMGENEAHGRYICFDQIIKT AEAEALAREIC+PITKICQSQ    EEEASTSTS+
Subjt:  TVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

A0A5D3BM33 Cinnamoyl-CoA reductase 1-like isoform X11.6e-14779.25Show/hide
Query:  MGIVGAE-ECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNP---EDREKL
        MGI GA+ +CR V+LE+LRRISER  M    RKDSD+F+GRRV SNSSSD+DE+PGDELVCVTSGVSLLGLA+VNQLLQRGFSVRILLDNP   ED EK+
Subjt:  MGIVGAE-ECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNP---EDREKL

Query:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT
        REM  KSEAGG      ++STLSA+LTE  SL NA EGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT
Subjt:  REM--KSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGT

Query:  RAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLAT
        RAELPP+VD DCWSDRSLC DK            KLWYALGKLKAEKAAWRIA+E+D+KLVTIC A V   P+L   NST TIAYLKGA+EMYEQGLLAT
Subjt:  RAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVV-APDLCMRNSTATIAYLKGAQEMYEQGLLAT

Query:  VSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        VSV+RLAEA VNVYEAMGENEAHGRYICFDQIIKT AEAEALAREIC+PITKICQSQ    EEEASTSTS+
Subjt:  VSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

A0A6J1J6K7 cinnamoyl-CoA reductase-like SNL63.8e-16584.66Show/hide
Query:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK
        MGIVGAEECRRVELEELRRISERACMLA+RRK+SDEFHGRRV SNS SD+DE+PGDE VCVTSGVS LGLAVVNQLLQRGFSVRI+LDNPED EKL EM 
Subjt:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMK

Query:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP
        SEAGGI+IND KVST+SANL ECDSLANA EGCRGVFHTSSFIDP+GLTGYSKAM EVEK  SENVMEACARTSSVRYCVFTSSLLACIWRDGTRAEL P
Subjt:  SEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPP

Query:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLC-MRNSTATIAYLKGAQEMYEQGLLATVSVQRL
        +VD DCWSD+SLCVDK            KLWYALGKL+AEKAAWRIA+E+ LKLVTIC A V APDLC MRNSTATIAYLKGA+EMYEQGLLATVSV+RL
Subjt:  IVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLC-MRNSTATIAYLKGAQEMYEQGLLATVSVQRL

Query:  AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST
        AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVP+TKIC+SQ      EASTSTS+
Subjt:  AEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEAEAEEEASTSTST

SwissProt top hitse value%identityAlignment
A0A059TC02 Cinnamoyl-CoA reductase 16.6e-2933.1Show/hide
Query:  ELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMV
        ++VCVT     +   +V  LL++G++VR  + NP+D +     + E    R+  CK     A+L +  SL  A+ GC GVFHT+     S +T   + MV
Subjt:  ELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMV

Query:  EVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVT
        E     ++NV+ A A  ++VR  VFTSS +  ++ D  R +   +VD+ CWSD   C + K             WY  GK+ AE+AAW  A+EK + LV 
Subjt:  EVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVT

Query:  ICPAFVVAPDLCMRNSTAT---IAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAR
        I P  V  P L    + +    + YL G+ + Y   + A V V+ +A AH+ +YE     EA GRY+C + ++      E L++
Subjt:  ICPAFVVAPDLCMRNSTAT---IAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAR

A0A0B6VQ48 Phenylacetaldehyde reductase4.6e-2228.61Show/hide
Query:  DELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAM
        +++VCVT     +   +V  LLQRG++V+  + NP D  K   + +  G       ++    A+L E  S  +A+EGC GVFHT+S      +T     +
Subjt:  DELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAM

Query:  VEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLV
        ++   K + NV+ +C+++ S++  V TSS+ A  +    R   P +V  + W                 C E KLWY L K  AE AAW+  +EK + +V
Subjt:  VEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLV

Query:  TICPAFVVAPDLCMRNSTATIAYL---KGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEA
        TI PA V+ P L    +T+  A L   KGA+  Y       ++V+ +A AHV  +E      A GRY C  + +    E   +  E+   +    Q  E 
Subjt:  TICPAFVVAPDLCMRNSTATIAYL---KGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKICQSQEA

Query:  EAEEEASTSTSTMNSDPKSVNDKRKKNQDHNPPPKDLKLKQKVEDL
         ++++    T         V+ ++ K+      P D+ LK+ +E L
Subjt:  EAEEEASTSTSTMNSDPKSVNDKRKKNQDHNPPPKDLKLKQKVEDL

Q0JKZ0 Cinnamoyl-CoA reductase-like SNL63.1e-7946.44Show/hide
Query:  MGIVGAEECRRVELEELRRI-------SERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDR
        MG++ + +  + E+EE+R            A   A  R  + +   +R +      +    G   VCVT G+S +G AVV++LL+ G++VR+ L+  ED 
Subjt:  MGIVGAEECRRVELEELRRI-------SERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDR

Query:  EKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDG
        +KLREM+      R     V T+ AN+T+ +SL  A +GC GVFHTS+F+DP G++GY+K M  +E K +E V+EAC RT SVR CVFTSSLLAC+WR  
Subjt:  EKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDG

Query:  TRAE--LPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLL
           +   P I+D++CWSD S C D             KLW+ALGK  AEK AWR AR +DLKLVT+CPA V  P    RNSTA+IAYLKGA+ M   GLL
Subjt:  TRAE--LPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLL

Query:  ATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVP
        AT SV+ +AEAHV VYEAMG+N A GRYIC+D ++K   E   L R++ +P
Subjt:  ATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVP

Q500U8 Tetraketide alpha-pyrone reductase 11.7e-2430.74Show/hide
Query:  VCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEV
        VCVT     L   +V +LL  G+ V   + +P + +KL  +    G       ++  + A+L E  S  NA+ GC+GVFHT+S +    L   S    E+
Subjt:  VCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEV

Query:  EKKVSE---NVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLV
         +   E   NV+ +C +  S++  V TSS      RD    ++P  +D+  W+             V  C   ++WYAL K  AE+AAW+ + E  + LV
Subjt:  EKKVSE---NVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLV

Query:  TICPAFVVA----PDLCMRNSTATIAYLKGAQEMYE-QGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEAL-AREICVPITK
        T+ P+F+V     PDLC   ++  +  LKG  E ++  G +  V +  +A  H+ V+E      A GRYIC   +I  +     L AR   +PI K
Subjt:  TICPAFVVA----PDLCMRNSTATIAYLKGAQEMYE-QGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEAL-AREICVPITK

Q9S9N9 Cinnamoyl-CoA reductase 14.9e-2430.56Show/hide
Query:  PGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSK
        P  + VCVT     +   +V  LL+RG++V+  + NP+D +     + E G  R+  CK     A+L + ++L  A++GC GVFHT+     S +T   +
Subjt:  PGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSK

Query:  AMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLK
         MVE     ++ V+ A A  + V+  V TSS +  ++ D  R +   +VD+ CWSD   C + K             WY  GK+ AE+AAW  A+EK + 
Subjt:  AMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLK

Query:  LVTICPAFVVAPDLCMRNSTA---TIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAR---EICVPITKIC
        LV + P  V+ P L    + +    + YL G+ + Y     A V V+ +A AHV VYEA     A GRY+  +         E LA+   E  +P TK  
Subjt:  LVTICPAFVVAPDLCMRNSTA---TIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAR---EICVPITKIC

Query:  QSQEAEAEEEASTS----------TSTMNSDPKSVNDKRKKNQDHNPPPKDLKLKQKVED
          +   A+    T+          TST  S   +V   ++K     PPP     ++ VE+
Subjt:  QSQEAEAEEEASTS----------TSTMNSDPKSVNDKRKKNQDHNPPPKDLKLKQKVED

Arabidopsis top hitse value%identityAlignment
AT1G09510.1 NAD(P)-binding Rossmann-fold superfamily protein2.3e-2932.52Show/hide
Query:  GDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKA
        G ++VCVT     +   +V  LL RG++VR  + +P D +K   + +  G       K+    A+L E  S   A+EGC  VFHT+S +  + +T     
Subjt:  GDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKA

Query:  MVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKL
        +++   K + NV++ CA+ SSV+  + TSS+ A ++R+ T      +VD+ C+SD + C +K            KLWYAL K  AE  AWR A+EK L L
Subjt:  MVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKL

Query:  VTICPAFVVAPDL--CMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALARE
        V I P  V+ P L   +  S   I  L   ++ +       V V+ +A AH+  +E      A+GRYI    ++ T  + E + RE
Subjt:  VTICPAFVVAPDL--CMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALARE

AT2G23910.1 NAD(P)-binding Rossmann-fold superfamily protein1.4e-3934.53Show/hide
Query:  CVTSGVSLLGLAVVNQLLQRGFSVRILL-DNPED--REKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMV
        CV    + +G  ++ +LL RG+SV   +  N E    EK+R+M++       N+ ++     ++ +  S+  +L  C  VF      +P G     +  V
Subjt:  CVTSGVSLLGLAVVNQLLQRGFSVRILL-DNPED--REKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMV

Query:  EVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRD--GTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKL
        ++E + + NV+EACART S+   VF+SSL A IWRD  GT+ +    VD+ CWSD   C+ K            KLW+AL K ++EKAAW +A ++ + +
Subjt:  EVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRD--GTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKL

Query:  VTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEA----EALAREICVPITKICQSQ
        V++ P  +V P +   N   T++YLKGA +MYE G+LA V V+ +A+ H+  +E   +  A GRY CF+QI+ T+ EA    + L+  I +P     + Q
Subjt:  VTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEA----EALAREICVPITKICQSQ

Query:  EAEAEEE
         +E  EE
Subjt:  EAEAEEE

AT4G30470.1 NAD(P)-binding Rossmann-fold superfamily protein3.4e-4135.71Show/hide
Query:  DEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILL---DNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSG
        D D      CV    + +G  ++ +LL RG+SV   +      E  E +REM++    + + D        ++ +  S+  +L+ C  VF     +D   
Subjt:  DEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILL---DNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSG

Query:  LTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRD--GTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWR
          GY +  V++E + + NV+EAC RT S+   VF+SSL A IWRD  GT+ +    VD+ CWSD+  C  K            KLW+AL K+ +EKAAW 
Subjt:  LTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRD--GTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWR

Query:  IAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREI
        +A ++ L +V+I P  VV P +   N+  T++YLKGA +MYE G+LA V V+ LA+ H+  +E   +  A GRY CF+QI+ T+ EA  L   +
Subjt:  IAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREI

AT5G14700.1 NAD(P)-binding Rossmann-fold superfamily protein7.3e-9252.03Show/hide
Query:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRR--VSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLRE
        M IV A E    EL+E        C    RRKD D F G R    S ++ D D D G+ LVCVT GVS LG A+V +LL  G+SVRI++D PED+EK+ E
Subjt:  MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRR--VSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLRE

Query:  MKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAEL
        M+++A     ++   S +S  LTE DSL  A +GC GVFHTS+F+DP+G++GYSK+M E+E KVSE+V+EAC RT+SVR CVFTSSLLAC W+      L
Subjt:  MKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAEL

Query:  P-PIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQ
           +++++ WSD  LC+D             KLWYALGKLKAEKAAWRIA  K LKL TICPA +  PD   RNST+T+AYLKGA+EMY  GLLAT+ V 
Subjt:  P-PIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQ

Query:  RLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKIC---QSQEAEAEEEASTSTS
        RLA+AHV ++E +G   A GRYICFD I+     AE LA++I V I KIC      +A  E EAS   S
Subjt:  RLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREICVPITKIC---QSQEAEAEEEASTSTS

AT5G19440.1 NAD(P)-binding Rossmann-fold superfamily protein1.4e-2630.41Show/hide
Query:  ELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSS-FIDPSGLTGYSKAM
        ++VCVT     +   +V  LL RG++V+  + +P D +K + + S  G       ++    A+L E  S  +A++GC GVFHT+S F + +        +
Subjt:  ELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRINDCKVSTLSANLTECDSLANALEGCRGVFHTSS-FIDPSGLTGYSKAM

Query:  VEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLV
        ++   K + NV+ +CA+ SSV+  V TSS +A +  +G        VD+  +SD  LC               K+WY L K  AE AAW++A+EK L +V
Subjt:  VEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKIYVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLV

Query:  TICPAFVVAPDL--CMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREIC--VPITKIC
        TI PA V+ P L   +  S A I  L    + +       V+V+ +A AH+  +E      A+GRY   ++++   +E   + RE+   +P+ + C
Subjt:  TICPAFVVAPDL--CMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQAEAEALAREIC--VPITKIC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATTGTAGGGGCGGAAGAGTGCCGGAGAGTGGAGTTGGAGGAGCTCCGCCGCATCTCCGAACGTGCCTGTATGCTTGCCAGTCGCCGGAAGGATTCTGATGAATT
CCATGGAAGGAGAGTTTCGTCCAACTCCTCCAGTGACTCCGACGAGGACCCCGGCGACGAGCTCGTTTGCGTCACTAGTGGCGTCTCGCTCTTAGGCCTTGCGGTTGTGA
ATCAACTCCTGCAGCGTGGATTTTCCGTTCGGATCCTCCTCGATAATCCAGAAGATAGAGAAAAACTGAGAGAGATGAAATCAGAAGCAGGTGGGATTAGAATAAACGAT
TGTAAAGTGTCGACATTGTCGGCCAATTTGACGGAGTGCGATAGCTTAGCAAATGCTCTTGAAGGCTGTCGCGGGGTTTTCCATACCTCTTCTTTCATCGATCCTTCTGG
ACTCACTGGCTATTCGAAAGCCATGGTTGAGGTAGAAAAGAAGGTGAGTGAGAATGTTATGGAGGCCTGTGCAAGAACATCCTCTGTAAGATACTGTGTTTTTACCTCTT
CACTTTTGGCTTGCATATGGCGTGATGGTACACGAGCCGAGCTCCCCCCGATCGTCGACCAAGATTGTTGGAGCGATCGATCACTATGCGTCGATAAAAAGAAGAAAATT
TATGTTGGTACTTGTTTGGAATTGAAGCTATGGTATGCTTTGGGGAAGCTAAAGGCAGAGAAGGCAGCATGGAGAATTGCAAGAGAAAAAGATTTAAAGTTAGTCACAAT
TTGTCCTGCCTTCGTAGTAGCTCCTGACCTTTGTATGAGAAATTCAACTGCAACAATTGCATATCTCAAAGGAGCGCAAGAGATGTACGAGCAAGGGCTACTGGCGACGG
TGAGCGTGCAAAGATTAGCCGAGGCACATGTGAATGTGTATGAGGCAATGGGAGAAAACGAAGCACATGGAAGATACATTTGTTTTGATCAAATAATCAAAACACAAGCT
GAGGCAGAGGCTTTGGCAAGGGAGATATGTGTGCCTATCACCAAAATTTGCCAATCTCAAGAAGCGGAAGCAGAAGAAGAAGCATCAACAAGCACTTCTACGATGAATTC
AGATCCGAAATCCGTGAACGATAAGCGCAAGAAAAATCAAGATCATAATCCGCCACCAAAGGATCTGAAATTGAAACAAAAAGTTGAAGACCTCAATTATCAGCCAACGG
ATTTGAAAATGAAACAAAAAGTTGAAGATCTCTATGACTATGAAGATTCATTTTCATTATATCCTCAATATCTTAAGGAGAAAGAACTGAATCTAGGAAATATCGGAAGT
GAATTGAATCTAGGAAATATTGGAAGTTTCCTCGAAGTTCGTTTCGAAGGAATTAGAGAAAGGGACGATTCTAGGGCAGAATCATCGAAAAGAAAAAGAATCGCAAAATC
TCCCCAAGTGATTGATTTGAGTTCGAAGTCGCCTAATTTTGAGCCACTGAAATTAATCGAGGAAATCGTGAGGATTTACAGTGATTACATTGAACACATATTTAAAATGG
CGAAGGACAGATTCAACGACGAACAGAGATGGAATTTTGATAAAACTATGTGTAGCGGTCTGGCTGAGATTTTTTCACAAAAAATCGAGAGATTGGGAATCGAATTAAAA
GAAATGAAGAAGAATTCGAATCAAAGAGAGAAGTATTACCTGATAGAGCATCGTGTTCGTCAAATTATGAATCAACTGGAAAAGATGCACGAGCGATTTGATAGCTCTCC
AAATATTAGGGCTTCGTCGAGGAGGAACTGCAGGAAAGAAGAACTGATATTATGTATAGATGAAATTAGTGAGATGAGGAAAGAATTGTATGAGATAATATGGAGAATTG
AAGAACTGAAAAACATTAAAATGAAGAAGAAGACGATGGAGATAAAACAGAGGAATTTGAGAATTGGAAGCAAAGATTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAATTGTAGGGGCGGAAGAGTGCCGGAGAGTGGAGTTGGAGGAGCTCCGCCGCATCTCCGAACGTGCCTGTATGCTTGCCAGTCGCCGGAAGGATTCTGATGAATT
CCATGGAAGGAGAGTTTCGTCCAACTCCTCCAGTGACTCCGACGAGGACCCCGGCGACGAGCTCGTTTGCGTCACTAGTGGCGTCTCGCTCTTAGGCCTTGCGGTTGTGA
ATCAACTCCTGCAGCGTGGATTTTCCGTTCGGATCCTCCTCGATAATCCAGAAGATAGAGAAAAACTGAGAGAGATGAAATCAGAAGCAGGTGGGATTAGAATAAACGAT
TGTAAAGTGTCGACATTGTCGGCCAATTTGACGGAGTGCGATAGCTTAGCAAATGCTCTTGAAGGCTGTCGCGGGGTTTTCCATACCTCTTCTTTCATCGATCCTTCTGG
ACTCACTGGCTATTCGAAAGCCATGGTTGAGGTAGAAAAGAAGGTGAGTGAGAATGTTATGGAGGCCTGTGCAAGAACATCCTCTGTAAGATACTGTGTTTTTACCTCTT
CACTTTTGGCTTGCATATGGCGTGATGGTACACGAGCCGAGCTCCCCCCGATCGTCGACCAAGATTGTTGGAGCGATCGATCACTATGCGTCGATAAAAAGAAGAAAATT
TATGTTGGTACTTGTTTGGAATTGAAGCTATGGTATGCTTTGGGGAAGCTAAAGGCAGAGAAGGCAGCATGGAGAATTGCAAGAGAAAAAGATTTAAAGTTAGTCACAAT
TTGTCCTGCCTTCGTAGTAGCTCCTGACCTTTGTATGAGAAATTCAACTGCAACAATTGCATATCTCAAAGGAGCGCAAGAGATGTACGAGCAAGGGCTACTGGCGACGG
TGAGCGTGCAAAGATTAGCCGAGGCACATGTGAATGTGTATGAGGCAATGGGAGAAAACGAAGCACATGGAAGATACATTTGTTTTGATCAAATAATCAAAACACAAGCT
GAGGCAGAGGCTTTGGCAAGGGAGATATGTGTGCCTATCACCAAAATTTGCCAATCTCAAGAAGCGGAAGCAGAAGAAGAAGCATCAACAAGCACTTCTACGATGAATTC
AGATCCGAAATCCGTGAACGATAAGCGCAAGAAAAATCAAGATCATAATCCGCCACCAAAGGATCTGAAATTGAAACAAAAAGTTGAAGACCTCAATTATCAGCCAACGG
ATTTGAAAATGAAACAAAAAGTTGAAGATCTCTATGACTATGAAGATTCATTTTCATTATATCCTCAATATCTTAAGGAGAAAGAACTGAATCTAGGAAATATCGGAAGT
GAATTGAATCTAGGAAATATTGGAAGTTTCCTCGAAGTTCGTTTCGAAGGAATTAGAGAAAGGGACGATTCTAGGGCAGAATCATCGAAAAGAAAAAGAATCGCAAAATC
TCCCCAAGTGATTGATTTGAGTTCGAAGTCGCCTAATTTTGAGCCACTGAAATTAATCGAGGAAATCGTGAGGATTTACAGTGATTACATTGAACACATATTTAAAATGG
CGAAGGACAGATTCAACGACGAACAGAGATGGAATTTTGATAAAACTATGTGTAGCGGTCTGGCTGAGATTTTTTCACAAAAAATCGAGAGATTGGGAATCGAATTAAAA
GAAATGAAGAAGAATTCGAATCAAAGAGAGAAGTATTACCTGATAGAGCATCGTGTTCGTCAAATTATGAATCAACTGGAAAAGATGCACGAGCGATTTGATAGCTCTCC
AAATATTAGGGCTTCGTCGAGGAGGAACTGCAGGAAAGAAGAACTGATATTATGTATAGATGAAATTAGTGAGATGAGGAAAGAATTGTATGAGATAATATGGAGAATTG
AAGAACTGAAAAACATTAAAATGAAGAAGAAGACGATGGAGATAAAACAGAGGAATTTGAGAATTGGAAGCAAAGATTTGTGA
Protein sequenceShow/hide protein sequence
MGIVGAEECRRVELEELRRISERACMLASRRKDSDEFHGRRVSSNSSSDSDEDPGDELVCVTSGVSLLGLAVVNQLLQRGFSVRILLDNPEDREKLREMKSEAGGIRIND
CKVSTLSANLTECDSLANALEGCRGVFHTSSFIDPSGLTGYSKAMVEVEKKVSENVMEACARTSSVRYCVFTSSLLACIWRDGTRAELPPIVDQDCWSDRSLCVDKKKKI
YVGTCLELKLWYALGKLKAEKAAWRIAREKDLKLVTICPAFVVAPDLCMRNSTATIAYLKGAQEMYEQGLLATVSVQRLAEAHVNVYEAMGENEAHGRYICFDQIIKTQA
EAEALAREICVPITKICQSQEAEAEEEASTSTSTMNSDPKSVNDKRKKNQDHNPPPKDLKLKQKVEDLNYQPTDLKMKQKVEDLYDYEDSFSLYPQYLKEKELNLGNIGS
ELNLGNIGSFLEVRFEGIRERDDSRAESSKRKRIAKSPQVIDLSSKSPNFEPLKLIEEIVRIYSDYIEHIFKMAKDRFNDEQRWNFDKTMCSGLAEIFSQKIERLGIELK
EMKKNSNQREKYYLIEHRVRQIMNQLEKMHERFDSSPNIRASSRRNCRKEELILCIDEISEMRKELYEIIWRIEELKNIKMKKKTMEIKQRNLRIGSKDL