CuGenDBv2

Spg020022 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg020022
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: NAD(P)-binding Rossmann-fold superfamily protein
Genome location: scaffold5:29933846..29946913
RNA-Seq Expression: Spg020022
Synteny: Spg020022
Gene Ontology terms: GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains: IPR002347 - Short-chain dehydrogenase/reductase SDR
IPR020904 - Short-chain dehydrogenase/reductase, conserved site
IPR036291 - NAD(P)-binding domain superfamily
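
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the following Python snippet splits such a string into its parts and computes the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold5:29933846..29946913'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(seqid, start, end)


loc = parse_location("scaffold5:29933846..29946913")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene span on scaffold5 works out to 13068 bp.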


Homology
GenBank top hits (e value, % identity, alignment)
AAK83035.1 CTA [Cucumis sativus] (e value: 2.3e-114; % identity: 81.06)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        MSIQLLPA ARRLEGKVA+ITGGARGIGE TA+LFFKHGAKVVIADI+D LG +L   LG SSS FVHCDVTKEKDVE AVD  V+K+GKLDIM NNAGV
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT
          E+P F+IL+DDP TFQRVVNVNL+GA LGT+HAAR M PA RGSI+TTASICS+IGGIGTHAYTSSKHGVLGL RNAAVDLGRYGIRVNCVSPNVVPT
Subjt:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT

Query:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        +M R LFK+ DG EFPS Y +LK+GD+LREEDV EA +YLGSD SKCVSGLNLIVDGGFTVVNQ
Subjt:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ

KGN64962.1 hypothetical protein Csa_022788 [Cucumis sativus] (e value: 1.2e-115; % identity: 81.82)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        MSIQLLPA ARRLEGKVA+ITGGARGIGE TA+LFFKHGAKVVIADI+D LG +L   LG SSS FVHCDVTKEKDVE AVD  V+K+GKLDIM NNAGV
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT
          E+P F+IL+DDP TFQRVVNVNL+GAFLGTKHAAR M PA RGSI+TTASICS+IGGIGTHAYTSSKHGVLGL RNAAVDLGRYGIRVNCVSPNVVPT
Subjt:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT

Query:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        +M R LFK+ DG EFPS Y +LK+GD+LREEDV EA +YLGSD SKCVSGLNLIVDGGFTVVNQ
Subjt:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ

NP_001295763.1 secoisolariciresinol dehydrogenase [Cucumis sativus] (e value: 3.5e-115; % identity: 81.44)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        MSIQLLPA ARRLEGKVA+ITGGARGIGE TA+LFFKHGAKVVIADI+D LG +L   LG SSS FVHCDVTKEKDVE AVD  V+K+GKLDIM NNAGV
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT
          E+P F+ L+DDP TFQRVVNVNL+GAFLGTKHAAR M PA RGSI+TTASICS+IGGIGTHAYTSSKHGVLGL RNAAVDLGRYGIRVNCVSPNVVPT
Subjt:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT

Query:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        +M R LFK+ DG EFPS Y +LK+GD+LREEDV EA +YLGSD SKCVSGLNLIVDGGFTVVNQ
Subjt:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ

XP_031745373.1 LOW QUALITY PROTEIN: secoisolariciresinol dehydrogenase-like [Cucumis sativus] (e value: 3.9e-114; % identity: 81.06)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        MSIQLLPA ARRLEGKVA+ITGGARGIGE TA+LFFKHGAKVVIADI+D LG +L   LG SSS FVHCDVTKEKDVE AVD  V+K+GKLDIM NNAGV
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT
          E+P F+IL+DDP TFQRVVNVNL+GAFLGTKHAAR M PA RGSI+TTASICS+IG  GTHAYTSSKHGVLGL RNAAVDLGRYGIRVNCVSPNVVPT
Subjt:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT

Query:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        +M R LFK+ DG EFPS Y +LK+GD+LREEDV EA +YLGSD SKCVSGLNLIVDGGFTVVNQ
Subjt:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ

XP_038895003.1 secoisolariciresinol dehydrogenase-like [Benincasa hispida] (e value: 2.6e-118; % identity: 83.02)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSN-HLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAG
        MS QLLPA ARRLEGKVA+ITGGARGIGEHTARLF KHGAKVVIADI D+LGH+L N +LGSSSSSFVHCDVTKEKDVENA+D  V+K+GKLDIMFNNAG
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSN-HLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAG

Query:  VL-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVP
        V  EA  F+IL+DDP TFQ+VVNVNL+GAFLGTKHAAR M P  RGSI+TTASICS+IGGIGTHAYTSSKHGVLGL RN AVDLGRYGIRVNCVSPNVVP
Subjt:  VL-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVP

Query:  TQMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        T+M R LFKL +GDEFPS Y NLK GD+LREEDV EA +YLGSD SKCVSGLNLIVDGGFTVVNQ
Subjt:  TQMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
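
In the alignment blocks above, the middle line between each Query and Subjt row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a minimal sketch of how that line can be read, the Python function below counts identities and positives for a single block. The name `midline_stats` is a hypothetical helper, and the percentage it returns covers only the block it is given, so it is not expected to match the whole-alignment % identity reported for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block."""
    if not query:
        raise ValueError("query line is empty")
    # Scraped midlines often lose trailing spaces; pad back to the query width.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    # A letter on the midline that equals the query residue is an identity.
    identities = sum(
        1 for q, m in zip(query, midline) if m == q and m.isalpha()
    )
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / len(query), 2),
    }


# Short excerpt taken from the first alignment block on this page.
stats = midline_stats("ADIRDELGHSL", "ADI+D LG +L")
print(stats)
```

For the 11-residue excerpt, the function reports 7 identities and 9 positives.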

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LVV9 TASSELSEED2-like protein (e value: 5.9e-116; % identity: 81.82)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        MSIQLLPA ARRLEGKVA+ITGGARGIGE TA+LFFKHGAKVVIADI+D LG +L   LG SSS FVHCDVTKEKDVE AVD  V+K+GKLDIM NNAGV
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT
          E+P F+IL+DDP TFQRVVNVNL+GAFLGTKHAAR M PA RGSI+TTASICS+IGGIGTHAYTSSKHGVLGL RNAAVDLGRYGIRVNCVSPNVVPT
Subjt:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT

Query:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        +M R LFK+ DG EFPS Y +LK+GD+LREEDV EA +YLGSD SKCVSGLNLIVDGGFTVVNQ
Subjt:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ

A0A6J1CM16 secoisolariciresinol dehydrogenase-like (e value: 7.5e-111; % identity: 77.65)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        M++ LLPAAARRLEGKVA+ITGGARGIGE+TARLF KHGAKVVIADI D+LG SLSNHLGSSSSSFVHCDVTKEKDVEN VD TV+K+GKLDIMFNNA +
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  LEAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQ
        L AP +NILE+D S FQ+V+NVNL GAFLGTKHAARAMIPARRGSIIT AS  + +GGI  HA TSSKHGVLGL +NAAVDLGRYGIRVNCVSP  VPTQ
Subjt:  LEAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQ

Query:  MARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQG
        + R+L +L DGD+FP++Y NLK+ D + EEDVAEAALYLGSDA+K VSG NL+VDGGF+V+N G
Subjt:  MARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQG

A0A6J1CMN7 momilactone A synthase-like (e value: 1.2e-111; % identity: 79.85)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        M + LLPAAARRLEGKVALITGGARGIGE+TARLFFKHGAKVVIADI+D+LGHSLSN+L  SSSSFVHCDVT+E DV NAVD TV+K+GKLDIMFNNAG+
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  LEAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQ
            KFNILE++   FQ+V+NVNLLGAFLGTKHAARAM+PARRGSII TAS CS+IGG   HAYTSSKHG++GL +NAAVDLGRYGIRVNCVSP+VVPTQ
Subjt:  LEAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQ

Query:  MARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        M R+LFKL DGDEFP+ YS+LK G +L EED+AEAALYLGSDASK VSG NLIVDGGFTVVN+
Subjt:  MARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ

Q94G09 TASSELSEED2-like protein (e value: 1.7e-115; % identity: 81.44)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        MSIQLLPA ARRLEGKVA+ITGGARGIGE TA+LFFKHGAKVVIADI+D LG +L   LG SSS FVHCDVTKEKDVE AVD  V+K+GKLDIM NNAGV
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT
          E+P F+ L+DDP TFQRVVNVNL+GAFLGTKHAAR M PA RGSI+TTASICS+IGGIGTHAYTSSKHGVLGL RNAAVDLGRYGIRVNCVSPNVVPT
Subjt:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT

Query:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        +M R LFK+ DG EFPS Y +LK+GD+LREEDV EA +YLGSD SKCVSGLNLIVDGGFTVVNQ
Subjt:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ

Q94G10 CTA (e value: 1.1e-114; % identity: 81.06)
Query:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV
        MSIQLLPA ARRLEGKVA+ITGGARGIGE TA+LFFKHGAKVVIADI+D LG +L   LG SSS FVHCDVTKEKDVE AVD  V+K+GKLDIM NNAGV
Subjt:  MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGV

Query:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT
          E+P F+IL+DDP TFQRVVNVNL+GA LGT+HAAR M PA RGSI+TTASICS+IGGIGTHAYTSSKHGVLGL RNAAVDLGRYGIRVNCVSPNVVPT
Subjt:  L-EAPKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPT

Query:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ
        +M R LFK+ DG EFPS Y +LK+GD+LREEDV EA +YLGSD SKCVSGLNLIVDGGFTVVNQ
Subjt:  QMARNLFKLGDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQ

SwissProt top hits (e value, % identity, alignment)
A3F5F0 Secoisolariciresinol dehydrogenase (e value: 8.0e-70; % identity: 54.83)
Query:  AAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSS-SSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKF
        ++  RL+ KVA+ITGGA GIGE TA+LF ++GAKVVIADI D+ G  + N++GS    SFVHCDVTK++DV N VD T+AK GKLDIMF N GVL    +
Subjt:  AAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSS-SSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKF

Query:  NILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIG-THAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNL
        +ILE     F+RV+++N+ GAFL  KHAAR MIPA++GSI+ TASI S   G G +H YT++KH VLGLT +   +LG++GIRVNCVSP VV + +  ++
Subjt:  NILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIG-THAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNL

Query:  FKLGDG--DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN
        F +     +E     +NLK G +LR EDVA+A  YL  D SK VSGLNL++DGG+T  N
Subjt:  FKLGDG--DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN

F1SWA0 Zerumbone synthase (e value: 1.4e-66; % identity: 51.15)
Query:  RLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSS-SSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNILE
        RLEGKVAL+TGGA GIGE  ARLF +HGAK+ I D++DELG  +S  LG    + + HCDVT E DV  AVD T  K+G +DIM NNAG+      +I +
Subjt:  RLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSS-SSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNILE

Query:  DDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNLFKLGD
         D + F++V ++N+ G FLG KHAAR MIP  +GSI++ AS+ S+I G G H YT +KH V+GLT++ A +LGR+GIRVNCVSP  VPT+++       +
Subjt:  DDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNLFKLGD

Query:  GDE--------FPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN
          E        F    +NLK  D L   DVAEA LYL ++ SK VSGLNL++DGGF++ N
Subjt:  GDE--------FPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN

Q7FAE1 Momilactone A synthase (e value: 4.9e-75; % identity: 58.78)
Query:  AAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFN
        A AR+L GKVA+ITGGA GIG  TARLF KHGA+VV+ADI+DELG SL   LG  +SS+VHCDVT E DV  AVD  VA+FGKLD+MFNNAGV   P F 
Subjt:  AAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFN

Query:  ILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNLFK
        + E     F+RV+ VNL+G FLGTKHAAR M PARRGSII+TAS+ S + G  +HAYT+SKH ++G T NAA +LGR+GIRVNCVSP  V T +AR    
Subjt:  ILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNLFK

Query:  LGDG--DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQGNFG
        + D   +   +  +NLK    L+ +D+A AAL+L SD  + VSG NL VDGG +VVN  +FG
Subjt:  LGDG--DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQGNFG

Q94KL7 Secoisolariciresinol dehydrogenase (Fragment) (e value: 2.3e-77; % identity: 57.25)
Query:  QLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEA
        Q+L A ARRLEGKVALITGGA GIGE TA+LF +HGAKV IAD++DELGHS+   +G+S+S+++HCDVT E  V+NAVD TV+ +GKLDIMF+NAG+ + 
Subjt:  QLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEA

Query:  PKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMAR
         +  I++++ + F+RV +VN+ G FL  KHAAR MIPAR G+II+TAS+ S +GG  +HAY  SKH VLGLTRN AV+LG++GIRVNC+SP  +PT + +
Subjt:  PKFNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMAR

Query:  NLFKLGDGDEFPSLYS---NLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN
            + + +EF ++ +   NLK G     EDVA AALYL SD +K VSG NL +DGGF+V N
Subjt:  NLFKLGDGDEFPSLYS---NLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN

Q94KL8 Secoisolariciresinol dehydrogenase (Fragment) (e value: 2.1e-70; % identity: 54.83)
Query:  AAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSS-SSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKF
        ++  RL+ KVA+ITGGA GIGE TA+LF ++GAKVVIADI D+ G  + N++GS    SFVHCDVTK++DV N VD T+AK GKLDIMF N GVL    +
Subjt:  AAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSS-SSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKF

Query:  NILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIG-THAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNL
        +ILE     F+RV+++N+ GAFL  KHAAR MIPA++GSI+ TASI S   G G +H YT++KH VLGLT +   +LG YGIRVNCVSP +V + +  ++
Subjt:  NILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIG-THAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNL

Query:  FKLGDG--DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN
        F +     +E     +NLK G +LR EDVA+A  YL  D SK VSGLNL++DGG+T  N
Subjt:  FKLGDG--DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN

Arabidopsis top hits (e value, % identity, alignment)
AT1G52340.1 NAD(P)-binding Rossmann-fold superfamily protein (e value: 5.4e-61; % identity: 48.68)
Query:  ARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHL----GSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPK
        ++RL GKVALITGGA GIGE   RLF KHGAKV I D++D+LG  +   L       ++ F+H DV  E D+ NAVD  V  FG LDI+ NNAG+  AP 
Subjt:  ARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHL----GSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPK

Query:  FNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNL
         +I     S F+   +VN+ GAFL  KHAAR MIP ++GSI++  S+  ++GG+G H+Y  SKH VLGLTR+ A +LG++GIRVNCVSP  V T++A   
Subjt:  FNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNL

Query:  FKLGDGDE--------FPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN
            +  E        F +  +NLK G  L  +DVA A L+L SD S+ +SG NL++DGGFT  N
Subjt:  FKLGDGDE--------FPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVN

AT2G47140.1 NAD(P)-binding Rossmann-fold superfamily protein (e value: 3.1e-61; % identity: 50)
Query:  AARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNI
        + +RL+GK+ +ITGGA GIG  + RLF +HGA+VVI D++DELG +++  +G   +S+ HCDVT E +VENAV  TV K+GKLD++F+NAGV+E P  +I
Subjt:  AARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNI

Query:  LEDDPSTFQRVVNVNLLGAFLGTKHAARAMI-PARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNLFK
        L+ + +   R + +NL G     KHAARAM+    RGSI+ T S+ + I G   H YT+SKHG+LGL ++A+  LG+YGIRVN V+P  V T +  N FK
Subjt:  LEDDPSTFQRVVNVNLLGAFLGTKHAARAMI-PARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNLFK

Query:  LGDG--DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVV
        +     ++  S  +NLK G VL+   VAEAAL+L SD S  VSG NL VDGG++VV
Subjt:  LGDG--DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVV

AT3G29250.1 NAD(P)-binding Rossmann-fold superfamily protein (e value: 1.6e-60; % identity: 53.17)
Query:  LEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNILEDD
        L+GK+A+ITGGA GIG    RLF  HGAKVVI DI++ELG +L+  +G   +SF  C+VT E DVENAV  TV K GKLD++F+NAGVLEA   ++L+ D
Subjt:  LEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNILEDD

Query:  PSTFQRVVNVNLLGAFLGTKHAARAMIPA-RRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQM--ARNLFKLG
           F R + VN+ GA    KHAAR+M+ +  RGSI+ T SI + IGG G H+YT+SKH +LGL R+A   LG+YGIRVN V+P  V T M  A N   + 
Subjt:  PSTFQRVVNVNLLGAFLGTKHAARAMIPA-RRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQM--ARNLFKLG

Query:  DGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVV
          +E+     NLK G VL+   +AEAAL+L SD S  +SG NL+VDGGF+VV
Subjt:  DGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVV

AT3G29250.2 NAD(P)-binding Rossmann-fold superfamily protein (e value: 4.1e-61; % identity: 53.36)
Query:  RLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNILED
        RL+GK+A+ITGGA GIG    RLF  HGAKVVI DI++ELG +L+  +G   +SF  C+VT E DVENAV  TV K GKLD++F+NAGVLEA   ++L+ 
Subjt:  RLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNILED

Query:  DPSTFQRVVNVNLLGAFLGTKHAARAMIPA-RRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQM--ARNLFKL
        D   F R + VN+ GA    KHAAR+M+ +  RGSI+ T SI + IGG G H+YT+SKH +LGL R+A   LG+YGIRVN V+P  V T M  A N   +
Subjt:  DPSTFQRVVNVNLLGAFLGTKHAARAMIPA-RRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQM--ARNLFKL

Query:  GDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVV
           +E+     NLK G VL+   +AEAAL+L SD S  +SG NL+VDGGF+VV
Subjt:  GDGDEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVV

AT3G51680.1 NAD(P)-binding Rossmann-fold superfamily protein (e value: 1.9e-66; % identity: 52.81)
Query:  RRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSS----FVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPK-
        +RLEGKVA+ITGGA GIG+ T  LF +HGA VVIAD+ +  G SL+  L S  +S    F+ CDV+ E DVEN V+ TVA++G+LDI+FNNAGVL   K 
Subjt:  RRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSS----FVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPK-

Query:  -FNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIP-ARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMAR
          +IL+ D   F  V+ VN+ G  LG KH ARAMI    +G II+TAS+  ++GG+G HAYT+SKH ++GLT+NAA +LG+YGIRVNC+SP  V T M  
Subjt:  -FNILEDDPSTFQRVVNVNLLGAFLGTKHAARAMIP-ARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMAR

Query:  NLFKLGDG-----------DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFT
        N ++   G           +EF    +NLK G+ LR  D+AEAALYL SD SK V+G NL+VDGG T
Subjt:  NLFKLGDG-----------DEFPSLYSNLKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFT


Sequences
CDS sequence
ATGAGCATCCAATTGCTTCCCGCCGCTGCAAGAAGGCTTGAAGGTAAAGTAGCACTGATCACCGGTGGGGCTAGAGGAATTGGTGAACACACTGCAAGGCTCTTCTTCAA
GCATGGAGCGAAGGTCGTGATTGCAGACATCAGAGATGAATTAGGTCACTCTCTTTCCAACCATCTTGGCTCTTCCTCTTCTTCCTTCGTCCACTGCGATGTCACGAAAG
AGAAAGACGTCGAAAACGCCGTTGACGCCACTGTTGCCAAATTTGGGAAGTTGGACATCATGTTCAATAATGCCGGAGTTCTCGAAGCTCCAAAGTTCAACATTCTCGAG
GACGATCCATCGACCTTTCAGAGAGTGGTCAACGTCAACCTTCTCGGTGCGTTTCTGGGCACCAAACACGCTGCGCGGGCGATGATACCAGCCCGTCGAGGGAGCATCAT
TACAACTGCAAGCATATGCTCCATCATTGGCGGGATTGGCACGCATGCATACACAAGCTCGAAGCATGGCGTGTTGGGATTAACCAGGAATGCAGCTGTGGATCTGGGAC
GATATGGGATCAGGGTGAACTGCGTGTCGCCCAACGTAGTGCCCACTCAAATGGCGAGGAACTTGTTTAAGCTCGGAGATGGCGATGAGTTTCCAAGTCTTTATTCGAAT
CTCAAAAGTGGGGATGTTCTGAGGGAAGAAGATGTGGCTGAAGCTGCTCTGTATTTAGGCAGCGATGCGTCTAAGTGTGTGAGTGGGCTCAACTTGATTGTTGATGGAGG
CTTCACTGTTGTCAACCAAGGCAATTTTGGACCACCCGACGCACAAGGAGCTGACGAGGACGTCCGGGCGAAAATAGGGCTGGGAGACCGACCCAAAGGAAGAACCGACC
AAAGGGCCGGGGCCAACTTGGCCCGACCCATATGGTTGGCCTCGGCCCAAGGCCAAGGCCGACCATTTGGCCCGCTTGTGCGGGCTGAGCTCGGTCACCTCCTCTCGGTC
CCTGATGCCTCTAGCAGCCCCGGTTTCGCCTGGTTTGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCAGCATCGGAGGCGTTGGCGCCGTCTGTGG
GGAAGAAAGCTTGCCAAATCTGCATATCGGTCATTCCATGAGTAAGGGAATGGAGAAGAGAAATCAAGACATAAACACAGAAAATTCGGATGGTGACCACCACCAGCGGA
GGTCACGGGAAGAAGGCCGAGATCGACCTCAAACCGAATCTCCTCGTCCTCGGTCTCCACTGCCCTCATCCCGAGAGAAGCAAGCTGATCTAAAATTTGCTGCTCTCGAA
AACAAAATAAGTGCGATGGATCATAATTTGTCCAGGATACTTCGTATCTTGGATAAACCTAGTCTTAGCACTAAAACCCCTGATGAGAGGTTGGTTAGGGATCCGAGGAA
GGGGAAGGAGCCCATGGAGCACACTGCAGAATCGGAGACGAGATCGAAGGGAAAGAAGACTAGCAGCATGACCAGCAAGATCAGGGGGCTCAAGCCTACTGATCGTACAA
TTTTGAGGAGCCCAGAGTCAAGCACACTTAGGGGACGAGACTACACAGTTTCTACCCCAAGCTATGGTCATACTAAGACAGACCTAAGGAATCTGATCGAGGAGAAGCGC
AGAAGTGCCAAAACTGTCGAATCCGAGGCCAGAGCTGCCGAAGCTGAAGCCAGGGCTGCAGAGGCCGAGGCCAAGAAAGACAATCCCCCTTGGAAGACCGAGCTTTTAAA
CACACTAAAGGAACTCGGAAATCCTCAGGGAGACCTGCAGAAGTCAAGGGACCTCGGAGACCAAGACTTGGAAGAACTAATCGACCGGGTCGACCCGCCCTTCACAGAAG
AAAAATACATGAGCGCAGAAGAGTTGCTGAAGTCGAAGAAGTCAGAACGTGAACACAAGAGGTCTTCTTCATCTGACCACGACACTAGGAAGGACAAAAAGCAGCGGACC
GACGACCATGGCCAAGGCCGACCAGACCGAGCACATCCCTTTGGTAAGTTCGAGAAATATACACCAACAGCTGTTCCACAGGAGCAAGTATTGATGGAGATCCGAAATAC
GGGACTCCTAAAATTCCCGGCGAGGATGAAGTCGAGTCCCGATAGAAGAGACAAGAGCCAGTATTGCCTTTTCCACCGAGACCATGGGCATTCAACCAGGAATTGTATTC
AGTTGAAGGATGAAATCGAAGCATTGATCCAAAATGGGTATCTGAAGGAGTTCGTCGGCGAGCCCAAGGCCGAGGCCGACCAGGGATGGCCGAGGCCGAGGCCGACCAAA
GATGGCCGAGACAAAGAAGAACCCCTACGAGAGATCAGAACCATCTTTGGAGGACCAGCTGGAGGAGGTTCAAGCAGGAAGAGAAAAGCTATTGCCAGGGAAGCAAGGAG
TGTGCACAAGACAACCGATTCAAGCATCACGAAACGCGAGCCTGGACCGCCGAGTCTACCTCGCCCCATCGGCTTGAGACAATGTGAGCCTGGACCCTGGGCGAGCCTGG
ACTACCGAGTCTACCTCGTCTCATCCAGATTGAGACTATGTGAGCCTCGACCACCGAGTCTACCTCACCTCATCTGGATTGAGCATCCTCACATCGGGCGAGCCTGGACT
ACCGAGTCTACCTCGCCTCATCCAAACTTGAGATTGCATGAGCCTGGACCACCGAAGTCTACCTCACCCGAGTCTAGCTCACCTGGCTGGGGCAGAAATCGAGCACCTCT
TGCCAAAGCCGAGCACCTCTTGCCGAGGCCGAGCACAAACTTTAAAGGACTAATATCTGGATGCAAGGAGGATCGTGAGATCTTCCATTTTTCCTACATTTTACTTGTGG
ATAAGCCCTGGATCCGGAATGATGCACTCCGAATGACCAAGTACGGGGGTCAAAAGGGGGATGGCAGTGCTCGGCCTCTTCCCGAGGCCGACCAGCACCACTTTTGTAAT
CCCTCATCTTTTCTCATTTTTATCTCACCTAACCCCTATTCAGGGAGTTTTGGCCGGGCCAACTTGGCCCGACCATTTGGCCCGCTTGTGCGGGCCGAGCTCGGTCACCT
CCTCTCGGTCCCTGATGCCTCTAGCAGCCCCGGTTTCGCCTGGTTTGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCAGGCCACGTCTTCCCCCGC
TCTCAAATAAATCACTGTCGATTATCACGTGGAGCGAAGGCAGTAGTCACCTGCTAATTGATATCACGCCGGAGAAGATTGTTGTAGCAGTGGCTGTCGTGGGTCTCGAT
TTTGAGGACAAAATTGGTGTGGGTCTCTGGTTTTCTGTGGGTCTCTGTTCAGATATTGGTTTTTCTCGCCAGCTTGGGTCCTTCAGCAGTCGCCGACGTGAGTCTTTGCA
ACAGTCACCGTGGGTCTTGATTTGGAGACGATTACACGCTGTTTTGGGTCTTCGTTCAGATCAGATGTGGGTCTTGGATTGA
mRNA sequence
ATGAGCATCCAATTGCTTCCCGCCGCTGCAAGAAGGCTTGAAGGTAAAGTAGCACTGATCACCGGTGGGGCTAGAGGAATTGGTGAACACACTGCAAGGCTCTTCTTCAA
GCATGGAGCGAAGGTCGTGATTGCAGACATCAGAGATGAATTAGGTCACTCTCTTTCCAACCATCTTGGCTCTTCCTCTTCTTCCTTCGTCCACTGCGATGTCACGAAAG
AGAAAGACGTCGAAAACGCCGTTGACGCCACTGTTGCCAAATTTGGGAAGTTGGACATCATGTTCAATAATGCCGGAGTTCTCGAAGCTCCAAAGTTCAACATTCTCGAG
GACGATCCATCGACCTTTCAGAGAGTGGTCAACGTCAACCTTCTCGGTGCGTTTCTGGGCACCAAACACGCTGCGCGGGCGATGATACCAGCCCGTCGAGGGAGCATCAT
TACAACTGCAAGCATATGCTCCATCATTGGCGGGATTGGCACGCATGCATACACAAGCTCGAAGCATGGCGTGTTGGGATTAACCAGGAATGCAGCTGTGGATCTGGGAC
GATATGGGATCAGGGTGAACTGCGTGTCGCCCAACGTAGTGCCCACTCAAATGGCGAGGAACTTGTTTAAGCTCGGAGATGGCGATGAGTTTCCAAGTCTTTATTCGAAT
CTCAAAAGTGGGGATGTTCTGAGGGAAGAAGATGTGGCTGAAGCTGCTCTGTATTTAGGCAGCGATGCGTCTAAGTGTGTGAGTGGGCTCAACTTGATTGTTGATGGAGG
CTTCACTGTTGTCAACCAAGGCAATTTTGGACCACCCGACGCACAAGGAGCTGACGAGGACGTCCGGGCGAAAATAGGGCTGGGAGACCGACCCAAAGGAAGAACCGACC
AAAGGGCCGGGGCCAACTTGGCCCGACCCATATGGTTGGCCTCGGCCCAAGGCCAAGGCCGACCATTTGGCCCGCTTGTGCGGGCTGAGCTCGGTCACCTCCTCTCGGTC
CCTGATGCCTCTAGCAGCCCCGGTTTCGCCTGGTTTGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCAGCATCGGAGGCGTTGGCGCCGTCTGTGG
GGAAGAAAGCTTGCCAAATCTGCATATCGGTCATTCCATGAGTAAGGGAATGGAGAAGAGAAATCAAGACATAAACACAGAAAATTCGGATGGTGACCACCACCAGCGGA
GGTCACGGGAAGAAGGCCGAGATCGACCTCAAACCGAATCTCCTCGTCCTCGGTCTCCACTGCCCTCATCCCGAGAGAAGCAAGCTGATCTAAAATTTGCTGCTCTCGAA
AACAAAATAAGTGCGATGGATCATAATTTGTCCAGGATACTTCGTATCTTGGATAAACCTAGTCTTAGCACTAAAACCCCTGATGAGAGGTTGGTTAGGGATCCGAGGAA
GGGGAAGGAGCCCATGGAGCACACTGCAGAATCGGAGACGAGATCGAAGGGAAAGAAGACTAGCAGCATGACCAGCAAGATCAGGGGGCTCAAGCCTACTGATCGTACAA
TTTTGAGGAGCCCAGAGTCAAGCACACTTAGGGGACGAGACTACACAGTTTCTACCCCAAGCTATGGTCATACTAAGACAGACCTAAGGAATCTGATCGAGGAGAAGCGC
AGAAGTGCCAAAACTGTCGAATCCGAGGCCAGAGCTGCCGAAGCTGAAGCCAGGGCTGCAGAGGCCGAGGCCAAGAAAGACAATCCCCCTTGGAAGACCGAGCTTTTAAA
CACACTAAAGGAACTCGGAAATCCTCAGGGAGACCTGCAGAAGTCAAGGGACCTCGGAGACCAAGACTTGGAAGAACTAATCGACCGGGTCGACCCGCCCTTCACAGAAG
AAAAATACATGAGCGCAGAAGAGTTGCTGAAGTCGAAGAAGTCAGAACGTGAACACAAGAGGTCTTCTTCATCTGACCACGACACTAGGAAGGACAAAAAGCAGCGGACC
GACGACCATGGCCAAGGCCGACCAGACCGAGCACATCCCTTTGGTAAGTTCGAGAAATATACACCAACAGCTGTTCCACAGGAGCAAGTATTGATGGAGATCCGAAATAC
GGGACTCCTAAAATTCCCGGCGAGGATGAAGTCGAGTCCCGATAGAAGAGACAAGAGCCAGTATTGCCTTTTCCACCGAGACCATGGGCATTCAACCAGGAATTGTATTC
AGTTGAAGGATGAAATCGAAGCATTGATCCAAAATGGGTATCTGAAGGAGTTCGTCGGCGAGCCCAAGGCCGAGGCCGACCAGGGATGGCCGAGGCCGAGGCCGACCAAA
GATGGCCGAGACAAAGAAGAACCCCTACGAGAGATCAGAACCATCTTTGGAGGACCAGCTGGAGGAGGTTCAAGCAGGAAGAGAAAAGCTATTGCCAGGGAAGCAAGGAG
TGTGCACAAGACAACCGATTCAAGCATCACGAAACGCGAGCCTGGACCGCCGAGTCTACCTCGCCCCATCGGCTTGAGACAATGTGAGCCTGGACCCTGGGCGAGCCTGG
ACTACCGAGTCTACCTCGTCTCATCCAGATTGAGACTATGTGAGCCTCGACCACCGAGTCTACCTCACCTCATCTGGATTGAGCATCCTCACATCGGGCGAGCCTGGACT
ACCGAGTCTACCTCGCCTCATCCAAACTTGAGATTGCATGAGCCTGGACCACCGAAGTCTACCTCACCCGAGTCTAGCTCACCTGGCTGGGGCAGAAATCGAGCACCTCT
TGCCAAAGCCGAGCACCTCTTGCCGAGGCCGAGCACAAACTTTAAAGGACTAATATCTGGATGCAAGGAGGATCGTGAGATCTTCCATTTTTCCTACATTTTACTTGTGG
ATAAGCCCTGGATCCGGAATGATGCACTCCGAATGACCAAGTACGGGGGTCAAAAGGGGGATGGCAGTGCTCGGCCTCTTCCCGAGGCCGACCAGCACCACTTTTGTAAT
CCCTCATCTTTTCTCATTTTTATCTCACCTAACCCCTATTCAGGGAGTTTTGGCCGGGCCAACTTGGCCCGACCATTTGGCCCGCTTGTGCGGGCCGAGCTCGGTCACCT
CCTCTCGGTCCCTGATGCCTCTAGCAGCCCCGGTTTCGCCTGGTTTGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCAGGCCACGTCTTCCCCCGC
TCTCAAATAAATCACTGTCGATTATCACGTGGAGCGAAGGCAGTAGTCACCTGCTAATTGATATCACGCCGGAGAAGATTGTTGTAGCAGTGGCTGTCGTGGGTCTCGAT
TTTGAGGACAAAATTGGTGTGGGTCTCTGGTTTTCTGTGGGTCTCTGTTCAGATATTGGTTTTTCTCGCCAGCTTGGGTCCTTCAGCAGTCGCCGACGTGAGTCTTTGCA
ACAGTCACCGTGGGTCTTGATTTGGAGACGATTACACGCTGTTTTGGGTCTTCGTTCAGATCAGATGTGGGTCTTGGATTGA
Protein sequence
MSIQLLPAAARRLEGKVALITGGARGIGEHTARLFFKHGAKVVIADIRDELGHSLSNHLGSSSSSFVHCDVTKEKDVENAVDATVAKFGKLDIMFNNAGVLEAPKFNILE
DDPSTFQRVVNVNLLGAFLGTKHAARAMIPARRGSIITTASICSIIGGIGTHAYTSSKHGVLGLTRNAAVDLGRYGIRVNCVSPNVVPTQMARNLFKLGDGDEFPSLYSN
LKSGDVLREEDVAEAALYLGSDASKCVSGLNLIVDGGFTVVNQGNFGPPDAQGADEDVRAKIGLGDRPKGRTDQRAGANLARPIWLASAQGQGRPFGPLVRAELGHLLSV
PDASSSPGFAWFVPKRLRIPKNPRSMSSIGGVGAVCGEESLPNLHIGHSMSKGMEKRNQDINTENSDGDHHQRRSREEGRDRPQTESPRPRSPLPSSREKQADLKFAALE
NKISAMDHNLSRILRILDKPSLSTKTPDERLVRDPRKGKEPMEHTAESETRSKGKKTSSMTSKIRGLKPTDRTILRSPESSTLRGRDYTVSTPSYGHTKTDLRNLIEEKR
RSAKTVESEARAAEAEARAAEAEAKKDNPPWKTELLNTLKELGNPQGDLQKSRDLGDQDLEELIDRVDPPFTEEKYMSAEELLKSKKSEREHKRSSSSDHDTRKDKKQRT
DDHGQGRPDRAHPFGKFEKYTPTAVPQEQVLMEIRNTGLLKFPARMKSSPDRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPKAEADQGWPRPRPTK
DGRDKEEPLREIRTIFGGPAGGGSSRKRKAIAREARSVHKTTDSSITKREPGPPSLPRPIGLRQCEPGPWASLDYRVYLVSSRLRLCEPRPPSLPHLIWIEHPHIGRAWT
TESTSPHPNLRLHEPGPPKSTSPESSSPGWGRNRAPLAKAEHLLPRPSTNFKGLISGCKEDREIFHFSYILLVDKPWIRNDALRMTKYGGQKGDGSARPLPEADQHHFCN
PSSFLIFISPNPYSGSFGRANLARPFGPLVRAELGHLLSVPDASSSPGFAWFVPKRLRIPKNPRSMSRPRLPPLSNKSLSIITWSEGSSHLLIDITPEKIVVAVAVVGLD
FEDKIGVGLWFSVGLCSDIGFSRQLGSFSSRRRESLQQSPWVLIWRRLHAVLGLRSDQMWVLD