; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012207 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012207
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionNAD(P)-binding Rossmann-fold superfamily protein
Genome locationchr1:38641110..38645518
RNA-Seq ExpressionLag0012207
SyntenyLag0012207
Gene Ontology termsGO:0004316 - 3-oxoacyl-[acyl-carrier-protein] reductase (NADPH) activity (molecular function)
InterPro domainsIPR002347 - Short-chain dehydrogenase/reductase SDR
IPR036291 - NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060116.1 short-chain type dehydrogenase/reductase [Cucumis melo var. makuwa]1.7e-11387.5Show/hide
Query:  AAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI
        AA LPL GRTAIVTGASRGIGRAIA+HLHSLGANL+LNYASNSTQADLLAS+LN+SS PLRRAVAVQADVSDPDQVKRLFDSA+KEFGSEIHILVNSAGI
Subjt:  AAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI

Query:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD
        LDSKYP+LAGTAVEDWD  F+VNCRGAFLVCKEA NR++ GGGGRI+LITTSIV SLPPGYGAYAASKAAVEAMAK AAKELKGT +TVNCVAPGPV T+
Subjt:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD

Query:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQG
        LF+AGKSEETVA+  EACP+ RLGQPDDVAKVVGFLATD G WVNGQG
Subjt:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQG

KAG6595510.1 NADPH-dependent aldehyde reductase-like protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.2e-11487.95Show/hide
Query:  AAAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSA
        AAAA LPL GR A+VTGASRGIGRAIAVHLHSLGANL+LNYASNS QA++LASELN+SS P +R +AVQADVSDP QVKRLFDSA +E GSEIHI+VN+A
Subjt:  AAAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSA

Query:  GILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVA
        GILDSKYP+LAGTAVEDWD  F+VNCRGAFLVCKEAANRL+ GGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVA
Subjt:  GILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVA

Query:  TDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        T+LFFAGKSEE VA+SV+ACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
Subjt:  TDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

XP_022924880.1 NADPH-dependent aldehyde reductase-like protein, chloroplastic [Cucurbita moschata]4.6e-11487.5Show/hide
Query:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAG
        AAA LPL GR A+VTGASRGIGRAIAVHLHSLGANL+LNYASNS QA++LASELN+SS P +R +AVQADVSDP QVKRLFDSA +E GSEIHI+VN+AG
Subjt:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAG

Query:  ILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT
        ILDSKYP+LAGTA+EDWD  F+VNCRGAFLVCKEAANRL+ GGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT
Subjt:  ILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT

Query:  DLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        +LFFAGKSEE VA+SV+ACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
Subjt:  DLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

XP_023518082.1 NADPH-dependent aldehyde reductase-like protein, chloroplastic [Cucurbita pepo subsp. pepo]1.2e-11488.31Show/hide
Query:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAG
        AAA LPL GR A+VTGASRGIGRAIAVHLHSLGANL+LNYASNS QA+LLASELN+SS P +R +AVQADVSDP QVKRLFDSA +E GSEIHI+VN+AG
Subjt:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAG

Query:  ILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT
        ILDSKYP+LAGTAVEDWD  F+VNCRGAFLVCKEAANRL+ GGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT
Subjt:  ILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT

Query:  DLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        +LFFAGKSEE VA+SV+ACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
Subjt:  DLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

XP_038883178.1 NADPH-dependent aldehyde reductase-like protein, chloroplastic [Benincasa hispida]2.0e-11789.16Show/hide
Query:  AAAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSA
        A +AALPL GRTAIVTGASRGIGRAIAVHLH LGANL++NYASNS+QADLLASELN+SS PLRRAV VQADVSDPDQVKRLFDSA+KEFGSEIHI+VNSA
Subjt:  AAAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSA

Query:  GILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVA
        GILDSKYP+LAGTAVEDWD AF+VNCRGAFLVCKEA+NRLR GGGGR++L+TTSIVGSLPPGYGAYAASKAAVEAMAK AAKELKGTEITVNCVAPGPVA
Subjt:  GILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVA

Query:  TDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        T+LF+AGKSEETVAK  EACPLGRLGQPDDVAKVVGFL TD GGWVNGQ
Subjt:  TDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

TrEMBL top hitse value%identityAlignment
A0A0A0L1D5 Uncharacterized protein2.1e-11286.64Show/hide
Query:  AAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI
        A ALPL GRTAIVTGASRGIGRAIA+HLHSLGANL+LNYASNSTQADLLASELN+SS PLRRAVAVQADVSDPD VKRLFDSA+KEFGSEIHILVNSAGI
Subjt:  AAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI

Query:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD
        LDSKYP+L  T VEDWD  F+VNCRGAFLVCKEA NR++ GGGGRI+LITTSIV SLPPGYGAYAASKAAVEAMAK AAKELKGT ITVNCVAPGPVAT+
Subjt:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD

Query:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        LF+AGKSEETVA+  EACP+GRLGQPDD+AKVVGFL TD G WVNGQ
Subjt:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

A0A1S3B4M7 short-chain type dehydrogenase/reductase3.2e-11387.45Show/hide
Query:  AAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI
        AA LPL GRTAIVTGASRGIGRAIA+HLHSLGANL+LNYASNSTQADLLAS+LN+SS PLRRAVAVQADVSDPDQVKRLFDSA+KEFGSEIHILVNSAGI
Subjt:  AAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI

Query:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD
        LDSKYP+LAGTAVEDWD  F+VNCRGAFLVCKEA NR++ GGGGRI+LITTSIV SLPPGYGAYAASKAAVEAMAK AAKELKGT +TVNCVAPGPV T+
Subjt:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD

Query:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        LF+AGKSEETVA+  EACP+ RLGQPDDVAKVVGFLATD G WVNGQ
Subjt:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

A0A5A7V0W1 Short-chain type dehydrogenase/reductase8.4e-11487.5Show/hide
Query:  AAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI
        AA LPL GRTAIVTGASRGIGRAIA+HLHSLGANL+LNYASNSTQADLLAS+LN+SS PLRRAVAVQADVSDPDQVKRLFDSA+KEFGSEIHILVNSAGI
Subjt:  AAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI

Query:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD
        LDSKYP+LAGTAVEDWD  F+VNCRGAFLVCKEA NR++ GGGGRI+LITTSIV SLPPGYGAYAASKAAVEAMAK AAKELKGT +TVNCVAPGPV T+
Subjt:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD

Query:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQG
        LF+AGKSEETVA+  EACP+ RLGQPDDVAKVVGFLATD G WVNGQG
Subjt:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQG

A0A6J1EGB3 NADPH-dependent aldehyde reductase-like protein, chloroplastic2.2e-11487.5Show/hide
Query:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAG
        AAA LPL GR A+VTGASRGIGRAIAVHLHSLGANL+LNYASNS QA++LASELN+SS P +R +AVQADVSDP QVKRLFDSA +E GSEIHI+VN+AG
Subjt:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAG

Query:  ILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT
        ILDSKYP+LAGTA+EDWD  F+VNCRGAFLVCKEAANRL+ GGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT
Subjt:  ILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT

Query:  DLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        +LFFAGKSEE VA+SV+ACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
Subjt:  DLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

A0A6J1HS50 NADPH-dependent aldehyde reductase-like protein, chloroplastic8.4e-11487.2Show/hide
Query:  MAAAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNS
        MA AAALPL GR A+VTGASRGIGRAIAVHLHSLG NL+LNYASNS QA+LLASELN+SS P +R +AVQADVSDP QVKRLFDSA +E GSEIHI+VN+
Subjt:  MAAAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNS

Query:  AGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPV
        AGILDSKYP+LA TAVEDWD  F+VNCRGAFLVCKEAANRL+ GGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPV
Subjt:  AGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPV

Query:  ATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        AT+LFFAGKSEE VA+SV+ACPLGRLGQPDDVAKVV FLATDGGGWVNGQ
Subjt:  ATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

SwissProt top hitse value%identityAlignment
F9XMW6 Probable tetrahydroxynaphthalene reductase MYCGRDRAFT_879942.8e-3437.6Show/hide
Query:  LAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGILDSKY
        L G+ A+VTG+ RGIG A+A+HL + GA +++NYA++   A+ +  E+         A+A+QADV +  Q  +L D A   FG ++ I+ +++G++   +
Subjt:  LAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGILDSKY

Query:  PTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLP--PGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATDLFF
          L     E++DR F++N RG F V +EA   L    GGRII++  SI G     P +  Y+ASK A+E   +  A +    +ITVN VAPG + TD++ 
Subjt:  PTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLP--PGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATDLFF

Query:  A--------GKS---EETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQGFG
        A        G++   EE    +    P+ R+GQP D+AKVVGFLA++ G W+NG+  G
Subjt:  A--------GKS---EETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQGFG

P0CU75 Short chain dehydrogenase claC1.5e-3538.19Show/hide
Query:  LAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGILDSKY
        L GR A+VTG+ RGIG AIAVHL  LGAN+++NYA+++  A  +  ++  +      A+A++AD+ D  Q+ RLFD A   FG  + I V+++G++   +
Subjt:  LAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGILDSKY

Query:  PTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLP-PGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATDLF--
          L     E++DR F +N RG F V +EA   L    GGRII+ +++       P +  Y+ SK AV++  +  +K+    +ITVN VAPG   TD+F  
Subjt:  PTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLP-PGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATDLF--

Query:  ----FAGKSEETVAKSVE-----ACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
            +    E+  A+  +     A PL R G P+D+A VVGFLA+  G W+NG+
Subjt:  ----FAGKSEETVAKSVE-----ACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

Q08632 Short-chain type dehydrogenase/reductase2.5e-7057.89Show/hide
Query:  LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLR---RAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI
        LPL GR AIVTGASRGIGR IA+++   GA ++++Y+SN   A+ +AS +N  SP      RA+  +ADV++P QV +LFD+A+  FG  +HI+VN+AG+
Subjt:  LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLR---RAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGI

Query:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD
         DSKYPTLA T+ E+WDR FQVNC+GAFL  +EAA R+  GGGGRII I++S+V    P YGAY ASKAAVE M +  A+EL+GT+IT NCVAPGPVATD
Subjt:  LDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATD

Query:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        +FFAGKSE  V   V++ P  RLG+ +DVA +V FLA+D G WVN Q
Subjt:  LFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

Q9SQR2 NADPH-dependent aldehyde reductase 2, chloroplastic8.8e-6854.72Show/hide
Query:  MAAAAA-----LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLA----------SELNRSSPPLRRAVAVQADVSDPDQVKRLFDS
        MAAA++     L LAGR AIVTG+SRGIGRAIA+HL  LGA +++NY+++  +A+ +A          +E+   SP   R + V+AD+S+P QVK LFD 
Subjt:  MAAAAA-----LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLA----------SELNRSSPPLRRAVAVQADVSDPDQVKRLFDS

Query:  ADKEFGSEIHILVNSAGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKEL
        A++ F S +HILVNSA I D  + T++  +VE +DR   VN RGAF+  +EAANRL+ GGGGRIIL++TS+V +L   YG+Y ASKAAVEAMAK  AKEL
Subjt:  ADKEFGSEIHILVNSAGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKEL

Query:  KGTEITVNCVAPGPVATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        KGTEITVNCV+PGPVAT++F+ G S E V K       GR+G+  D+A VVGFLA+D G W+NGQ
Subjt:  KGTEITVNCVAPGPVATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

Q9SQR4 NADPH-dependent aldehyde reductase-like protein, chloroplastic2.5e-7861.9Show/hide
Query:  LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLR--------RAVAVQADVSDPDQVKRLFDSADKEFGSEIHILV
        LPLAGR AIVTG+SRGIGRAIA+HL  LGA +++NY S +  A+ +ASE+N    P+R        RA+ VQA+VS+P QVK +FD+A+  F + +HILV
Subjt:  LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLR--------RAVAVQADVSDPDQVKRLFDSADKEFGSEIHILV

Query:  NSAGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPG
        NSAGILD KYPT+A T+VED+D  F VN +GAFL  KEAANRL+ GGGGRIIL+T+S   SL PG+GAYAASKAAVE M K  AKELKGT IT NCVAPG
Subjt:  NSAGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPG

Query:  PVATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        P+AT++FF GK+ E V K     P GR+G+  DV  +VGFLA DGG WVNGQ
Subjt:  PVATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

Arabidopsis top hitse value%identityAlignment
AT1G24360.1 NAD(P)-binding Rossmann-fold superfamily protein1.4e-2334.96Show/hide
Query:  IVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGILDSKYPTLAGT
        ++TGASRGIG+AIA+ L   G  +++NYA ++ +A+ +A ++        +A+    DVS    V  +  +A  ++G+ I ++VN+AGI  ++   L   
Subjt:  IVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGILDSKYPTLAGT

Query:  AVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPP-GYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATDLFFAGKSEET
            WD    +N  G FL  + A   +     GRII I +S+VG +   G   YAA+K  V + +KTAA+E     I VN V PG +A+D+  A   E+ 
Subjt:  AVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPP-GYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATDLFFAGKSEET

Query:  VAKSVEACPLGRLGQPDDVAKVVGFLA-TDGGGWVNGQGF---GGI
          K +   PLGR G+ ++VA +V FLA +    ++ GQ F   GGI
Subjt:  VAKSVEACPLGRLGQPDDVAKVVGFLA-TDGGGWVNGQGF---GGI

AT3G03980.1 NAD(P)-binding Rossmann-fold superfamily protein1.7e-7961.9Show/hide
Query:  LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLR--------RAVAVQADVSDPDQVKRLFDSADKEFGSEIHILV
        LPLAGR AIVTG+SRGIGRAIA+HL  LGA +++NY S +  A+ +ASE+N    P+R        RA+ VQA+VS+P QVK +FD+A+  F + +HILV
Subjt:  LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLR--------RAVAVQADVSDPDQVKRLFDSADKEFGSEIHILV

Query:  NSAGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPG
        NSAGILD KYPT+A T+VED+D  F VN +GAFL  KEAANRL+ GGGGRIIL+T+S   SL PG+GAYAASKAAVE M K  AKELKGT IT NCVAPG
Subjt:  NSAGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPG

Query:  PVATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        P+AT++FF GK+ E V K     P GR+G+  DV  +VGFLA DGG WVNGQ
Subjt:  PVATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

AT3G04000.1 NAD(P)-binding Rossmann-fold superfamily protein6.2e-6954.72Show/hide
Query:  MAAAAA-----LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLA----------SELNRSSPPLRRAVAVQADVSDPDQVKRLFDS
        MAAA++     L LAGR AIVTG+SRGIGRAIA+HL  LGA +++NY+++  +A+ +A          +E+   SP   R + V+AD+S+P QVK LFD 
Subjt:  MAAAAA-----LPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLA----------SELNRSSPPLRRAVAVQADVSDPDQVKRLFDS

Query:  ADKEFGSEIHILVNSAGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKEL
        A++ F S +HILVNSA I D  + T++  +VE +DR   VN RGAF+  +EAANRL+ GGGGRIIL++TS+V +L   YG+Y ASKAAVEAMAK  AKEL
Subjt:  ADKEFGSEIHILVNSAGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKEL

Query:  KGTEITVNCVAPGPVATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        KGTEITVNCV+PGPVAT++F+ G S E V K       GR+G+  D+A VVGFLA+D G W+NGQ
Subjt:  KGTEITVNCVAPGPVATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

AT4G13180.1 NAD(P)-binding Rossmann-fold superfamily protein1.7e-8763.31Show/hide
Query:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAG
        ++++LPLAGR AIVTGA+RG+GR IA+HLHSLGA + +NY S+S++A+LL SELN SS  L+ A+AV+ADVSDPDQ+  LFD  ++EFGS++HI+VN AG
Subjt:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAG

Query:  ILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT
        +LD KYP+L+ T +ED+D  F +N RG+FL CKEAA R+  GGGGRII+++TS+VG L PGYG YAASKAAVE M K  AKELKG+ IT NCVAPGPVAT
Subjt:  ILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVAT

Query:  DLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ
        ++F+AGKS+ETV     ACP+GR+G+  D+ ++VGFLA DGG W+NGQ
Subjt:  DLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGGGWVNGQ

AT5G18210.1 NAD(P)-binding Rossmann-fold superfamily protein1.5e-7861.48Show/hide
Query:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAV--QADVSDPDQVKRLFDSADKEFGSEIHILVNS
        A++   LAGR AIVTG+SRGIGRAIA+HL  LGA +++NY + ST+AD +A+E+N S+  + + +AV   AD+S+P Q+K LFD+A+K F S +HILVNS
Subjt:  AAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAV--QADVSDPDQVKRLFDSADKEFGSEIHILVNS

Query:  AGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPV
        AGIL+  YPT+A T +E++DR F+VN RG+FL CKEAA RL+ GGGGRIIL+T+S+  +L PG GAY ASKAAVEAM K  AKELKG  IT NCV+PGPV
Subjt:  AGILDSKYPTLAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPV

Query:  ATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGG
        AT++FF GKSEETV   +E  P GRLG+  D+A VVGFLA+DGG
Subjt:  ATDLFFAGKSEETVAKSVEACPLGRLGQPDDVAKVVGFLATDGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGCCGCCGCACTGCCTCTCGCCGGCCGCACCGCCATCGTCACCGGCGCCTCCCGAGGGATCGGCCGCGCCATCGCCGTCCACCTCCATTCCCTCGGCGCAAA
CCTCATCCTCAACTACGCTTCCAACTCCACCCAAGCCGATCTCCTCGCATCGGAGCTCAACCGGTCATCGCCGCCGCTCCGGCGAGCCGTCGCCGTCCAGGCCGACGTGT
CTGACCCCGACCAGGTCAAGCGGCTGTTCGATTCGGCCGACAAGGAGTTCGGATCTGAGATCCACATCCTCGTGAACTCCGCTGGGATTCTCGATTCGAAGTACCCGACT
CTGGCCGGGACCGCCGTGGAAGATTGGGATAGGGCGTTTCAGGTGAATTGCAGAGGCGCGTTTTTGGTCTGTAAAGAGGCGGCGAACCGGCTGAGGTGCGGCGGCGGAGG
GAGGATTATTTTGATTACGACGTCGATCGTTGGGTCTCTGCCGCCGGGGTACGGCGCTTATGCGGCGTCGAAGGCGGCGGTGGAGGCGATGGCGAAGACGGCTGCGAAGG
AGCTGAAGGGGACGGAGATTACGGTTAACTGCGTGGCGCCGGGACCGGTGGCGACGGACCTTTTTTTCGCCGGAAAATCTGAGGAGACGGTGGCGAAGTCGGTGGAGGCA
TGCCCGCTCGGCCGTCTCGGGCAGCCCGACGATGTGGCTAAGGTGGTGGGGTTTTTGGCCACTGATGGCGGCGGGTGGGTAAACGGGCAGGGTTTCGGAGGCATTTTCGG
TCAAAGCAGGCAACCCGGGGCAGATGGAAGCAGTGGGGACCTAACGGCACCAGACGGGCTCGGCCCGCGCGAGCGGGCCGAGGTCGGCCTCGGCCATGGCCGAGGCCGAG
CACGGGGTCGGGCCAAAAACCCGACCCCTTCGGTCTTGGCCCGTCCCACTTACCGGTTTTGCTCCTTGGTCCATCTCTCAGCCCGATTTCTCCTCGGTTGCCCTCGTCAG
CTCCTTGGGTTAGATGAAATGGAGATGAAAAAGGGTGAGAGATTGGCCGGTCGGCCTCGGCCTCGGGAAGAGGTCGAGCACAGCTATCTCCCTCTCTGCTTGCCTGGTCA
GCCTCGGCTCGGAAGAGGCCGAGCAGTTTCTAACCCCCGTACTTGGGAGTGCATATCCCCGGTGACTTGCTTTTTGTTTAATAGGGAGTGCATATCCCCTTTGCAGCCCC
CAATTTCAGGTGGTCGGCCTCGGCTTTGGCATGAGGCCGACCAGCTTCCTTATGCTATTTGCTTGCTCCGTAGTTGGGCTTGTGTGCAGATTCACTTTCTGCTTGAACAT
TTCTTTGTTGCTTTGGATATTTCAGTTGTGCAAGTCGAGCCTCACCGCAAGCCCCCCACAGTCGTTCTTAGGGCTAGAATGCTCGGTTTCGGCCTTAGCGTGAGGCTGAG
CATTTTTCCTACATTGCTCGTTATTCTGCTCATCAATCTGCTGTGTTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGTTGAGGGCATGCGTCAAC
CTCTTTGCAAGAGTTTAATCTCCTTCCAAGGGAAAGTGCAGAAGTTCGGCTTCTACAAGAATCACTCTTGTTTGCCATATGTGCCCATCCTATGGAAAAATGCTTTGCAC
AAACATAGTATAAAGAATTTTATCAAATTCGCAAGTGTCTGCATCCTCCACGCTGTTCGTAATCCTAACTTGAATAAGAAAGGAAAGTCAAAATCATTGCAACATGGCCA
TGATTATGATAGCTCCACCGCTGTTCGTAATCCTAACTTGAATAAGAAAGGAAAGTCAAAATCATTGCAACATGGCCATGATTATGATGGACTCGGAAAAACTAACTTCG
TACCTGGGCCTCTAAGGGTGAAGTCCAGCCTTGGCCTCGGGATGGTCGGCCTCGGCCTTGGCATGTGGCCGACCACCTCCTCTGTGTTGTTCGCTTGCTTGTTTTTCATC
CCAAGGGTGCGTGCACCCAGTCAGGGAAAACTTGGGGAGGTCAAGGAAATTACCGAGCCTCTCTTCGCATGTTGCGTTCATCCCAGGGTAAGAAAATATAATATCATACC
TGGTCTTTTGAGGACAAAGTTCATCCTTGAAGCTGAGATAGCCAGCCCCGACCTTGGCATGAGGTCGATCCTCGCTTCTCCATCACCTGTTTGCTCGCCTAATAATTGGA
CTCATGTGCGGATAATTTGCCTGCGAAGTAACTACCGGACACAGTCGAGCCCCATTCCTTGTCTAATGGTCGGCCTCAGCCTTGGCATGAGGACGGCATTTTCCCTGCAT
GACTTGAGAAGTTTGAGGGAAACTTGGGAGGCTTTGGGATTTCCCCAGGCCTCTCTTCAAGCATTATGGTCATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCCGCCGCCGCACTGCCTCTCGCCGGCCGCACCGCCATCGTCACCGGCGCCTCCCGAGGGATCGGCCGCGCCATCGCCGTCCACCTCCATTCCCTCGGCGCAAA
CCTCATCCTCAACTACGCTTCCAACTCCACCCAAGCCGATCTCCTCGCATCGGAGCTCAACCGGTCATCGCCGCCGCTCCGGCGAGCCGTCGCCGTCCAGGCCGACGTGT
CTGACCCCGACCAGGTCAAGCGGCTGTTCGATTCGGCCGACAAGGAGTTCGGATCTGAGATCCACATCCTCGTGAACTCCGCTGGGATTCTCGATTCGAAGTACCCGACT
CTGGCCGGGACCGCCGTGGAAGATTGGGATAGGGCGTTTCAGGTGAATTGCAGAGGCGCGTTTTTGGTCTGTAAAGAGGCGGCGAACCGGCTGAGGTGCGGCGGCGGAGG
GAGGATTATTTTGATTACGACGTCGATCGTTGGGTCTCTGCCGCCGGGGTACGGCGCTTATGCGGCGTCGAAGGCGGCGGTGGAGGCGATGGCGAAGACGGCTGCGAAGG
AGCTGAAGGGGACGGAGATTACGGTTAACTGCGTGGCGCCGGGACCGGTGGCGACGGACCTTTTTTTCGCCGGAAAATCTGAGGAGACGGTGGCGAAGTCGGTGGAGGCA
TGCCCGCTCGGCCGTCTCGGGCAGCCCGACGATGTGGCTAAGGTGGTGGGGTTTTTGGCCACTGATGGCGGCGGGTGGGTAAACGGGCAGGGTTTCGGAGGCATTTTCGG
TCAAAGCAGGCAACCCGGGGCAGATGGAAGCAGTGGGGACCTAACGGCACCAGACGGGCTCGGCCCGCGCGAGCGGGCCGAGGTCGGCCTCGGCCATGGCCGAGGCCGAG
CACGGGGTCGGGCCAAAAACCCGACCCCTTCGGTCTTGGCCCGTCCCACTTACCGGTTTTGCTCCTTGGTCCATCTCTCAGCCCGATTTCTCCTCGGTTGCCCTCGTCAG
CTCCTTGGGTTAGATGAAATGGAGATGAAAAAGGGTGAGAGATTGGCCGGTCGGCCTCGGCCTCGGGAAGAGGTCGAGCACAGCTATCTCCCTCTCTGCTTGCCTGGTCA
GCCTCGGCTCGGAAGAGGCCGAGCAGTTTCTAACCCCCGTACTTGGGAGTGCATATCCCCGGTGACTTGCTTTTTGTTTAATAGGGAGTGCATATCCCCTTTGCAGCCCC
CAATTTCAGGTGGTCGGCCTCGGCTTTGGCATGAGGCCGACCAGCTTCCTTATGCTATTTGCTTGCTCCGTAGTTGGGCTTGTGTGCAGATTCACTTTCTGCTTGAACAT
TTCTTTGTTGCTTTGGATATTTCAGTTGTGCAAGTCGAGCCTCACCGCAAGCCCCCCACAGTCGTTCTTAGGGCTAGAATGCTCGGTTTCGGCCTTAGCGTGAGGCTGAG
CATTTTTCCTACATTGCTCGTTATTCTGCTCATCAATCTGCTGTGTTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGTTGAGGGCATGCGTCAAC
CTCTTTGCAAGAGTTTAATCTCCTTCCAAGGGAAAGTGCAGAAGTTCGGCTTCTACAAGAATCACTCTTGTTTGCCATATGTGCCCATCCTATGGAAAAATGCTTTGCAC
AAACATAGTATAAAGAATTTTATCAAATTCGCAAGTGTCTGCATCCTCCACGCTGTTCGTAATCCTAACTTGAATAAGAAAGGAAAGTCAAAATCATTGCAACATGGCCA
TGATTATGATAGCTCCACCGCTGTTCGTAATCCTAACTTGAATAAGAAAGGAAAGTCAAAATCATTGCAACATGGCCATGATTATGATGGACTCGGAAAAACTAACTTCG
TACCTGGGCCTCTAAGGGTGAAGTCCAGCCTTGGCCTCGGGATGGTCGGCCTCGGCCTTGGCATGTGGCCGACCACCTCCTCTGTGTTGTTCGCTTGCTTGTTTTTCATC
CCAAGGGTGCGTGCACCCAGTCAGGGAAAACTTGGGGAGGTCAAGGAAATTACCGAGCCTCTCTTCGCATGTTGCGTTCATCCCAGGGTAAGAAAATATAATATCATACC
TGGTCTTTTGAGGACAAAGTTCATCCTTGAAGCTGAGATAGCCAGCCCCGACCTTGGCATGAGGTCGATCCTCGCTTCTCCATCACCTGTTTGCTCGCCTAATAATTGGA
CTCATGTGCGGATAATTTGCCTGCGAAGTAACTACCGGACACAGTCGAGCCCCATTCCTTGTCTAATGGTCGGCCTCAGCCTTGGCATGAGGACGGCATTTTCCCTGCAT
GACTTGAGAAGTTTGAGGGAAACTTGGGAGGCTTTGGGATTTCCCCAGGCCTCTCTTCAAGCATTATGGTCATCCTAG
Protein sequenceShow/hide protein sequence
MAAAAALPLAGRTAIVTGASRGIGRAIAVHLHSLGANLILNYASNSTQADLLASELNRSSPPLRRAVAVQADVSDPDQVKRLFDSADKEFGSEIHILVNSAGILDSKYPT
LAGTAVEDWDRAFQVNCRGAFLVCKEAANRLRCGGGGRIILITTSIVGSLPPGYGAYAASKAAVEAMAKTAAKELKGTEITVNCVAPGPVATDLFFAGKSEETVAKSVEA
CPLGRLGQPDDVAKVVGFLATDGGGWVNGQGFGGIFGQSRQPGADGSSGDLTAPDGLGPRERAEVGLGHGRGRARGRAKNPTPSVLARPTYRFCSLVHLSARFLLGCPRQ
LLGLDEMEMKKGERLAGRPRPREEVEHSYLPLCLPGQPRLGRGRAVSNPRTWECISPVTCFLFNRECISPLQPPISGGRPRLWHEADQLPYAICLLRSWACVQIHFLLEH
FFVALDISVVQVEPHRKPPTVVLRARMLGFGLSVRLSIFPTLLVILLINLLCSSQGCVHSSGEKLGEVEGMRQPLCKSLISFQGKVQKFGFYKNHSCLPYVPILWKNALH
KHSIKNFIKFASVCILHAVRNPNLNKKGKSKSLQHGHDYDSSTAVRNPNLNKKGKSKSLQHGHDYDGLGKTNFVPGPLRVKSSLGLGMVGLGLGMWPTTSSVLFACLFFI
PRVRAPSQGKLGEVKEITEPLFACCVHPRVRKYNIIPGLLRTKFILEAEIASPDLGMRSILASPSPVCSPNNWTHVRIICLRSNYRTQSSPIPCLMVGLSLGMRTAFSLH
DLRSLRETWEALGFPQASLQALWSS