Input Features as Output Objectives

This notebook demonstrates how to place objectives on input features or on combinations of input features. Possible use cases are favoring lower or higher amounts of an ingredient, or taking a known (linear) cost function into account. For categorical inputs, it can be used to penalize the optimizer for choosing specific categories.

Imports

import numpy as np

import bofire.strategies.api as strategies
import bofire.surrogates.api as surrogates
from bofire.benchmarks.api import Himmelblau
from bofire.data_models.features.api import CategoricalInput, ContinuousOutput
from bofire.data_models.objectives.api import (
    MaximizeObjective,
    MaximizeSigmoidObjective,
)
from bofire.data_models.strategies.api import MultiplicativeSoboStrategy
from bofire.data_models.surrogates.api import (
    BotorchSurrogates,
    CategoricalDeterministicSurrogate,
    LinearDeterministicSurrogate,
)

Set up an Example

We use Himmelblau as an example, with an additional objective on x_2 that pushes it to be larger than 3 during the optimization. In addition, we introduce a categorical feature called x_cat which is mapped by a CategoricalDeterministicSurrogate to a continuous output called y_cat.

bench = Himmelblau()
experiments = bench.f(bench.domain.inputs.sample(10), return_complete=True)

domain = bench.domain

# setup extra feature `y_x2` that is the same as `x_2` and is taken into account in the optimization by a sigmoid objective
domain.outputs.features.append(
    ContinuousOutput(key="y_x2", objective=MaximizeSigmoidObjective(tp=3, steepness=10))
)
experiments["y_x2"] = experiments.x_2


# add extra categorical input feature and corresponding output feature
domain.inputs.features.append(CategoricalInput(key="x_cat", categories=["a", "b", "c"]))
domain.outputs.features.append(
    ContinuousOutput(key="y_cat", objective=MaximizeObjective())
)

# generate random values for the new categorical feature
experiments["x_cat"] = np.random.choice(["a", "b", "c"], size=experiments.shape[0])
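The MaximizeSigmoidObjective rewards values above the turning point tp. A minimal sketch of the sigmoid desirability it is based on, assuming the common logistic form 1 / (1 + exp(-steepness * (x - tp))) (BoFire's exact parametrization may differ in detail):

```python
import numpy as np


def sigmoid_desirability(x: np.ndarray, tp: float = 3.0, steepness: float = 10.0) -> np.ndarray:
    """Sigmoid reward: close to 0 well below tp, exactly 0.5 at tp, close to 1 well above tp."""
    return 1.0 / (1.0 + np.exp(-steepness * (x - tp)))


# with tp=3 and steepness=10, x_2 values above ~3.5 are rewarded almost fully
print(sigmoid_desirability(np.array([0.0, 3.0, 6.0])))
```

With steepness=10 the transition from 0 to 1 happens in a narrow band around tp=3, which is what effectively pushes x_2 above 3 during optimization.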

The LinearDeterministicSurrogate can be used to model that y_x2 = x_2.

surrogate_data = LinearDeterministicSurrogate(
    inputs=domain.inputs.get_by_keys(["x_2"]),
    outputs=domain.outputs.get_by_keys(["y_x2"]),
    coefficients={"x_2": 1},
    intercept=0,
)
surrogate = surrogates.map(surrogate_data)
surrogate.predict(experiments[domain.inputs.get_keys()].copy())
   y_x2_pred  y_x2_sd
0  -1.138489      0.0
1   1.987826      0.0
2  -3.720015      0.0
3  -0.180767      0.0
4   0.587606      0.0
5  -2.216124      0.0
6  -1.008282      0.0
7  -4.447336      0.0
8   1.088959      0.0
9  -2.562236      0.0
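The deterministic prediction above is just the linear form y = intercept + Σ coefficients · x, with zero predictive standard deviation. A minimal numpy sketch (illustrative only, not the BoFire implementation):

```python
import numpy as np


def linear_predict(X: np.ndarray, coefficients: np.ndarray, intercept: float) -> np.ndarray:
    """Deterministic linear model: y = intercept + X @ coefficients (no predictive uncertainty)."""
    return intercept + X @ coefficients


# with a single coefficient of 1 and intercept 0, the prediction equals the input
x_2 = np.array([-1.138489, 1.987826, -3.720015])
print(linear_predict(x_2.reshape(-1, 1), np.array([1.0]), 0.0))
```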

The CategoricalDeterministicSurrogate can be used to map categories to specific continuous values.

categorical_surrogate_data = CategoricalDeterministicSurrogate(
    inputs=domain.inputs.get_by_keys(["x_cat"]),
    outputs=domain.outputs.get_by_keys(["y_cat"]),
    mapping={"a": 1, "b": 0.2, "c": 0.3},
)

surrogate = surrogates.map(categorical_surrogate_data)

surrogate.predict(experiments[domain.inputs.get_keys()].copy())

experiments["y_cat"] = surrogate.predict(experiments[domain.inputs.get_keys()].copy())[
    "y_cat_pred"
]

experiments
        x_1       x_2           y  valid_y      y_x2 x_cat  y_cat
0  0.896675 -1.138489  151.578920        1 -1.138489     b    0.2
1 -1.865327  1.987826   54.757235        1  1.987826     a    1.0
2  2.546576 -3.720015  155.894551        1 -3.720015     c    0.3
3  5.409597 -0.180767  329.420359        1 -0.180767     a    1.0
4 -2.146421  0.587606  111.161263        1  0.587606     b    0.2
5  4.593425 -2.216124   68.421625        1 -2.216124     c    0.3
6 -1.503030 -1.008282  151.092705        1 -1.008282     a    1.0
7  0.830973 -4.447336  402.989549        1 -4.447336     b    0.2
8  3.283702  1.088959    7.163048        1  1.088959     c    0.3
9 -2.463271 -2.562236   64.567658        1 -2.562236     c    0.3
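Conceptually, the categorical deterministic surrogate is a lookup table from categories to fixed continuous values. A minimal sketch of that behavior (illustrative only, not BoFire internals):

```python
# the same mapping used for the CategoricalDeterministicSurrogate above
mapping = {"a": 1.0, "b": 0.2, "c": 0.3}


def categorical_predict(categories: list[str]) -> list[float]:
    """Look up the deterministic value for each category (zero predictive uncertainty)."""
    return [mapping[c] for c in categories]


print(categorical_predict(["b", "a", "c"]))  # [0.2, 1.0, 0.3]
```

Since y_cat carries a MaximizeObjective, category "a" (mapped to 1.0) is rewarded most and "b" (mapped to 0.2) least.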

Next we set up a SoboStrategy that uses the custom surrogates for the outputs y_x2 and y_cat and ask for candidates. Note that the surrogate spec for output y is generated automatically and defaults to a SingleTaskGPSurrogate.

strategy_data = MultiplicativeSoboStrategy(
    domain=domain,
    surrogate_specs=BotorchSurrogates(
        surrogates=[surrogate_data, categorical_surrogate_data]
    ),
)
strategy = strategies.map(strategy_data)
strategy.tell(experiments)
strategy.ask(4)
        x_1       x_2 x_cat      y_pred  y_cat_pred  y_x2_pred        y_sd  y_cat_sd  y_x2_sd       y_des  y_x2_des  y_cat_des
0 -0.871188  3.386148     c -124.009135         0.3   3.386148   63.756700       0.0      0.0  124.009135  0.979397        0.3
1 -4.069133  3.562314     c  -55.534698         0.3   3.562314   94.468448       0.0      0.0   55.534698  0.996400        0.3
2  0.470655  3.459160     c -104.181274         0.3   3.459160   77.581543       0.0      0.0  104.181274  0.989965        0.3
3 -5.656718  5.351850     a  204.323964         1.0   5.351850  131.981789       0.0      0.0 -204.323964  1.000000        1.0
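The MultiplicativeSoboStrategy combines the per-output desirabilities into a single scalar objective by taking their product. A sketch of that scalarization using the desirability columns of the first proposed candidate above (illustrative arithmetic, not the strategy's internal acquisition code):

```python
# per-output desirabilities of the first candidate (values taken from the table above)
y_des, y_x2_des, y_cat_des = 124.009135, 0.979397, 0.3

# multiplicative scalarization: every objective must score well for a large overall value,
# so a single near-zero desirability pulls the whole product toward zero
overall = y_des * y_x2_des * y_cat_des
print(overall)
```

This explains why candidates with x_2 well above 3 (y_x2_des close to 1) are favored: any candidate with x_2 below the turning point would have a y_x2_des near zero and thus a near-zero overall objective.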