Beyond Trial-and-Error: Optimizing Catalyst Selection with Expected Improvement in Bayesian Optimization

Jaxon Cox Feb 02, 2026 580

This article provides a comprehensive guide for researchers and drug development professionals on applying the Expected Improvement (EI) acquisition function to accelerate the discovery and optimization of catalytic systems.

Beyond Trial-and-Error: Optimizing Catalyst Selection with Expected Improvement in Bayesian Optimization

Abstract

This article provides a comprehensive guide for researchers and drug development professionals on applying the Expected Improvement (EI) acquisition function to accelerate the discovery and optimization of catalytic systems. We explore the foundational mathematics of EI, detail its methodological implementation for high-throughput catalyst screening, address common pitfalls in real-world deployment, and validate its performance against other acquisition strategies. The full scope covers how EI intelligently balances exploration and exploitation to efficiently navigate vast chemical spaces, ultimately reducing experimental cost and time in developing novel catalysts for pharmaceutical synthesis and other applications.

Expected Improvement 101: A Primer for Materials Scientists on Bayesian Search

Application Notes

Within the broader thesis on acquisition functions for expected improvement in catalyst composition selection, these notes address the application of Bayesian optimization (BO) for high-throughput experimentation (HTE) in heterogeneous catalyst discovery. The primary challenge is the astronomical size of the compositional space when considering multi-metallic nanoparticles (e.g., quinary alloys) on diverse supports with variable promoters.

Key Application: Accelerating the discovery of novel bimetallic and trimetallic catalysts for the electrochemical oxygen reduction reaction (ORR), a critical process for fuel cells.

Quantitative Performance Data: Table 1: Comparison of Acquisition Functions for Catalyst Optimization

Acquisition Function	Iterations to 90% Peak Activity	Avg. Improvement per Cycle (mA/cm²)	Exploitation vs. Exploration Balance
Expected Improvement (EI)	14	1.23	Balanced
Probability of Improvement (PI)	22	0.87	High Exploitation
Upper Confidence Bound (UCB)	18	1.05	High Exploration (tunable)
Random Sampling	45+	0.45	None

Table 2: Top Catalyst Compositions Identified via BO-EI for ORR

Catalyst Composition (Pt:X:Y)	Support	Mass Activity @ 0.9V (A/mgₚₜ)	Stability (% activity retained)
Pt₃Co	Carbon	0.56	78%
Pt₃Ni	Nitrogen-doped Carbon	0.71	65%
Pt₅₈Cu₁₅Ni₂₇	Carbon	0.82	72%
Pt₇₅Pd₁₅Fe₁₀	Carbon	0.48	92%

Experimental Protocols

Protocol 1: High-Throughput Synthesis of Alloy Catalyst Libraries via Incipient Wetness Impregnation Objective: To prepare a spatially addressed library of bimetallic catalysts on a multi-well substrate.

Substrate Preparation: Load a 96-well ceramic plate with a pre-weighed mass (e.g., 10 mg) of high-surface-area carbon support in each well.
Precursor Solution Preparation: Calculate the total metal loading (e.g., 2 wt%). Prepare stock solutions of hexachloroplatinic acid (H₂PtCl₆), cobalt nitrate (Co(NO₃)₂), nickel chloride (NiCl₂), etc., in dilute hydrochloric acid (0.1M).
Automated Dispensing: Use a liquid handling robot to dispense precise volumetric mixtures of the precursor stock solutions into each well to achieve the desired compositional gradients (e.g., Pt₁₀₀₋ₓCoₓ, where x varies from 0 to 100 in 5% increments).
Drying & Reduction: Dry the plate at 80°C for 2 hours, then transfer to a tubular furnace. Reduce the catalysts under a 5% H₂/Ar flow at 300°C for 3 hours with a ramp rate of 5°C/min.
Passivation: Cool to room temperature under inert Ar and expose to a 1% O₂/Ar flow for 1 hour to passivate surfaces.

Protocol 2: Parallel Electrochemical Screening for ORR Activity Objective: To measure the electrochemical activity of catalyst libraries in parallel.

Ink Formulation: To each well, add 1 mL of a solution containing 0.5% Nafion and isopropanol. Sonicate the entire plate for 30 minutes to form homogeneous inks.
Working Electrode Preparation: Using a microarrayer, spot 2 µL of each catalyst ink onto a polished glassy carbon electrode array (16-well format).
Electrochemical Cell Setup: Employ a multi-channel potentiostat. Use a common Pt mesh counter electrode and a common reversible hydrogen electrode (RHE) in 0.1M HClO₄ electrolyte saturated with O₂.
Activity Measurement: For each channel, perform cyclic voltammetry (CV) from 0.05 to 1.0 V vs. RHE at 50 mV/s to clean the surface. Then, perform linear sweep voltammetry (LSV) from 1.0 to 0.05 V vs. RHE at 10 mV/s and 1600 RPM. Record the kinetic current at 0.9 V vs. RHE.
Data Processing: Normalize kinetic currents to the mass of precious metal (Pt) loaded to calculate mass activity (A/mgₚₜ).

Mandatory Visualization

Diagram 1: Bayesian Optimization Loop for Catalyst Discovery

Diagram 2: 4e⁻ Oxygen Reduction Reaction (ORR) Pathway

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for High-Throughput Catalyst Discovery

Item/Reagent	Function & Application Notes
Multi-Well Ceramic/Glass Plates	Inert substrate for parallel synthesis of catalyst libraries; enables high-temperature treatments.
Liquid Handling Robot (e.g., Positive Displacement)	Enables precise, reproducible dispensing of precursor solutions for combinatorial synthesis.
Metal Salt Precursors (e.g., H₂PtCl₆, Ni(NO₃)₂)	Source of active metal components. Must be high-purity and soluble for accurate formulation.
High-Surface-Area Carbon Supports (e.g., Vulcan XC-72)	Conductive support material to maximize catalyst dispersion and electronic conductivity.
Multi-Channel Potentiostat/Galvanostat	Allows simultaneous electrochemical characterization of multiple catalyst samples.
Glassy Carbon Electrode (GCE) Arrays	Provides standardized, reusable substrates for drop-casting catalyst inks for screening.
Rotating Disk Electrode (RDE) Setups	Controls mass transport of O₂ to the catalyst surface, allowing measurement of intrinsic activity.
Nafion Perfluorinated Resin Solution	Binder for catalyst inks; provides proton conductivity and adhesion to the electrode.
High-Purity Gases (O₂, N₂, H₂/Ar mix)	For electrolyte saturation (O₂), inert atmospheres (N₂/Ar), and catalyst reduction (H₂/Ar).

This Application Note details the methodology of Bayesian Optimization (BO) as applied to the research thesis: "Advancing Acquisition Functions for Expected Improvement in Catalyst Composition Selection for Drug Development." The selection of optimal heterogeneous catalyst compositions for key pharmaceutical synthesis steps is a high-dimensional, expensive, and data-scarce challenge. BO provides a principled framework to navigate this complex design space efficiently, minimizing the number of required experimental trials by iteratively suggesting the most promising compositions based on probabilistic models and strategic acquisition functions.

Foundational Concepts: Gaussian Processes (GPs)

A Gaussian Process is a non-parametric probabilistic model used as a surrogate for the unknown objective function (e.g., catalyst yield or selectivity). It defines a distribution over functions and is fully specified by a mean function m(x) and a covariance (kernel) function k(x, x').

Key Kernel Functions:

Kernel Name	Mathematical Form	Hyperparameters	Best For
Radial Basis (RBF)	$k(xi, xj) = \sigmaf^2 \exp(-\frac{1}{2l^2} \|xi - x_j\|^2)$	Length-scale (l), Signal variance ($\sigma_f^2$)	Smooth, continuous functions.
Matérn 5/2	$k(xi, xj) = \sigma_f^2 (1 + \frac{\sqrt{5}r}{l} + \frac{5r^2}{3l^2}) \exp(-\frac{\sqrt{5}r}{l})$	Length-scale (l), Signal variance ($\sigma_f^2$)	Less smooth than RBF, accommodates noise.
Constant	$k(xi, xj) = \sigma_c^2$	Constant ($\sigma_c^2$)	Capturing a constant bias.

Where $r = \|x_i - x_j\|$

GP Prior to Posterior Update Workflow:

Title: GP Posterior Formation from Prior and Data

Core Component: Acquisition Functions

Acquisition functions balance exploration and exploitation to propose the next experiment. They use the GP posterior (mean $\mu(x)$ and variance $\sigma^2(x)$) to quantify the utility of evaluating a candidate point.

Quantitative Comparison of Common Acquisition Functions:

Function	Formula	Key Characteristic	Theta Parameter
Probability of Improvement (PI)	$\alpha_{PI}(x) = \Phi(\frac{\mu(x) - f(x^+) - \xi}{\sigma(x)})$	Exploitative; seeks immediate gain.	$\xi$ (jitter)
Expected Improvement (EI)	$\alpha_{EI}(x) = (\mu(x) - f(x^+) - \xi)\Phi(Z) + \sigma(x)\phi(Z)$ where $Z = \frac{\mu(x) - f(x^+) - \xi}{\sigma(x)}$	Balances exploration/exploitation.	$\xi$ (jitter)
Upper Confidence Bound (UCB)	$\alpha_{UCB}(x) = \mu(x) + \kappa \sigma(x)$	Explicit balance parameter.	$\kappa$
Predictive Entropy Search	Complex, based on information gain.	Information-theoretic; global search.	--

Where $\Phi$ is CDF, $\phi$ is PDF of std. normal, $f(x^+)$ is best observation, $\xi, \kappa$ are tunable.

Acquisition Function Decision Logic:

Title: Selecting Next Experiment via Acquisition Function Maximization

Detailed Experimental Protocol: BO for Catalyst Selection

Protocol 1: High-Throughput Initialization and Iterative BO Loop

Objective: To identify a catalyst composition (e.g., Pd-Au-Ce/ZrO2 ratios, dopant level) maximizing yield for a Suzuki-Miyaura coupling relevant to API synthesis.

Materials & Reagents: See "The Scientist's Toolkit" below.

Procedure:

Design Space Definition: Define bounds for each compositional element (e.g., 0-5 wt% Pd, 0-3 wt% Au, 0-10 wt% Ce, balance ZrO2). Include process variables (calcination temperature: 300-600°C).
Initial Design: Perform a space-filling design (e.g., Latin Hypercube Sampling) for n=10 initial catalyst formulations. Synthesize and test these catalysts in the target reaction (see Protocol 2).
BO Loop: a. Surrogate Modeling: Construct a GP model using the accumulated data (composition -> yield). Use a Matérn 5/2 kernel. b. Acquisition Optimization: Maximize the Expected Improvement (EI) acquisition function across the defined compositional space using a global optimizer (e.g., L-BFGS-B or DIRECT). c. Candidate Selection: The point maximizing EI is selected as the next catalyst composition to test. d. Experimental Evaluation: Synthesize and test the proposed catalyst (Protocol 2). e. Data Augmentation: Add the new (composition, yield) data pair to the dataset. f. Iteration: Repeat steps a-e for a predetermined budget (e.g., 30 total experiments) or until a performance threshold is met.

Protocol 2: Standardized Catalyst Synthesis & Testing (Key Cited Experiment)

Objective: To evaluate the performance of a single catalyst composition proposed by the BO loop.

Procedure:

Wet Impregnation Synthesis:
- Calculate required volumes of precursor solutions (e.g., Pd(NO3)2, HAuCl4, Ce(NO3)3) to achieve target loadings on ZrO2 support.
- Add the support to the mixed precursor solution. Stir for 2 hours at room temperature.
- Dry the slurry overnight at 120°C.
- Calcine the dried powder in a muffle furnace at the specified temperature (from design space) for 4 hours in static air.
Catalytic Testing (Suzuki-Miyaura Coupling):
- Charge a parallel reaction vessel with aryl halide (1.0 mmol), phenylboronic acid (1.5 mmol), base (K2CO3, 2.0 mmol), and catalyst (50 mg).
- Add solvent (water:ethanol 3:1, 5 mL).
- Heat the reaction block to 80°C with stirring for 2 hours.
- Cool, filter to remove catalyst, and analyze reaction mixture via quantitative HPLC against a calibrated standard curve to determine yield.

The Scientist's Toolkit: Research Reagent Solutions

Item/Category	Function in Catalyst BO Research	Example Product/Specification
Metal Precursors	Source of active catalytic components for precise impregnation.	Pd(NO3)2•xH2O (99.9%), HAuCl4•3H2O (ACS grade), Ce(NO3)3•6H2O (99%).
High-Surface Area Support	Provides a stable, dispersive matrix for active metals.	ZrO2 powder, BET surface area >80 m²/g, pore volume >0.3 cm³/g.
High-Throughput Reactor	Enables parallel synthesis or testing of multiple catalyst candidates.	16-parallel glass reactor block with individual temperature control.
Quantitative HPLC	Essential for accurate, high-throughput yield determination of reaction products.	System with C18 column, PDA detector, and autosampler.
BO Software Library	Implements GP regression and acquisition function optimization.	Python libraries: `scikit-optimize`, `BoTorch`, or `GPyOpt`.

Advanced Considerations for Drug Development

In pharmaceutical applications, BO can be extended to multi-objective optimization (e.g., maximizing yield while minimizing costly metal loading or impurity formation). Adaptive acquisition functions, which dynamically adjust their balance parameter (e.g., $\kappa$ in UCB) based on iteration progress, are a key focus of the broader thesis. This aims to accelerate the discovery of sustainable, cost-effective catalysts for green pharmaceutical manufacturing.

Within the broader thesis on acquisition function-driven catalyst composition selection for drug development, Expected Improvement (EI) serves as a critical Bayesian optimization component. It formalizes the search for optimal catalyst formulations by balancing exploration of uncertain regions and exploitation of known high-performance areas. This protocol details its mathematical formulation, application workflow, and implementation for high-throughput experimentation.

Core Mathematical Formulation

The Expected Improvement acquisition function quantifies the potential gain over the current best-observed function value, ( f^* ), at a candidate point ( \mathbf{x} ), given a Gaussian process (GP) surrogate model providing a predictive mean ( \mu(\mathbf{x}) ) and standard deviation ( \sigma(\mathbf{x}) ).

The improvement is defined as: [ I(\mathbf{x}) = \max(0, f(\mathbf{x}) - f^) ] Since ( f(\mathbf{x}) ) is modeled as a Gaussian distribution ( \mathcal{N}(\mu(\mathbf{x}), \sigma^2(\mathbf{x})) ), the *expected value of this improvement is: [ EI(\mathbf{x}) = \mathbb{E}[I(\mathbf{x})] = \begin{cases} (\mu(\mathbf{x}) - f^)\Phi(Z) + \sigma(\mathbf{x})\phi(Z) & \text{if } \sigma(\mathbf{x}) > 0 \ 0 & \text{if } \sigma(\mathbf{x}) = 0 \end{cases} ] where: [ Z = \frac{\mu(\mathbf{x}) - f^}{\sigma(\mathbf{x})} ] Here, ( \Phi(\cdot) ) and ( \phi(\cdot) ) are the cumulative distribution function (CDF) and probability density function (PDF) of the standard normal distribution, respectively.

Table 1: EI Equation Components and Interpretation

Symbol	Term	Role in Catalyst Selection
( \mu(\mathbf{x}) )	Predictive Mean	Estimated performance (e.g., yield, selectivity) of catalyst composition ( \mathbf{x} ).
( \sigma(\mathbf{x}) )	Predictive Uncertainty	Uncertainty in the performance estimate at ( \mathbf{x} ).
( f^* )	Incumbent Best	Best currently observed performance from prior experiments.
( Z )	Standardized Improvement	Measures how many standard deviations the mean is above ( f^* ).
( \Phi(Z) )	CDF term	Exploitation weight: probability of improvement.
( \sigma(\mathbf{x})\phi(Z) )	PDF term	Exploration weight: rewards high uncertainty.

EI-Driven Catalyst Selection Workflow

A standardized protocol for applying EI in high-throughput catalyst screening.

Protocol 3.1: Iterative Optimization Cycle Using EI Objective: Identify the catalyst composition maximizing reaction yield within a defined chemical space. Materials: High-throughput robotic synthesis platform, parallel pressure reactors, GC-MS/HPLC for analysis, computational server for GP modeling. Procedure:

Initial Design: Perform a space-filling design (e.g., Latin Hypercube) of 20-30 catalyst compositions varying metal ratios, ligand structures, and support types. Synthesize and test.
Surrogate Modeling: Fit a Gaussian Process model with a Matérn kernel to the experimental data (yield vs. composition descriptors).
EI Calculation & Optimization: a. Compute ( f^* ), the maximum yield observed so far. b. For many candidate points ( \mathbf{x} ) in the composition space, calculate ( \mu(\mathbf{x}) ) and ( \sigma(\mathbf{x}) ) from the GP. c. Compute ( EI(\mathbf{x}) ) using the formula in Section 2. d. Identify the candidate point ( \mathbf{x}{next} = \arg\max{\mathbf{x}} EI(\mathbf{x}) ).
Experimental Validation: Synthesize and test the top 3-5 proposals from Step 3d.
Update & Iterate: Append new results to the dataset. Return to Step 2 until performance plateaus or budget is exhausted. Validation: Compare final optimized catalyst performance against a baseline identified through traditional one-variable-at-a-time (OVAT) screening.

Diagram 1: EI-Driven Catalyst Optimization Loop

Comparative Analysis of Acquisition Functions

Table 2: Quantitative Comparison of Key Acquisition Functions

Function	Formula	Exploration vs. Exploitation	Typical Performance in Catalyst Search
Expected Improvement (EI)	( (\mu - f^*)\Phi(Z) + \sigma\phi(Z) )	Balanced adaptive trade-off.	Consistently high; finds global optimum efficiently.
Upper Confidence Bound (UCB)	( \mu(\mathbf{x}) + \kappa \sigma(\mathbf{x}) )	Explicitly tuned by ( \kappa ).	Good but sensitive to ( \kappa ) choice; can over-explore.
Probability of Improvement (PI)	( \Phi(Z) )	Strong exploitation bias.	Often gets stuck in local optima; faster initial gains.
Thompson Sampling	Sample ( f(\mathbf{x}) \sim \mathcal{N}(\mu(\mathbf{x}), \sigma^2(\mathbf{x})) ), maximize sample.	Stochastic, inherent balance.	Very effective in practice; requires sampling.

Performance data synthesized from benchmark studies in materials informatics (2023-2024).

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for EI-Guided Catalyst Experimentation

Item / Reagent	Function in Protocol
Precursor Salt Libraries (e.g., metal acetates, nitrates)	Provides modular building blocks for high-throughput synthesis of varied catalyst compositions.
Ligand Arrays (e.g., phosphine, amine, carbene libraries)	Systematically modulates electronic and steric properties of the catalytic center.
Porous Support Particles (e.g., Al2O3, SiO2, C, MOFs)	Standardized supports for immobilizing active components, testing dispersion effects.
Internal Standard Kits (for GC-MS/HPLC)	Enables accurate, reproducible quantification of reaction yield and selectivity in parallel.
Bayesian Optimization Software (e.g., BoTorch, GPyOpt, scikit-optimize)	Provides computational backend for GP modeling and EI calculation/optimization.
HTE Reactor Blocks (e.g., 48- or 96-well plates with pressure/temperature control)	Enables parallel synthesis and testing under consistent, automated conditions.

Diagram 2: EI Calculation Logical Flow

Why EI for Catalysis? Addressing the Exploration-Exploitation Dilemma

In catalyst composition research, the Exploration-Exploitation Dilemma is central: should one explore new, uncertain regions of the compositional space or exploit known high-performing regions? Expected Improvement (EI), a prominent Bayesian optimization acquisition function, provides a mathematically principled balance. This Application Note details protocols for employing EI to accelerate the discovery of novel heterogeneous catalysts, framed within a thesis on advanced acquisition functions for materials selection.

Table 1: Comparison of Key Acquisition Functions for Catalyst Search

Acquisition Function	Primary Objective	Risk Preference	Best for Phase
Expected Improvement (EI)	Maximizes probability of improvement over best-known target	Balanced	General-purpose optimization
Probability of Improvement (PI)	Maximizes chance of improvement, regardless of magnitude	Risk-seeking	Early exploration
Upper Confidence Bound (UCB)	Explores regions of high uncertainty	Tunable (via κ parameter)	Systematic exploration
Entropy Search (ES)	Maximizes information gain about optimum	Information-driven	Global mapping

Table 2: Illustrative EI Performance Metrics in Catalysis Studies

Study Focus (Catalyst System)	Search Dimension	Initial Data Points	EI-Guided Experiments to Find Optimum	Performance Gain Over Baseline
Pt-Pd-Au Ternary Nanoparticles	3 (compositions)	20	15	2.1x activity
Mixed Metal Oxide (5 elements)	5	30	22	3.4x selectivity
Zeolite-supported Co/Mo	4 (Co/Mo ratio, temp, pressure)	15	18	1.8x yield

Experimental Protocols

Protocol 1: Setting Up the Bayesian Optimization Loop for Catalyst Screening

Objective: To iteratively select catalyst compositions for testing using an EI-driven workflow.

Materials & Computational Setup:

High-throughput catalyst synthesis platform (e.g., automated liquid handler, sputter system).
Characterization suite (e.g., XRD, XPS, automated reaction screening).
Bayesian Optimization software (e.g., GPyTorch, Scikit-optimize, Ax Platform).
Defined compositional search space (ranges for each element or synthesis parameter).

Procedure:

Initial Design: Perform a space-filling initial design (e.g., Latin Hypercube Sampling) to synthesize and test N initial catalyst candidates (typically N = 10-30).
Model Training: After each batch of experiments, train a Gaussian Process (GP) surrogate model. The model uses compositional features as input and maps them to the target performance metric (e.g., turnover frequency, yield).
EI Calculation: Compute the EI acquisition function across a dense grid of candidate compositions. EI is defined as: EI(x) = E[max(f(x) - f(x), 0)]* where f(x) is the predicted performance at point x, and f(x)* is the current best observed performance.
Candidate Selection: Select the next batch of compositions where EI is maximized.
Iteration: Synthesize, test, and characterize the selected candidates. Append the new data to the training set.
Termination: Repeat steps 2-5 until a performance threshold is met or the experimental budget is exhausted.

Protocol 2: High-Throughput Synthesis & Screening for Validation

Objective: To experimentally validate the top candidate catalysts identified by the EI-guided search.

Synthesis Workflow (for supported metal catalysts):

Precursor Deposition: Using an automated dispenser, deposit aqueous metal precursor solutions onto a high-surface-area support (e.g., Al2O3, SiO2) arrayed in a well plate.
Drying & Calcination: Dry the plate at 120°C for 2 hours, followed by calcination in a muffle furnace under static air at 500°C for 4 hours.
Reduction: Activate the catalysts in a parallel flow reactor under H2/N2 (5%/95%) at 300°C for 2 hours.

Performance Testing:

Transfer catalyst samples to a parallel, fixed-bed microreactor system.
Conduct catalytic testing (e.g., CO oxidation at 250°C, 1 atm) with online GC analysis.
Record key metrics: Conversion (%), Selectivity (%), and Turnover Frequency (TOF, s⁻¹).

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for EI-Guided Catalyst Discovery

Item	Function in Workflow	Example/Supplier Note
Multi-Element Metal Precursors	Enables precise composition control in high-throughput synthesis.	e.g., Tetraamminepalladium(II) nitrate, Chloroplatinic acid, Gold(III) chloride.
High-Throughput Support Wafers	Provides uniform, arrayed substrate for catalyst library.	e.g., Alumina-coated quartz wafers (5mm x 5mm wells).
Automated Liquid Handling System	Ensures reproducible, micro-scale dispensing of precursor solutions.	e.g., Hamilton Microlab STAR.
Parallel Flow Reactor System	Allows simultaneous activity/selectivity testing of multiple catalysts.	e.g., Symyx Technologies / Freeslate screening tools.
Gaussian Process Modeling Software	Core engine for building surrogate models and calculating EI.	e.g., GPyTorch (Python library).
Bayesian Optimization Platform	Integrates modeling, acquisition function, and experiment management.	e.g., Meta's Ax Platform.

Visualizations

Diagram Title: EI-Guided Catalyst Discovery Workflow

Diagram Title: EI Balances Exploration and Exploitation

Application Notes

In the context of optimizing acquisition functions for Expected Improvement (EI) in catalyst composition selection for drug development, three key concepts form the computational backbone. These are integral to Bayesian optimization (BO) frameworks used to efficiently navigate high-dimensional composition spaces, minimizing expensive experimental cycles.

Surrogate Model: A probabilistic, computationally inexpensive model that approximates the relationship between catalyst composition variables (e.g., ratios of metals, ligands, dopants) and the target performance metric (e.g., reaction yield, enantiomeric excess). In BO for catalyst research, a Gaussian Process (GP) is the standard surrogate, as it provides uncertainty estimates alongside predictions.
Posterior Distribution: The updated probabilistic belief about the objective function after incorporating observed experimental data. For a GP surrogate, the posterior at a new composition point is a full probability distribution (characterized by a mean and variance), quantifying both the predicted performance and the prediction uncertainty.
The Incumbent: The best-observed catalyst composition found so far in the optimization process, based on its evaluated performance metric. In the EI acquisition function, the incumbent (often denoted as ( f^* ) or ( f_{best} )) serves as the benchmark for calculating potential improvement.

The synergy is as follows: A surrogate model (GP), conditioned on all experimental data, provides a posterior distribution over the entire search space. An acquisition function (EI) uses this posterior and the current incumbent value to quantify the utility of evaluating any untested composition. EI is mathematically defined as the expected value of improvement ( I(x) = \max(0, f(x) - f^) ) under the posterior distribution, where ( f^ ) is the incumbent.

Data Presentation

Table 1: Performance Comparison of Surrogate Models in Simulated Catalyst Optimization

Surrogate Model Type	Average Regret after 50 Iterations (Lower is Better)	Mean Prediction Time (ms)	Handles High-Dim (>10) Compositions?	Key Advantage for Catalyst Screening
Gaussian Process (RBF Kernel)	0.12 ± 0.03	245	Moderate	Excellent uncertainty quantification
Random Forest	0.18 ± 0.05	45	Yes	Handles discrete/categorical variables well
Bayesian Neural Network	0.15 ± 0.04	120	Yes	Scalability to very high dimensions
Sparse Gaussian Process	0.14 ± 0.04	85	Moderate	Reduced compute for large datasets

Table 2: Impact of Incumbent Selection Strategy on EI Performance

Selection Strategy	Description	Convergence Rate (Iterations to 95% Optimum)	Robustness to Noisy Experimental Data
Best Observed	Simple max/min of evaluated samples	22	Low (overfits to outliers)
Posterior Mean Maximizer	Point with highest posterior mean	25	Medium
Penalized Best (Recommended)	Best observed, penalized by its posterior uncertainty	19	High

Experimental Protocols

Protocol 1: Establishing the Gaussian Process Surrogate for Catalyst Composition Space

Design Initial Library: Use space-filling design (e.g., Sobol sequence) to select 5-10 initial catalyst compositions (e.g., varying Pd:Pt ratio, ligand loading, solvent dielectric constant).
High-Throughput Experimentation: Synthesize and test initial library in parallel via automated flow/plate reactors. Record primary performance metric (e.g., turnover number).
Data Preprocessing: Normalize all compositional variables to [0,1] range. Apply log-transform to the performance metric if variance is non-stationary.
Model Training: Optimize GP hyperparameters (length scales, noise) by maximizing the log marginal likelihood on the initial data.
Model Validation: Use leave-one-out cross-validation. Calculate standardized mean squared error (SMSE); a value ~1.0 indicates a well-calibrated surrogate.

Protocol 2: Iterative Optimization Loop using Expected Improvement

Identify Incumbent: From the current dataset (D{1:t}), select the composition with the best observed performance as the incumbent (f^*t). Apply penalty if using noisy data (see Table 2).
Compute Posterior: Using the trained GP surrogate, compute the posterior mean ( \mut(x) ) and variance ( \sigma^2t(x) ) for all candidate compositions in a discretized search space.
Calculate EI: For each candidate (x), compute (EIt(x) = \mathbb{E}[max(0, f(x) - f^*t)] ). Under the GP posterior, this has the closed form: (EIt(x) = (\mut(x) - f^_t - \xi)\Phi(Z) + \sigma_t(x)\phi(Z) ), where (Z = \frac{\mu_t(x) - f^t - \xi}{\sigmat(x)}), and ( \xi ) is a small exploration parameter (e.g., 0.01).
Select Next Experiment: Choose the composition (x{t+1} = \arg\max EIt(x)).
Execute Experiment & Update: Synthesize and test (x{t+1}), record result (y{t+1}). Augment dataset: (D{1:t+1} = D{1:t} \cup {(x{t+1}, y{t+1})}).
Iterate: Re-train the GP surrogate on (D_{1:t+1}). Repeat from Step 1 until performance improvement plateaus or budget is exhausted.

Mandatory Visualization

Bayesian Optimization Workflow for Catalysis

EI Calculation from Posterior and Incumbent

The Scientist's Toolkit

Table 3: Key Research Reagent Solutions for High-Throughput Catalyst Optimization

Item / Reagent	Function in Protocol	Key Consideration for BO
Pre-catalyst Libraries (e.g., metal salt mixtures, ligand sets)	Provides the variable compositional space for the surrogate model to explore.	Ensure broad, well-defined chemical space coverage for initial design.
Automated Liquid Handling/Synthesis Robot (e.g., Chemspeed, Unchained Labs)	Enables precise, reproducible preparation of catalyst compositions from digital designs generated by the EI algorithm.	Integration with lab informatics system for direct data transfer to the model is critical.
High-Throughput Screening Reactor (e.g., plate-based parallel reactors, flow microreactors)	Generates the performance data (yield, selectivity) required to update the posterior distribution.	Data quality (noise level) must be characterized as it directly impacts GP hyperparameter training.
GPy/BOTorch/Scikit-learn Software	Provides the computational implementation for building the Gaussian Process surrogate, calculating the posterior, and optimizing the EI acquisition function.	Choice of kernel (e.g., Matern 5/2 for continuous variables) and optimizer significantly affects performance.
Lab Information Management System (LIMS)	Acts as the central data hub, linking experimental composition variables (inputs) with analytical results (outputs) for model training.	Must maintain strict metadata association for accurate model interpretation.

A Step-by-Step Guide: Implementing EI for High-Throughput Catalyst Screening

This document details an integrated workflow architecture designed to accelerate the discovery of heterogeneous catalysts. The protocols are framed within a broader thesis on using Expected Improvement (EI)—a core Bayesian optimization acquisition function—to guide the selection of catalyst compositions. The workflow synergistically combines autonomous robotic experimentation for synthesis and testing with high-throughput Density Functional Theory (DFT) calculations to provide atomic-scale insights. This closed-loop system iteratively proposes optimal experiments, minimizing the number of trials required to identify high-performance catalysts.

Application Notes: Integrated Workflow Architecture

The architecture is a data-centric pipeline where each module feeds information to the next, creating a cycle of hypothesis, experimentation, and learning.

EI as the Decision Engine: The Expected Improvement acquisition function balances exploration of uncertain regions of the composition space with exploitation of known high-performance areas. It quantifies the potential utility of testing a new candidate, mathematically expressed as: EI(x) = E[max(0, f(x) - f(x*))] where f(x) is the predicted performance of candidate x, and f(x*) is the current best observed performance.
Role of Robotic Experimentation: Automated platforms execute the physical synthesis (e.g., via inkjet printing, spin coating) and characterization (e.g., catalytic activity screening via mass spectrometry) of the candidates proposed by the EI algorithm. This generates rapid, reproducible, and quantitative experimental data.
Role of DFT Calculations: Parallel to experimentation, DFT calculations model the electronic structure and surface adsorption energies for proposed or synthesized compositions. This provides explanatory power and identifies descriptors (e.g., d-band center, oxygen vacancy formation energy) that can be fed back into the machine learning model to improve its predictive accuracy.
Closed-Loop Integration: The key innovation is the feedback of both experimental and computational results into a unified database. A machine learning model (e.g., Gaussian Process) is trained on this combined dataset. The EI function then queries this model to propose the next most informative set of compositions for both robotic synthesis and DFT investigation.

Core Protocols

Protocol 3.1: Expected Improvement-Driven Candidate Proposal

Objective: To select the next batch of catalyst compositions for experimental testing using Bayesian optimization. Materials: Computing workstation, Python environment with libraries (scikit-optimize, GPyTorch, numpy). Procedure:

Initialize: Start with a small, space-filling initial dataset (D) of n compositions (e.g., 10-20) and their measured performance metrics (e.g., turnover frequency, yield).
Train Model: Fit a Gaussian Process (GP) surrogate model to dataset D, specifying a kernel (e.g., Matérn 5/2) appropriate for compositional data.
Calculate EI: For a large set of candidate compositions in the search space (e.g., 10,000 random points), compute the Expected Improvement at each point using the trained GP model and the current best performance f(x*).
Select & Output: Identify the candidate composition x that maximizes EI(x). Output this composition, along with a user-defined number of next-best candidates, for the robotic experimentation queue.
Update: After experimental results are obtained, add the new (x, f(x)) pair to dataset D and repeat from Step 2.

Protocol 3.2: High-Throughput Robotic Synthesis & Screening

Objective: To autonomously synthesize and test solid-state catalyst libraries. Materials: Automated liquid handler or inkjet printer, multi-well substrate (e.g., alumina wafer), precursor solutions, robotic arm, integrated gas chromatograph/mass spectrometer (GC-MS) flow reactor. Procedure:

Substrate Preparation: Load substrate into the robotic platform. Execute a standard cleaning protocol (e.g., UV-ozone treatment).
Precision Dispensing: Translate the digital composition list from Protocol 3.1 into dispensing commands. Use the liquid handler to mix and deposit precursor solutions onto designated locations on the substrate.
Automated Processing: Transfer the substrate to integrated furnaces for calcination and reduction under programmed temperature ramps and gas flows (e.g., 400°C in air, then 500°C in H₂/Ar).
Activity Screening: The robotic arm sequentially positions each catalyst spot under the inlet of a packed-bed microreactor connected to GC-MS. Measure catalytic performance (e.g., CO₂ conversion for methanation) under standardized conditions (e.g., 300°C, 1 bar, CO₂:H₂ = 1:4).
Data Logging: Automatically record performance metrics (conversion, selectivity, rate) and associate them with the precise composition and synthesis parameters in the master database.

Protocol 3.3: High-Throughput DFT Workflow for Descriptor Calculation

Objective: To compute electronic structure descriptors for candidate compositions. Materials: High-performance computing cluster, DFT software (VASP, Quantum ESPRESSO), workflow manager (Fireworks, AiiDA). Procedure:

Structure Generation: For each proposed composition, generate likely surface slab models (e.g., (111) facet of a ternary alloy).
Job Submission: Launch DFT calculations using a standardized input set: PBE functional, plane-wave basis set with defined cutoff, PAW pseudopotentials, and k-point mesh. First perform geometry relaxation.
Property Calculation: On relaxed structures, run single-point calculations to extract the density of states (DOS). Compute key descriptors: a. d-band center: Calculate as the first moment of the projected d-band DOS. b. Adsorption Energy (E_ads): For key intermediates (e.g., *CO, *O), calculate E_ads = E(slab+adsorbate) - E(slab) - E(adsorbate_gas). c. Formation Energy: For defects like oxygen vacancies.
Data Parsing: Automatically parse output files to populate a computational database with the calculated descriptors for each composition.

Data Presentation

Table 1: Performance Data from an Iterative EI-Driven Catalyst Screening Cycle for CO₂ Hydrogenation

Iteration	Proposed Composition (A-B-C)	Experimental TOF (h⁻¹)	DFT d-band center (eV)	EI Value (Normalized)
0 (Seed)	Co₆₀Fe₂₀Ni₂₀	120	-1.85	N/A
0 (Seed)	Co₂₀Fe₆₀Ni₂₀	85	-1.92	N/A
1	Co₅₀Fe₄₀Ni₁₀	210	-1.78	0.65
1	Co₄₅Fe₁₅Ni₄₀	95	-1.95	0.21
2	Co₅₅Fe₃₅Ni₁₀	380	-1.72	0.89
3	Co₆₀Fe₃₀Ni₁₀	350	-1.70	0.15

Table 2: Essential Research Reagent Solutions & Materials

Item	Function in Workflow
Metal Nitrate Precursor Solutions (0.1M)	Standardized stock solutions for precise robotic dispensing of active metal components.
Alumina-coated Si Wafer Substrate	High-surface-area, inert support for creating catalyst libraries via printing.
Calibration Gas Mixture (e.g., 5% CO₂, 20% H₂, balance Ar)	Standard reactant stream for reproducible catalytic activity screening.
PAW Pseudopotential Library	Essential for accurate and efficient DFT calculations of transition metal systems.
Gaussian Process Kernel (Matérn 5/2)	Core mathematical function defining similarity between compositions in the surrogate model.

Mandatory Visualizations

Diagram Title: Closed-Loop Catalyst Discovery Workflow Architecture

Diagram Title: Expected Improvement Iteration Protocol

1. Introduction Within the context of Bayesian optimization for catalyst discovery, the acquisition function (e.g., Expected Improvement) guides the selection of the next candidate for experimental testing. The efficacy of this process is fundamentally constrained by how the multidimensional search space of catalyst formulations is defined and encoded. This protocol details the systematic encoding of catalyst compositions, supports, and dopants into numerical feature vectors, forming the critical input space for machine learning models in acquisition function-driven research.

2. Encoding Schemes and Quantitative Data A practical encoding strategy combines categorical, compositional, and structural descriptors. The following tables summarize key encoding approaches and their quantitative impact on search space dimensionality.

Table 1: Primary Encoding Schemes for Catalyst Components

Encoding Scheme	Application Example	Description	Dimensionality per Element
One-Hot / Label	Support Type (Al2O3, SiO2, TiO2, Carbon)	Binary vector for each distinct category.	1 (expands to N categories)
Atomic Fraction	Active Metal (Ni, Co, Fe) in a bimetallic catalyst	Molar ratio of each element in the active phase.	1 (sums to 1 for the phase)
Weight Loading	1 wt%, 5 wt% Pt on support	Mass percentage of active component.	1
Physical Descriptor	Support Surface Area, Pore Volume	Measured scalar property of the material.	1
Crystallographic	Dopant Ionic Radius, Dopant Electronegativity	Elemental property of a dopant atom.	1

Table 2: Example Encoded Catalyst Formulation Vector

Feature Category	Specific Feature	Encoding Method	Example Value (Catalyst: 2%Ni-0.5%Cu/SBA-15)
Support	Support Type: SBA-15	One-Hot (vs. Al2O3, TiO2)	[1, 0, 0]
	Support Surface Area (m²/g)	Physical Descriptor	600
Active Metals	Ni Weight Loading	Weight Loading	2.0
	Cu Weight Loading	Weight Loading	0.5
	Ni Atomic Fraction in Metal Phase	Atomic Fraction	0.86
	Cu Atomic Fraction in Metal Phase	Atomic Fraction	0.14
Dopant	Presence of K Dopant	Binary (0/1)	0
Preparation	Calcination Temp (°C)	Physical Descriptor	500
Total Vector Dimensionality			8

3. Experimental Protocol: Generating the Encoded Dataset for Bayesian Optimization

Protocol 3.1: Systematic Feature Vector Construction Objective: To translate a library of synthesized catalyst formulations into a standardized numerical matrix. Materials: Catalyst synthesis records, characterization data (e.g., BET, ICP-OES), elemental property tables (e.g., Pauling electronegativity, ionic radius).

Procedure:

Define the Universal Feature Set: List every unique feature relevant to the catalyst space (e.g., Support_Al2O3, Support_SiO2, Ni_wt%, Cu_wt%, Calcination_Temp). This defines the columns of your design matrix.
Populate Feature Vectors per Catalyst: a. For categorical features (support, dopant presence), assign 1 if true, 0 otherwise. b. For compositional features, input the measured or target weight loading or atomic fraction. c. For physical and elemental descriptors, input the measured or tabulated value.
Handle Missing Data: For unreported features, use imputation (e.g., median value for physical descriptors) or a dedicated null indicator (e.g., -999), ensuring the model is aware of the imputation.
Normalize Features: Apply standard scaling (z-score) or min-max scaling to all continuous features to ensure equal weighting during model training.
Associate with Target Property: Align each catalyst's feature vector with its corresponding performance metric (e.g., yield, TOF) from activity testing. This forms the complete dataset for the surrogate model.

Protocol 3.2: Iterative Search Space Expansion via Acquisition Function Objective: To integrate the encoded search space into the Bayesian optimization loop for candidate selection. Workflow Diagram:

Title: Bayesian Optimization Loop with Search Space Encoding

4. The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for Catalyst Synthesis & Encoding

Item / Reagent	Function in Search Space Definition
High-Throughput Impregnation Robot	Enables precise, automated synthesis of catalyst libraries with varying compositions/dopant levels, generating consistent data for encoding.
Inductively Coupled Plasma Optical Emission Spectrometry (ICP-OES)	Provides quantitative elemental analysis for accurate encoding of weight loading and atomic fraction features.
Surface Area & Porosimetry Analyzer (BET)	Measures critical physical descriptor features (surface area, pore volume) of catalyst supports.
Crystallographic Database (ICSD, COD)	Source for ionic radius and structural descriptors for dopant and active phase encoding.
Elemental Property Table (e.g., CRC Handbook)	Source for electronegativity, valence electron count used as dopant/site descriptors.
Data Curation Software (e.g., CATKit, custom Python/R scripts)	Essential for automating the transformation of synthesis records into standardized, encoded feature vectors.

5. Logical Framework for Search Space Definition The following diagram illustrates the hierarchical and combinatorial nature of defining the catalyst search space.

Title: From Catalyst Components to Acquisition Function Input

Choosing and Training the Surrogate Model for Catalytic Performance Prediction

Within the broader thesis on using acquisition functions, specifically Expected Improvement (EI), for catalyst composition selection, the surrogate model is the cornerstone. It acts as a computationally cheap proxy for expensive experimental or high-fidelity computational (e.g., DFT) evaluations of catalytic performance (e.g., activity, selectivity). This document details the application notes and protocols for selecting, training, and validating surrogate models to enable efficient Bayesian optimization (BO) loops for catalyst discovery.

Surrogate Model Options: Comparison and Selection

The choice of model depends on dataset size, dimensionality, and noise characteristics. Below is a comparative analysis of commonly used models in catalyst informatics.

Table 1: Comparison of Surrogate Model Candidates for Catalytic Performance Prediction

Model Type	Key Advantages	Key Limitations	Recommended Use Case	Key Hyperparameters to Tune
Gaussian Process (GP)	Provides uncertainty estimates natively, well-suited for BO. Strong theoretical foundation.	Poor scalability with data (O(n³)). Kernel choice is critical.	Small to medium datasets (<10k samples). High-value experiments where uncertainty quantification is critical.	Kernel type (RBF, Matern), length scales, noise level.
Random Forest (RF)	Handles high dimensions, robust to outliers and irrelevant features. Lower computational cost for training.	Uncertainty estimates are less reliable than GP. Extrapolation performance can be poor.	Medium to large datasets. Mixed feature types (compositional, structural).	Number of trees, max depth, min samples split.
Gradient Boosting Machines (GBM)	Often higher predictive accuracy than RF. Handles mixed data types well.	More prone to overfitting. Requires careful tuning. Sequential training is slower.	Medium to large datasets where predictive accuracy is paramount.	Learning rate, number of estimators, max depth.
Neural Networks (NN)	Extremely flexible, can model complex non-linear interactions. Scalable to very large datasets.	Requires large data. Uncertainty estimation not inherent (requires techniques like dropout or ensemble).	Very large datasets (>50k samples). Complex descriptor spaces (e.g., graph representations of catalysts).	Network architecture, learning rate, dropout rate.
Sparse Gaussian Process	Retains GP benefits (uncertainty) with improved scalability.	Approximation introduces error. More complex implementation.	Medium-sized datasets where GP is ideal but computationally prohibitive.	Inducing point number and initialization.

Application Note 2.1: For a typical catalyst discovery BO loop with an expensive-to-evaluate function (e.g., experimental turnover frequency) and a dataset size of a few hundred points, the Gaussian Process with a Matern 5/2 kernel is often the default recommendation due to its balanced performance and native uncertainty quantification essential for EI.

Detailed Protocol: Training and Validating a Gaussian Process Surrogate Model

Protocol 3.1: Data Preprocessing for Catalyst Features

Feature Engineering: Generate a unified feature vector for each catalyst candidate. This may include:
- Compositional Features: Elemental fractions, statistical moments (mean, variance) of atomic properties (electronegativity, radius).
- Structural/Conditional Features: Surface coverage, reaction temperature, pressure descriptors.
- Descriptors: Use libraries like matminer or dscribe to compute oxidation states, bond length distributions, etc.
Feature Scaling: Standardize all features to have zero mean and unit variance using StandardScaler from scikit-learn. Fit the scaler on the training set only, then transform both training and test sets.
Target Variable Handling: For regression (predicting a continuous performance metric), check for outliers. Consider log-transformation if the target values span several orders of magnitude.

Protocol 3.2: Model Training, Validation, and Uncertainty Calibration

Train-Test Split: Perform a stratified split (e.g., 80/20) based on key compositional families or use spatial splitting (e.g., Kennard-Stone) to ensure the test set is representative of the chemical space.
Kernel Selection & Initialization:
- Use a Matern(length_scale=1.0, nu=2.5) kernel as a robust default for modeling catalytic landscapes.
- Add a WhiteKernel(noise_level=0.1) to account for experimental noise.
- The final kernel is often the sum: Matern() + WhiteKernel().
Model Fitting: Use GaussianProcessRegressor (scikit-learn) or GPyTorch/GPflow for more flexibility. Optimize the kernel hyperparameters by maximizing the log-marginal likelihood.
Validation: Predict on the held-out test set. Calculate key metrics: Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Coefficient of Determination (R²).
Uncertainty Calibration (Critical for EI): Ensure the model's predicted standard deviation is meaningful. A well-calibrated model's uncertainty should correlate with prediction error. Use calibration plots (predicted std. dev. vs. actual absolute error).

Table 2: Example Performance Metrics for a GP Model on a Bimetallic Catalyst Dataset (n=420)

Data Split	Sample Size	R²	MAE (TOF, s⁻¹)	RMSE (TOF, s⁻¹)	Avg. Predictive Std. Dev.
Training	336	0.89	0.18	0.25	0.21
Test	84	0.82	0.25	0.34	0.29

Integration with Expected Improvement Acquisition Function

Once trained and validated, the surrogate model is integrated into the BO loop. The Expected Improvement (EI) for a candidate catalyst x is calculated as: EI(x) = E[max( f(x) - f(x), 0 )] where f(x) is the surrogate model's prediction (a Gaussian distribution: N(μ(x), σ²(x))), and f(x*) is the best performance observed so far. *Implementation Note: Use a library like BoTorch or scikit-optimize which provides efficient, numerically stable implementations of EI that handle the exploration-exploitation trade-off.

Visualization of the Workflow

Title: Surrogate Model Training and BO Loop for Catalysts

Title: Thesis Context Model Hierarchy

The Scientist's Toolkit

Table 3: Essential Research Reagent Solutions for Catalyst Surrogate Modeling

Item	Function & Application Note
scikit-learn	Core library for ML models (GP, RF, GBM), preprocessing, and validation. Use `GaussianProcessRegressor` for basic GP implementations.
GPyTorch / GPflow	Advanced libraries for scalable, flexible Gaussian Process modeling, essential for larger datasets or custom kernels.
matminer / dscribe	Libraries for generating feature descriptors from material compositions and structures (e.g., elemental property statistics, SOAP descriptors).
BoTorch	A Bayesian optimization library built on PyTorch. Provides state-of-the-art implementations of acquisition functions like EI and supports compositional spaces.
pymatgen	Python materials analysis library for parsing, analyzing, and representing catalyst structures and compositions.
Catalysis-Hub.org	A public repository for surface reaction energies and barriers from DFT, a potential source of training data for surrogate models.
StandardScaler	The default tool for feature standardization (zero mean, unit variance). Critical for distance-based models like GP and NN.
Matern Kernel (ν=2.5)	The recommended default kernel for GPs in this domain, offering a good balance of smoothness and flexibility to model catalytic response surfaces.

Application Notes

Expected Improvement (EI) is the predominant acquisition function for Bayesian optimization (BO), a sequential design strategy for global optimization of expensive-to-evaluate black-box functions. In catalyst composition selection research, EI enables efficient navigation of high-dimensional, combinatorial search spaces (e.g., multi-metallic ratios, dopants, supports) by quantifying the potential utility of evaluating a candidate composition based on a probabilistic surrogate model, typically a Gaussian Process (GP).

Core Algorithmic Implementation: The EI acquisition function for a minimization problem at a candidate point x is defined as: EI(x) = (μ(x) - f(x*) - ξ) * Φ(Z) + σ(x) * φ(Z), if σ(x) > 0, else 0. Where: Z = (μ(x) - f(x*) - ξ) / σ(x). Here, μ(x) and σ(x) are the GP posterior mean and standard deviation, f(x*) is the current best observed function value (incumbent), ξ is a user-defined trade-off parameter balancing exploration and exploitation, and Φ and φ are the CDF and PDF of the standard normal distribution, respectively. Maximizing EI selects the next point for experimental synthesis and testing.

Key Software Libraries: Modern libraries implement robust, scalable EI optimization, handling gradients, constraints, and parallel evaluation.

Table 1: Comparison of Primary Software Libraries for EI

Library	Primary Language	Key Features for EI & Catalyst Research	License
BoTorch	Python (PyTorch)	High-dimensional optimization, compositional/one-hot encoding for categorical variables (e.g., support type), batch (parallel) EI, analytic gradients.	MIT
GPyOpt	Python (GPy)	Easy-to-use interface, basic sequential and batch EI.	BSD 3-Clause
Dragonfly	Python	Handles variables of mixed types (continuous, discrete, categorical), suitable for complex catalyst parameter spaces.	MIT
scikit-optimize	Python	Simple "ask-and-tell" interface, supports expected improvement for numerical spaces.	BSD 3-Clause

Experimental Protocols

Protocol 2.1: High-Throughput Virtual Screening of Bimetallic Catalysts Using EI

This protocol outlines a computational workflow for optimizing the composition and strain of a bimetallic alloy catalyst for oxygen reduction reaction (ORR) activity.

Objective: Maximize predicted ORR activity descriptor (e.g., ΔG_O - ΔG_OH) via Density Functional Theory (DFT) calculations guided by EI. Design Space: Two continuous variables: Composition (A$x$B${1-x}$, x ∈ [0,1]) and Biaxial Strain (ε ∈ [-5%, +5%]). Surrogate Model: Gaussian Process with Matérn 5/2 kernel. Acquisition Function: Expected Improvement (ξ = 0.01).

Procedure:

Initial Design: Select 10 points via Latin Hypercube Sampling (LHS) across the 2D space.
Initial Evaluation: Perform DFT calculations at these 10 points to obtain the activity descriptor values. This forms the initial dataset D.
BO Loop (Iterate for 30 cycles): a. Model Training: Fit a GP to the current dataset D. b. EI Maximization: Using BoTorch's qEI with L-BFGS-B, find the point x_next that maximizes EI. Incorporate known physical constraints via penalty functions if needed. c. Parallel Evaluation: For batch mode (e.g., 4 candidates per batch), use qEI to select a batch of points that jointly maximize information gain. d. Expensive Evaluation: Run DFT calculation for x_next (or batch). e. Data Augmentation: Append {x_next, y_next} to dataset D.
Termination & Validation: After 30 iterations, validate the top 3 predicted optimal compositions with higher-fidelity DFT calculations or literature comparison.

Protocol 2.2: Experimental Optimization of Zeolite Catalyst Synthesis via Batch EI

This protocol guides the lab-scale optimization of zeolite synthesis conditions for maximizing yield.

Objective: Maximize zeolite product yield (wt%). Design Space: Four continuous variables: Hydrothermal Temperature (140-180°C), Time (12-72 hr), SiO2/Al2O3 Ratio (20-50), and OH-/SiO2 Ratio (0.2-0.5). Surrogate Model: Gaussian Process with Matérn 5/2 kernel with Automatic Relevance Determination (ARD). Acquisition Function: Expected Improvement (ξ = 0.1) with a noisy observations assumption.

Procedure:

Initial Design: Select 15 experimental conditions via LHS.
Initial Synthesis & Characterization: Execute syntheses in parallel autoclaves, filter, dry, and weigh products to determine yields. Record in dataset D.
BO Loop (Iterate for 20 batches): a. GP Training: Fit a GP model to D, using a noise likelihood to account for experimental variability. b. Batch EI Optimization: Using BoTorch's qNoisyExpectedImprovement (qNEI), select a batch of 4 synthesis conditions that maximize joint EI, accounting for pending experiments. c. Experimental Execution: A technician carries out the 4 synthesis and characterization protocols in parallel. d. Data Update: Append the new results to D.
Analysis: Identify the optimal synthesis condition from the final dataset. Characterize the resultant zeolite material via XRD and BET surface area analysis.

Visualizations

EI-Driven Catalyst Optimization Workflow

EI Calculation for Candidate Selection

The Scientist's Toolkit

Table 2: Key Research Reagent Solutions for EI-Guided Catalyst Discovery

Item / Solution	Function in Protocol
BoTorch / GPyOpt Library	Core software for implementing the Bayesian optimization loop, including GP fitting and EI maximization.
High-Performance Computing (HPC) Cluster	Executes parallel DFT calculations (Protocol 2.1) or manages computational jobs for surrogate modeling.
Parallel Synthesis Reactor Array	Enables high-throughput experimental batch evaluation (e.g., 4-8 simultaneous hydrothermal syntheses in Protocol 2.2).
Automated Characterization Suite	Provides rapid feedback on catalyst properties (e.g., yield, selectivity, surface area) to feed the BO data loop.
Domain-Specific Descriptor Calculator	Translates catalyst composition/structure into quantitative features for the GP model if not using raw variables.

1. Introduction and Thesis Context

This application note details a case study demonstrating the efficacy of Expected Improvement (EI) as an acquisition function within a Bayesian optimization (BO) framework for the discovery of novel heterogeneous bimetallic catalysts. The work is situated within a broader thesis positing that EI, which balances exploration of uncertain regions and exploitation of known high-performance areas, is uniquely suited for navigating the high-dimensional, costly-to-evaluate composition spaces typical in catalyst discovery. The target reaction is the Suzuki-Miyaura cross-coupling, a pivotal C-C bond-forming reaction in pharmaceutical synthesis, where improving catalyst activity, selectivity, and stability under mild conditions remains a key industrial objective.

2. Experimental Design and Bayesian Optimization Workflow

The experimental space was defined by two continuous variables: the atomic ratio of Palladium (Pd) to a second, earth-abundant metal (M), and the calcination temperature of the catalyst support. A Gaussian Process (GP) surrogate model was trained on an initial dataset of 12 randomly selected compositions. The EI acquisition function was then used to sequentially select the next candidate catalyst for synthesis and testing, maximizing the expected gain over the current best performance (here, yield %).

2.1. Detailed Experimental Protocol: Catalyst Synthesis (Impregnation & Calcination)

Objective: To prepare a series of Pd-M bimetallic catalysts on a mesoporous carbon support.
Materials: See "Research Reagent Solutions" table.
Procedure:
- Solution Preparation: Calculate the required masses of Pd(NO₃)₂·xH₂O and M(NO₃)ₙ·xH₂O to achieve the target Pd:M atomic ratio (e.g., 1:1, 3:1, 1:3) for a total metal loading of 2 wt.% on 1.0 g of support.
- Wet Impregnation: Dissolve the calculated metal precursors in 10 mL of deionized water. Add 1.0 g of mesoporous carbon powder to the solution. Stir the slurry at room temperature for 4 hours.
- Drying: Remove water via rotary evaporation at 60°C under reduced pressure until a dry powder is obtained.
- Calcination: Transfer the dry powder to a quartz boat. Place in a tube furnace under a flowing N₂ atmosphere (50 mL/min). Heat to the target temperature (range: 300°C–600°C) at a ramp rate of 5°C/min, hold for 3 hours, then allow to cool to room temperature under N₂.
- Reduction (Optional, in situ): For testing, the catalyst is reduced in situ in the reaction vessel under H₂ flow prior to reaction commencement, unless otherwise specified by the calcination protocol.
- Characterization: Perform X-ray diffraction (XRD) and X-ray photoelectron spectroscopy (XPS) on select samples to confirm alloy formation and metal oxidation states.

2.2. Detailed Experimental Protocol: Suzuki-Miyaura Coupling Reaction Screening

Objective: To evaluate catalyst performance in the coupling of 4-bromotoluene with phenylboronic acid.
Materials: See table.
Procedure:
- In a dried 10 mL Schlenk tube under N₂ atmosphere, combine 4-bromotoluene (1.0 mmol, 171 mg), phenylboronic acid (1.5 mmol, 183 mg), and K₂CO₃ (2.0 mmol, 277 mg).
- Add a solvent mixture of toluene/water (4:1 v/v, 5 mL total).
- Add the synthesized bimetallic catalyst (25 mg, 0.5 mol% Pd relative to aryl halide).
- Seal the tube and heat the reaction mixture to 80°C with vigorous stirring (800 rpm).
- Monitor reaction progress by thin-layer chromatography (TLC) or withdraw aliquots at 1, 2, 4, and 8 hours for GC-MS analysis.
- After 8 hours, cool the reaction to room temperature. Dilute with ethyl acetate (10 mL) and filter through a Celite pad to recover the catalyst.
- Analyze the organic phase by gas chromatography with flame ionization detection (GC-FID) using dodecane as an internal standard to determine yield.

3. Data Presentation and Optimization Results

Table 1: Representative Experimental Data from the EI-Guided Campaign

Experiment	Pd:M Ratio	Calcination Temp. (°C)	Yield (%) @ 8h	EI Selection Rank
Initial-01	1:1 (Co)	400	45	N/A
Initial-02	3:1 (Ni)	500	62	N/A
...	...	...	...	...
EI-01	1:2 (Cu)	350	78	1
EI-02	2:1 (Co)	450	65	2
...	...	...	...	...
EI-07	1:3 (Cu)	375	>99	1
Final Best	1:3 (Cu)	375	>99	-

Table 2: Comparison of Optimal Catalyst vs. Benchmarks

Catalyst	Pd Loading (mol%)	Yield (%)	Turnover Number (TON)	Selectivity (%)
Commercial Pd/C	0.5	85	170	>99
Pd-Ni (Initial Best)	0.5	62	124	>99
Pd-Cu (EI-Optimized)	0.5	>99	>198	>99
Monometallic Pd	0.5	70	140	>99

4. Visualization of Workflows and Relationships

Title: Bayesian Optimization Loop for Catalyst Discovery

Title: Suzuki-Miyaura Catalytic Cycle on Pd-Cu Site

5. The Scientist's Toolkit: Research Reagent Solutions

Item	Function / Role in Experiment
Pd(NO₃)₂·xH₂O	Palladium precursor for catalyst synthesis.
Cu(NO₃)₂·3H₂O	Copper precursor; co-metal in optimal bimetallic catalyst.
Mesoporous Carbon Support	High-surface-area support for dispersing metal nanoparticles.
4-Bromotoluene	Model aryl halide coupling partner.
Phenylboronic Acid	Model boronic acid coupling partner.
Potassium Carbonate (K₂CO₃)	Base, activates boronic acid and facilitates transmetalation.
Toluene/Water (4:1) Solvent	Biphasic solvent system common for Suzuki reactions.
GC-MS & GC-FID System	For reaction monitoring and quantitative yield analysis.
Schlenk Line/Tube	For conducting air-sensitive reactions under inert (N₂) atmosphere.
Bayesian Optimization Software	(e.g., GPyOpt, BoTorch) To implement the GP and EI algorithm.

Overcoming Practical Hurdles: Tuning EI for Noisy, Constrained, and Multi-Objective Catalyst Data

Handling Experimental Noise and Replicability in Catalytic Activity Measurements

1. Introduction and Context Within catalyst discovery driven by Bayesian optimization and acquisition functions like Expected Improvement (EI), the fidelity of the catalytic activity measurement is the critical bottleneck. Noisy or irreproducible data misdirects the composition search, wasting iterations and resources. This document provides protocols to quantify, mitigate, and account for experimental noise, ensuring that the "improvement" sought by the EI function is statistically significant and replicable.

2. Quantifying Measurement Noise: A Pre-Optimization Requirement Before initiating any high-throughput experimentation (HTE) or optimization loop, baseline noise for the primary activity assay must be established.

Protocol 2.1: Determining Assay Signal-to-Noise Ratio (SNR) and Z'-Factor

Plate Design: On a single microtiter plate, prepare two sets of control wells: high-control (catalyst known to give strong signal) and low-control (no catalyst or deactivated catalyst). Use a minimum of n=16 replicates for each control type, distributed across the plate.
Assay Execution: Run the standard catalytic activity assay under identical conditions.
Data Analysis: Calculate the mean (μ) and standard deviation (σ) for both high and low controls.
- SNR: (μhigh - μlow) / σ_high
- Z'-Factor: 1 - [ (3σhigh + 3σlow) / |μhigh - μlow| ]
Interpretation: A Z'-Factor > 0.5 indicates an excellent assay suitable for screening. SNR >10 is typically desirable. These values must be re-checked periodically.

Table 1: Example Baseline Noise Metrics for a Model Hydrogenation Reaction

Control Type	Mean Conversion (%)	Std Dev (σ)	N	SNR (vs. Low)	Z'-Factor
High (5% Pd/C)	95.2	2.1	16	45.3	0.86
Low (No Catalyst)	1.5	0.7	16	-	-

3. Core Protocol: Replicable Catalyst Activity Measurement This protocol is designed for solid heterogeneous catalysts in liquid-phase batch reactions, with conversion measured by GC.

Protocol 3.1: Standardized Catalyst Testing Workflow

Catalyst Synthesis & Loading: Precisely control precursor concentrations, deposition sequences, and calcination/reduction temperature ramps (±2°C). For supported catalysts, use an analytical balance (±0.01 mg) to load a precise mass (e.g., 10.0 mg) into the reaction vessel.
Reaction Setup (Inert Atmosphere): Perform all transfers in a glovebox or using Schlenk-line techniques. Use septum-sealed vials/reactors.
Pre-treatment: Activate catalysts in situ under a flow of relevant gas (e.g., H₂, 20 mL/min) at specified temperature for 1 hour, then cool under inert gas.
Reaction Initiation: Inject a degassed, precise volume of substrate solution via syringe pump to ensure consistent start time and mixing.
Sampling: At defined timepoints (e.g., 5, 15, 30, 60 min), withdraw a small, consistent aliquot (e.g., 50 µL) via syringe, immediately quench, and dilute for analysis.
Analysis: Use gas chromatography (GC) or high-performance liquid chromatography (HPLC) with internal standard calibration. Each sample is analyzed in triplicate injections.
Data Processing: Report conversion, turnover frequency (TOF), and selectivity. TOF should be calculated from the initial slope (first 10-15% conversion) to minimize mass-transfer artifacts.

4. Noise Mitigation Through Experimental Design

Blocking: When testing a library of compositions across multiple plates or batches, include a shared reference catalyst in each block to correct for inter-batch variability.
Randomization: Test compositions in a randomized order to avoid systematic bias from instrument drift or reagent degradation.
Replication Strategy: Use a nested replication model. Technical replicates (multiple aliquots from the same reaction mixture) quantify analytical noise. Independent experimental replicates (separately synthesized and tested catalysts) quantify synthesis and holistic experimental noise. For EI, prioritize independent replicates for promising compositions.

Table 2: Replication Strategy for Different Optimization Phases

Phase	Goal	Independent Replicates (Synthesis)	Technical Replicates (Analysis)	Primary Output
Initial Screening	Identify hits	n=1	n=3 (injection)	Conversion ± SD
EI Candidate Evaluation	Reliable ranking for EI	n=2	n=3	Mean TOF & 95% CI
Validation	Confirm final leads	n=3	n=5	Full kinetics ± Error

5. Integrating Noise into the Acquisition Function The EI acquisition function can be modified to account for noise by using a noisy expected improvement criterion, which uses the posterior predictive distribution from a Gaussian Process (GP) model that incorporates a noise term (σ²_n).

Protocol 5.1: Configuring a Noise-Aware GP Model for Catalyst Data

Model Definition: Use a GP prior with a Matérn kernel (e.g., ν=5/2) to model the catalyst composition-activity landscape.
Likelihood: Employ a Gaussian likelihood function where the total variance is the sum of the GP variance and the observed noise variance (σ²_obs) for each data point.
Input Data: For each tested composition i, input the mean activity (yi) and the standard error of the mean (SEMi) derived from replicates.
Hyperparameter Optimization: Optimize kernel hyperparameters (length scale, variance) and the global noise hyperparameter by maximizing the marginal log-likelihood.
Noisy EI Calculation: The acquisition function αEI(x) is computed using the posterior mean μ(x) and variance σ²(x) + σ²n, where σ²_n is the estimated noise at the candidate point x.

The Scientist's Toolkit: Key Research Reagent Solutions

Table 3: Essential Materials for Noise-Reduced Catalytic Testing

Item	Function & Rationale
Automated Liquid Handling Robot	Enables precise, sub-microliter dispensing of catalyst precursors and reagents, eliminating pipetting variability in library synthesis.
High-Pressure Parallel Reactor System	Provides consistent temperature (±0.5°C) and agitation control across multiple catalyst tests, removing environmental noise.
Online GC/MS or HPLC with Autosampler	Allows for automated, timed sampling and analysis, ensuring consistent quenching and injection volumes, critical for kinetic profiles.
Deuterated Internal Standards	Added to reaction aliquots before analysis to correct for variations in sample preparation and injection volume in quantitative GC/LC-MS.
Certified Reference Catalyst (e.g., EUROPT-1)	A well-characterized, commercial silica-supported Pt catalyst used as a benchmark to validate reactor performance and analytical protocols across labs.
Degassed, HPLC-Grade Solvents in Sealed Bottles	Minimizes variability in solvent purity and dissolved oxygen content, which can poison or alter catalyst performance.

6. Visualization of Workflows and Concepts

Title: Bayesian Optimization Loop with Noise Handling

Title: Noise Sources and Corresponding Mitigation Protocols

Within the broader thesis on "Advanced Acquisition Functions for Catalyst Composition Selection in Drug Development," the Expected Improvement (EI) criterion serves as a cornerstone for Bayesian optimization (BO) of high-value, multi-property catalytic materials. Traditional EI solely maximizes an objective function (e.g., reaction yield), often leading to proposals that are chemically intractable, prohibitively expensive, or unstable. This document details protocols for integrating cost, stability, and synthetic feasibility as explicit constraints into the EI framework, enabling the efficient navigation of complex composition spaces towards viable, developable catalysts.

Mathematical Formulation of Constrained EI

The constrained Expected Improvement (cEI) modifies the standard EI by multiplying it with a probability of feasibility. For multiple constraints, the acquisition function becomes:

cEI(x) = EI( f(x) ) * ∏ P( Ci(x) ≤ thresholdi )

Where:

EI(f(x)) is the standard Expected Improvement on the primary objective (e.g., yield, selectivity).
P( Ci(x) ≤ thresholdi ) is the probability that the i-th predicted constraint (modeled by a separate Gaussian Process) is within a specified acceptable limit.

Table 1: Quantitative Metrics and Their Corresponding Constraint Thresholds

Constraint Dimension	Representative Metric (C_i)	Typical Threshold (for P = 0.95)	GP Kernel Common Choice
Cost	Estimated $/kg of Catalyst	≤ $5,000	Matérn 5/2
Stability	% Activity Loss after 24h	≤ 10%	Matérn 3/2
Synthetic Feasibility	Predicted Step Score (0-1)	≥ 0.7	Radial Basis Function (RBF)

Experimental Protocols for Constraint Modeling

Protocol 3.1: Data Generation for Cost Modeling

Objective: To create a dataset linking catalyst composition (e.g., %Pt, %Pd, support identity) to a normalized cost metric. Procedure:

Define Basis: List all precursor salts, ligands, and supports for the target catalyst library.
Price Aggregation: Query current bulk prices (≥100g) from Sigma-Aldrich, Fisher Scientific, and Strem Chemicals for each component. Record date and source.
Calculate Composition Cost: For each hypothetical catalyst Cat_{A_x,B_y}, compute: Cost = Σ (molar_frac_i * MW_i * price_$_per_g_i) / Target_MW_Catalyst.
Normalize: Scale all costs from 0-1 relative to the most expensive plausible composition in the design space. Key Output: A lookup table or a trained surrogate model GP_cost = f(composition).

Protocol 3.2: Accelerated Stability Screening Protocol

Objective: To rapidly assess catalyst stability under simulated reaction conditions. Procedure:

Material: 10 mg of each candidate catalyst (synthesized via Protocol 3.3).
Equipment: High-throughput parallel pressure reactor array (e.g., from Unchained Labs).
Process: a. Charge each reactor with catalyst, substrate, and solvent under inert atmosphere. b. Run the main reaction at standard conditions (T, P) for 1 hour. Take an aliquot for initial activity A_0 (e.g., via UPLC yield analysis). c. Without catalyst removal, maintain the reaction mixture at an elevated temperature (e.g., T + 20°C) for 24 hours. d. Cool, take a final aliquot, and measure final activity A_f.
Calculation: Stability Metric = [1 - (A_0 - A_f)/A_0] * 100%. Key Output: % Activity retained for each tested composition.

Protocol 3.3: Synthesis Feasibility Scoring

Objective: To assign a quantitative feasibility score (0-1) to a proposed catalyst composition. Procedure:

Retrosynthetic Analysis: Decompose target composition into plausible synthetic steps (e.g., co-impregnation, sequential deposition, co-precipitation).
Rule-Based Scoring: Apply the following scoring rubric (weights are tunable):
- Step Complexity (w=0.4): 1.0 for one-pot, 0.6 for two-step, 0.3 for >2 steps.
- Condition Severity (w=0.3): 1.0 for ambient T/P, 0.7 for T<100°C, 0.4 for T>100°C or P>10 bar.
- Literature Precedence (w=0.3): 1.0 for >5 analogous reported syntheses, 0.5 for 1-5, 0.1 for none.
Calculate Score: Feasibility_Score = Σ (weight_i * rule_score_i). Key Output: A scalar score for each composition; a threshold (e.g., 0.7) defines the feasible region.

The Scientist's Toolkit: Research Reagent Solutions

Table 2: Essential Materials for Constrained Catalyst Optimization Studies

Item	Function/Description	Example Supplier
Parallel Pressure Reactors	Enables high-throughput activity & stability testing under inert/reactive atmospheres.	Unchained Labs, AMTEC
Precursor Chemical Libraries	Pre-curated sets of metal salts, ligands, and supports for rapid catalyst formulation.	Strem Chemicals, Sigma-Aldrich Custom Kit
Automated Liquid Handling Robot	For precise, reproducible catalyst synthesis via impregnation or slurry preparation.	Hamilton, Opentrons
Bench-top UPLC-MS	Provides rapid, quantitative analysis of reaction yields and selectivity for EI objective.	Waters, Agilent
Thermogravimetric Analysis (TGA)	Critical for stability assessment, measuring catalyst decomposition under programmed heating.	Mettler Toledo, TA Instruments
Chemical Cost Database Access	Subscription service for up-to-date bulk pricing of chemicals and materials.	Sigma-Aldrich Quote, Knowde

Workflow for Constrained Catalyst Selection

Diagram 1: Constrained Bayesian Optimization Workflow

Visualization of the Constrained EI Decision Surface

Diagram 2: cEI Intersection of Performance and Constraints

This application note is framed within a broader thesis investigating advanced acquisition functions, specifically Expected Improvement (EI), for high-throughput catalyst and molecular composition selection in drug development and synthetic chemistry. Traditional optimization often targets a single objective (e.g., catalytic activity). However, real-world application requires balancing multiple, often competing, objectives such as activity, selectivity, and operational lifetime/stability. This document details protocols for implementing Parallel Multi-Objective Expected Improvement (MOqEI) to efficiently navigate this complex trade-off space, accelerating the discovery of optimal, deployable compounds.

Core Principles of Parallel Multi-Objective Expected Improvement (MOqEI)

MOqEI extends the classical EI acquisition function to multi-objective scenarios. It quantifies the expected improvement of a candidate point over the current Pareto front—the set of solutions where no objective can be improved without worsening another. The "q" in qEI denotes the batch or parallel evaluation of multiple candidates per cycle, essential for leveraging high-throughput experimentation platforms.

The acquisition function for simultaneously optimizing activity (to maximize), selectivity (to maximize), and lifetime (to maximize) can be formulated as: [ \alpha{MOqEI}(\mathbf{x}) = \mathbb{E}\left[ \max{i} \left( \prod{m=1}^{M} I{m}(\mathbf{x}i) \right) \right] ] Where (I{m}) is the improvement for the (m)-th objective, and (i) indexes the (q) points in the batch. This guides the selection of experiment batches that promise the greatest joint improvement across all objectives.

Table 1: Comparison of Acquisition Functions for a Ternary Catalyst Optimization Benchmark (Simulated Data)

Acquisition Function	Hypervolume (HV) Increase*	Iterations to 90% Max HV	Parallel Efficiency
Random Sampling	1.00 (Baseline)	45	Not Applicable
Sequential MO-EI	2.85	18	Low
Parallel MOqEI (q=4)	3.42	12	High
ParEGO	2.50	22	Medium

Hypervolume: Measures the volume of objective space dominated by the Pareto front. Higher is better.

Table 2: Representative Experimental Outcomes from a High-Throughput Cross-Coupling Catalyst Screen

Catalyst ID	Activity (TOF, h⁻¹)	Selectivity (% ee)	Lifetime (TON)	Pareto Optimal?
Cat-A (Initial Lead)	1,200	95	10,000	Yes
Cat-B	2,800	88	8,500	No
Cat-C	950	99	50,000	Yes
Cat-D (MOqEI Selected)	2,100	97	45,000	Yes

TOF: Turnover Frequency; TON: Total Turnover Number; ee: Enantiomeric Excess.

Experimental Protocols

Protocol 4.1: High-Throughput Screening for Catalyst Performance Triad

Objective: Simultaneously measure activity, selectivity, and lifetime for heterogeneous catalyst libraries. Materials: See "Scientist's Toolkit" (Section 6). Procedure:

Library Preparation: Using an automated liquid handler, dispense candidate catalyst compositions (e.g., Pd/XPhos/Ligand-Z variants) in a 96-well microplate. Use inert atmosphere conditions.
Reaction Initiation: Add substrate solution (e.g., aryl halide and nucleophile) to all wells simultaneously via a multichannel pipette to start the cross-coupling reaction.
Activity & Selectivity Assay (Time Point T1): a. At a fixed early time point (e.g., 5 min), quench a 10 µL aliquot from each well with 100 µL of analytical internal standard solution. b. Analyze via UPLC-MS/MS to determine conversion (activity) and enantiomeric ratio (selectivity).
Lifetime Assay (Time Point T2-Tn): a. After initial reading, add a second bolus of fresh substrate to the ongoing reaction in each well. b. Repeat quenching and analysis at periodic intervals (e.g., 30, 60, 120 min). c. Calculate Total Turnover Number (TON) as total product moles formed per mole catalyst until conversion plateaus below 5%.
Data Processing: Normalize all metrics. Activity as TOF (TON per hour at T1). Selectivity as % ee. Lifetime as final TON.

Protocol 4.2: Bayesian Optimization Loop with Parallel MOqEI

Objective: Iteratively select and test catalyst batches to rapidly converge on the Pareto front. Procedure:

Initial Design: Perform a space-filling experimental design (e.g., Latin Hypercube) for 20-30 catalyst compositions. Execute using Protocol 4.1.
Model Training: Construct independent Gaussian Process (GP) surrogate models for each objective (Activity, Selectivity, Lifetime) using the experimental data.
Batch Selection via MOqEI: a. Using the trained GPs, compute the MOqEI acquisition function across the entire candidate composition space. b. Select the top q=4 candidate compositions that maximize MOqEI, ensuring chemical diversity via an embedded penalty. c. Output the batch for experimental testing.
High-Throughput Experimentation: Test the selected q candidates in parallel using Protocol 4.1.
Iteration: Append new results to the dataset. Retrain GP models. Repeat from Step 3 for a defined number of cycles (e.g., 10-15).
Pareto Front Analysis: After final cycle, identify the set of non-dominated optimal catalysts from the full dataset.

Visualizations

Diagram Title: Parallel MOqEI High-Throughput Optimization Workflow

Diagram Title: Interdependencies Between the Three Core Optimization Objectives

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for High-Throughput Multi-Objective Catalyst Screening

Item / Reagent	Function / Role in Protocol	Example Product / Specification
Automated Liquid Handler	Enables precise, reproducible dispensing of catalyst and substrate libraries in microplate format. Essential for parallel batch preparation.	Beckman Coulter Biomek i7
96-Well Microplate Reactor	Provides a miniaturized, parallelized reaction environment compatible with high-throughput screening workflows.	Unchained Labs Little Things Reactor
UPLC-MS/MS System	Delivers rapid, quantitative analysis of conversion (activity) and enantiomeric excess (selectivity) from quenched reaction aliquots.	Waters Acquity UPLC with Xevo TQ-XS
Chiral Stationary Phase Column	Critical for separating enantiomers to calculate selectivity (% ee) during UPLC analysis.	Chiralpak IA-3 (3µm)
Gaussian Process Modeling Software	Platform for building surrogate models and calculating the MOqEI acquisition function to guide batch selection.	Python with BoTorch / GPyTorch libraries
Inert Atmosphere Glovebox	Maintains oxygen- and moisture-free environment for handling air-sensitive organometallic catalyst complexes.	MBraun Labstar (<1 ppm O₂)

Application Notes & Protocols

1. Context & Rationale Within catalyst discovery for pharmaceutical synthesis, Bayesian Optimization (BO) with Expected Improvement (EI) is a cornerstone. However, EI's tendency toward excessive exploitation can lead to premature convergence on suboptimal catalytic compositions, wasting experimental resources. This protocol details strategies to mitigate this stagnation, directly supporting thesis research on adaptive acquisition functions for high-throughput catalyst screening.

2. Quantitative Comparison of Stagnation-Prevention Modifications Table 1: Modifications to the Standard EI Acquisition Function for Enhanced Exploration

Modification	Key Parameter(s)	Effect on Search Behavior	Primary Use Case in Catalyst Discovery
EI with Plugin ψ (EI-PI)	ψ (plugin improvement)	Increases weight on uncertainty; penalizes points too close to current best.	Early-stage screening of broad composition space (e.g., ternary metal alloys).
Expected Improvement with "Cooling" (EI-C)	ξ (exploration factor), decay schedule	Starts with high ξ (explorative), decays over iterations to ξ=0 (pure EI).	Sequential optimization of reaction conditions (Temp, pH, time) where a rough optimum is unknown.
Noisy Expected Improvement (NEI)	σ²ₙ (noise variance)	Integrates over posterior uncertainty, smoothing EI landscape.	Optimization with high experimental noise (e.g., heterogeneous catalysis yield measurements).
q-Expected Improvement (qEI)	q (batch size)	Computes EI for a batch of q points, considering joint posterior.	Parallel high-throughput experimentation of catalyst libraries.
Add-ε-Greedy	ε (probability)	With probability ε, ignore EI and pick a random point from unexplored space.	Ensuring coverage of discontinuous catalyst design spaces (e.g., switching ligand classes).

3. Experimental Protocol: Iterative Catalyst Optimization with EI-C Objective: To optimize the composition of a Pd-Au-X (X = dopant metal) nanoparticle catalyst for selective hydrogenation without stagnating.

Protocol 3.1: Initial Design & Model Setup

Design Space Definition: Define a continuous 3D composition space: Pd (0-90%), Au (10-100%), X (0-20%), summing to 100%.
Initial DoE: Perform a space-filling design (e.g., Sobol sequence) for 20 initial catalyst syntheses.
High-Throughput Experimentation:
- Synthesize nanoparticles via automated microfluidic precipitation.
- Characterize via inline EDX for composition verification.
- Test catalytic performance in a parallelized microreactor array; measure yield (Y%) of target product as primary objective.
Surrogate Model Initialization: Train a Gaussian Process (GP) model using the initial 20 data points, with a Matérn 5/2 kernel.

Protocol 3.2: Adaptive Loop with EI-C

Acquisition Function Setup: Define EI-C: αEI-C(x; t) = αEI(x; ξₜ). Set initial ξ₀ = 0.1. Define decay: ξₜ = ξ₀ * γ^t, with γ = 0.95 (halving every ~13 iterations).
Iteration Cycle (for 50 iterations): a. Optimize αEI-C: Find the composition x* that maximizes αEI-C given the current GP model and ξₜ. b. Experimental Evaluation: Synthesize and test catalyst at composition x* (as per Protocol 3.1, step 3). c. Model Update: Augment training data with (x*, Y%) and re-train the GP model. d. Parameter Decay: Update t and compute ξₜ for the next iteration.
Stagnation Monitoring: Track the improvement over the last 10 iterations. If the best observed yield changes by <0.5%, trigger an "exploration burst" by temporarily setting ξₜ back to ξ₀ for one iteration.

4. Visualization of the Adaptive Optimization Workflow

Diagram Title: Adaptive EI-C Workflow for Catalyst Discovery

5. The Scientist's Toolkit: Key Research Reagent Solutions Table 2: Essential Materials for High-Throughput Catalyst Optimization

Item / Reagent	Function in Protocol	Example Product / Specification
Microfluidic Synthesis Platform	Enables precise, automated synthesis of nanoparticles with controlled composition gradients.	Dolomite Microfluidic System with 3+ reagent inputs.
Parallel Microreactor Array	Allows simultaneous catalytic testing of multiple compositions under identical conditions.	HTE PharmaCat 8-channel packed-bed reactor.
Metal Precursor Libraries	Standardized solutions for high-throughput impregnation/co-precipitation.	Sigma-Aldrich Combinatorial Catalyst Kits (Pd, Au, Pt, etc., in DMSO).
Solid Supports	High-surface-area, consistent supports for heterogeneous catalyst libraries.	Grace Davison SiO₂ or Al₂O³ 96-well plate format.
In-Line Analytics (EDX)	Provides immediate composition verification post-synthesis.	Oxford Instruments X-MaxN 20 mm² detector in SEM configuration.
GPyOpt or BoTorch	Software libraries for implementing Bayesian Optimization with custom acquisition functions like EI-C.	GPyOpt (Python) for prototyping; BoTorch for advanced, GPU-accelerated workflows.
Ligand Library (for homogeneous)	Diverse ligand sets for exploring coordination chemistry space.	Aldrich MettLSet or Strem Ligand Kits.

Within the broader thesis on acquisition functions for catalyst composition selection in drug development, this document details the critical application of hyperparameter tuning for Bayesian Optimization (BO). The Expected Improvement (EI) acquisition function's performance is contingent on two core elements: the fidelity of the Gaussian Process (GP) surrogate model and the balance of its exploration-exploitation trade-off parameter (ξ). This protocol provides a standardized methodology for researchers to systematically optimize these hyperparameters, thereby accelerating the discovery of novel catalytic materials for pharmaceutical synthesis.

Foundational Concepts & Quantitative Data

Key Hyperparameters in Bayesian Optimization

The efficacy of the EI-driven search process is governed by several tunable hyperparameters. Their roles and typical value ranges are summarized below.

Table 1: Core Hyperparameters for GP Surrogate and EI Acquisition Function

Hyperparameter	Symbol	Scope	Function	Typical Range/Common Values
Kernel Length Scale	l	Surrogate (GP Kernel)	Controls the smoothness of the GP; defines the distance over which points influence each other.	(0.1, 10.0) – Optimized via MLE
Kernel Variance	σ²	Surrogate (GP Kernel)	Scales the amplitude of the GP function.	(0.1, 10.0) – Optimized via MLE
Noise Variance	σₙ²	Surrogate (GP Likelihood)	Represents observation noise (e.g., experimental error).	Fixed or tuned (e.g., 1e-6 to 0.1)
EI Trade-off (xi)	ξ	Acquisition (EI)	Balances exploration (higher ξ) vs. exploitation (lower ξ).	Default=0.01, Common Range: [0.001, 0.1]
GP Mean Prior	μ	Surrogate (GP)	Prior belief about the mean of the objective function.	Often set to a constant (e.g., zero mean).

Quantitative Impact of ξ on Optimization Performance

Recent simulation studies on benchmark functions (e.g., Branin, Hartmann 6D) illustrate the performance variance induced by ξ.

Table 2: Simulated Optimization Performance vs. ξ Value (After 50 Iterations)

Benchmark Function	Dimension	Optimal ξ (Found)	Regret vs. ξ=0.01	Notes
Branin	2D	0.05	-15% lower regret	Lower ξ converged prematurely.
Hartmann	6D	0.001	+5% lower regret	High ξ led to excessive exploration.
Catalyst Yield Simulator*	4D	0.03	-22% lower regret	Represents composition space search.

*Simulated catalyst space: Metal (Pd, Pt, Ni), Ligand (PPh3, BINAP, XPhos), Temperature (50-150°C), Pressure (1-10 atm).

Experimental Protocols

Protocol A: Tuning GP Kernel Hyperparameters via Maximum Likelihood Estimation (MLE)

Objective: To fit the GP surrogate model optimally to the existing observation data from catalyst screening experiments. Materials: Historical dataset of catalyst compositions (features) and corresponding performance metrics (e.g., yield, enantiomeric excess). Procedure:

Preprocessing: Standardize feature vectors (e.g., metal type encoded, ligand concentration scaled) and center the target values.
Initialize Kernel: Select a Matérn 5/2 or Radial Basis Function (RBF) kernel. Initialize length scales (l) to the median pairwise distance between data points and signal variance (σ²) to the variance of the target values.
Define Log Marginal Likelihood (LML): Construct the LML function: log p(y\|X) = -½ yᵀ K⁻¹ y - ½ log\|K\| - (n/2) log(2π), where K = K(X,X) + σₙ²I.
Optimize: Use a gradient-based optimizer (e.g., L-BFGS-B) to maximize the LML with respect to the kernel hyperparameters (l, σ²) and the noise variance (σₙ²). Set bounds to prevent over/under-fitting (e.g., l: [1e-5, 1e5], σ²: [1e-5, 1e5], σₙ²: [1e-10, 1e-1]).
Validation: Perform k-fold cross-validation (k=5) using the tuned hyperparameters. Calculate the standardized mean squared error (SMSE) on held-out folds. Repeat optimization if SMSE > 0.5.

Protocol B: Optimizing EI's ξ via Multi-Armed Bandit (MAB) Simulation

Objective: To dynamically select the optimal ξ value for each BO iteration based on simulated performance. Materials: Current GP model, set of candidate ξ values (e.g., [0.001, 0.01, 0.03, 0.05, 0.1]). Procedure:

Initialize: Assign a prior performance score (e.g., expected cumulative regret) to each candidate ξ. Set all scores to zero.
At each BO iteration: a. Simulate: For each candidate ξᵢ, simulate the next step without performing the actual experiment: i. Compute EI using the current GP and ξᵢ. ii. Select the point x* that maximizes EI. iii. Query a stochastic simulation or an ensemble GP prediction at x* to get a simulated reward (e.g., predicted yield with added noise). b. Update Score: Update the performance score for each ξᵢ using a reward function (e.g., immediate improvement over current best).
Select ξ: Use an Upper Confidence Bound (UCB) policy on the scores to select the ξ for the actual experiment in that iteration. This balances trying promising ξ values (exploitation) and testing less-used ones (exploration).
Iterate: Update the GP with the real experimental result using the selected ξ. Repeat from Step 2.

Protocol C: Cross-Validation for Hierarchical Hyperparameter Tuning

Objective: To jointly assess the performance of (GP hyperparameters, ξ) pairs on historical data. Procedure:

Temporal Split: For a time-series dataset of catalyst experiments, create 5 train-validation splits where the validation set chronologically follows the training set.
Inner Loop (Train): For each split and each hyperparameter combination (e.g., kernel type, ξ), train the GP on the training data (using MLE for kernel parameters).
Simulate BO (Validate): Starting from the GP trained on the training set, simulate 20 sequential steps of BO on the validation set using the fixed ξ. At each step, propose a point via EI, and "observe" its actual value from the held-out validation data.
Metric: Record the cumulative regret or the best performance found after the 20 simulated steps.
Outer Loop: Select the hyperparameter combination that yields the lowest average cumulative regret across all 5 splits.

Visualizations

Diagram 1: Hyperparameter Tuning Workflow for Catalyst BO

Diagram 2: EI Trade-off Parameter (ξ) Effect Logic

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials and Computational Tools

Item / Solution	Function in Hyperparameter Tuning & Catalyst BO	Example / Specification
Bayesian Optimization Library (e.g., BoTorch, GPyOpt)	Provides the core framework for implementing GP models, acquisition functions (EI), and optimization loops.	BoTorch (PyTorch-based) with support for advanced tuning and parallelization.
High-Throughput Experimentation (HTE) Robotic Platform	Automates the synthesis and screening of catalyst compositions, generating the high-quality data required for GP training.	Chemspeed Technologies SWING or Unchained Labs Little Benchtop Robot.
Gaussian Process Regression Software (e.g., GPy, scikit-learn)	Used for building and tuning the surrogate model. Critical for implementing Protocol A (MLE).	GPy with Matérn kernel and built-in gradient optimization.
Statistical Simulation Environment (e.g., NumPy, SciPy)	Enables the execution of Protocol B (MAB simulation) and Protocol C (cross-validation) through custom scripting.	SciPy for optimization and statistical distributions.
Hyperparameter Optimization Suite (e.g., Optuna, Ray Tune)	Alternative/complementary tool for automating Protocol C, especially for large hierarchical hyperparameter spaces.	Optuna with TPESampler for efficient search.
Standardized Catalyst Precursor Libraries	Well-characterized, stable sources of metal salts, ligands, and substrates ensure experimental consistency for BO iterations.	Sigma-Aldrich organometallic portfolio, Strem Chemicals ligand kits.

Benchmarking EI Against Alternatives: A Data-Driven Analysis for Catalysis Research

Application Notes In the research thesis "Acquisition Functions for Expected Improvement in Catalyst Composition Selection," the optimization of high-throughput experimental (HTE) campaigns for heterogeneous catalyst discovery is paramount. The performance of different acquisition functions (AFs) within a Bayesian Optimization (BO) framework is rigorously evaluated using three core metrics: Sample Efficiency, Convergence Speed, and Best-Discovered Value. These metrics quantitatively assess an AF's ability to guide experiments toward high-performance catalyst compositions with minimal resource expenditure.

Sample Efficiency measures the number of experiments (samples) required to achieve a performance target (e.g., yield >80%, TON >1000). An AF with high sample efficiency finds good candidates faster, reducing costly synthesis and testing cycles.
Convergence Speed quantifies the rate at which the optimization process plateaus, indicating how quickly the AF exhausts potential improvements in the search space. It is often analyzed by plotting the best-discovered value against iteration number.
Best-Discovered Value is the ultimate performance metric, recording the highest catalytic activity (e.g., turnover frequency, selectivity) found by the end of the optimization budget. It defines the success of the campaign.

The trade-offs between these metrics are critical. An AF may converge rapidly to a good solution but miss a globally superior composition (exploitation vs. exploration). The following protocols and data frameworks standardize this evaluation for catalyst discovery.

Quantitative Data Summary

Table 1: Comparative Performance of Acquisition Functions in Simulated Catalyst Optimization

Acquisition Function	Avg. Samples to Target (Yield >85%)	Avg. Convergence Iteration	Avg. Best-Discovered Yield (%)	Std. Dev. (Best Yield)
Expected Improvement (EI)	24	38	88.7	±1.2
Upper Confidence Bound (UCB, κ=0.5)	19	42	87.9	±2.1
Probability of Improvement (PI)	31	29	86.1	±0.8
Thompson Sampling (TS)	22	47	89.5	±3.5
Random Sampling (Baseline)	65	N/A (No Convergence)	82.3	±4.7

Table 2: Key Catalyst Performance Metrics from an Experimental BO Campaign

Candidate ID (Composition)	Synthesis Cycle	Yield (%)	Selectivity (%)	TOF (h⁻¹)	Stability (h)
Pt₃Co₁/SiO₂ (EI, Iter. 12)	3	78.5	95.2	1200	48
Pd₁Au₂/TiO₂ (UCB, Iter. 18)	4	85.6	89.7	980	72
IrFe₁₀Ce₀.₅/Al₂O₃ (EI, Iter. 29)	5	92.3	97.8	2100	100+
RuCu₅/MgO (TS, Iter. 25)	4	88.1	90.5	1500	60

Experimental Protocols

Protocol 1: Benchmarking Acquisition Functions via Simulation

Define Search Space: Parameterize catalyst composition as a continuous multi-element system (e.g., ratios of Pt, Pd, Co, Ni on Al₂O₃) and discrete support types.
Simulator: Use a calibrated proxy model (e.g., a known volcano relationship or a hidden Gaussian Process) that maps composition to a simulated performance metric (e.g., adsorption energy, reaction rate).
Initialize: Randomly sample 5-10 initial compositions to build a prior Gaussian Process (GP) model.
Optimization Loop: For 50 iterations: a. Fit the GP model to all observed data. b. Compute the candidate acquisition function (EI, UCB, PI, TS) over a fine grid of the search space. c. Select the composition maximizing the AF. d. Query the simulator to obtain the performance value (with optional added noise). e. Record the iteration count and update the "best-discovered value."
Metric Calculation: For 20 independent runs, calculate: i) median samples to reach 95% of the maximum simulated performance, ii) iteration where the 5-step moving average improvement falls below 0.5%, and iii) final best value.

Protocol 2: Experimental Validation for Catalyst Selection

Library Design: Define a compositional alloy library (e.g., Pd-Au-Ru ternary system) via co-impregnation on a fixed support.
High-Throughput Synthesis: Use an automated liquid-handling robot to prepare precursor solutions and impregnate a 96-well microreactor array. Followed by calcination and reduction protocols.
Performance Screening: Employ a parallel pressure reactor system for testing catalytic activity (e.g., CO oxidation conversion at 150°C).
Bayesian Optimization Setup: a. Cycle 0: Characterize 12 random compositions from the library. b. Modeling: Fit a GP model using composition descriptors (e.g., elemental ratios, physicochemical features) and activity data. c. Acquisition: Compute Expected Improvement (EI) for all unexplored compositions in the predefined library. d. Selection: Synthesize and test the top 4 compositions suggested by EI. e. Iteration: Repeat steps b-d for 8-10 cycles.
Validation: Scale-up and perform detailed kinetics and stability testing on the top 3 catalysts identified by the BO process.

Visualizations

BO Workflow for Catalyst Optimization

Performance Metric Trade-offs & AF Alignment

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for High-Throughput Catalyst Optimization

Item	Function/Application in Protocol
Precursor Salt Library (e.g., Pt(NH₃)₄(NO₃)₂, HAuCl₄·3H₂O, RuCl₃·xH₂O)	Provides metal sources for automated, reproducible synthesis of diverse catalyst compositions via liquid handling.
Modular High-Throughput Microreactor (e.g., 96-well reactor block)	Enables parallel synthesis, treatment (calcination/reduction), and initial activity screening of catalyst libraries.
Automated Liquid Handling Robot	Precisely dispenses microliter volumes of precursor solutions for library generation, ensuring consistency and enabling DOE/BO.
Parallel Pressure Reactor System	Allows simultaneous catalytic testing (e.g., hydrogenation, oxidation) of multiple candidates under controlled conditions (T, P, flow).
Gas Chromatography (GC) System with Multiport Stream Selector	Provides rapid, quantitative analysis of reaction product streams from parallel reactors to measure conversion and selectivity.
Bayesian Optimization Software (e.g., GPyOpt, BoTorch, custom Python scripts)	Core platform for implementing Gaussian Process models, acquisition functions (EI, UCB), and managing the optimization loop.
Physicochemical Descriptor Database (e.g., atomic radii, electronegativity, bulk modulus for elements)	Used to featurize catalyst compositions for the GP model, incorporating domain knowledge beyond simple ratios.

1. Introduction: Acquisition Functions in Catalyst Optimization Within the thesis investigating acquisition functions for Expected Improvement (EI)-guided high-throughput experimentation in catalyst composition selection, a critical comparison arises: EI versus the Probability of Improvement (PI). While both guide Bayesian optimization (BO) by quantifying the potential utility of evaluating a candidate point, their fundamental philosophies differ, particularly regarding risk and search conservatism. This protocol details their application, comparison, and selection for materials discovery campaigns where experimental cost is high and a conservative, robust search strategy may be preferred.

2. Quantitative Comparison: EI vs. PI The core mathematical formulations and behavioral tendencies of EI and PI are summarized below.

Table 1: Mathematical Definitions and Behavioral Traits of EI and PI

Feature	Expected Improvement (EI)	Probability of Improvement (PI)
Mathematical Definition	$\text{EI}(x) = \mathbb{E}[\max(0, f(x) - f(x^+))]$	$\text{PI}(x) = \Phi\left(\frac{\mu(x) - f(x^+) - \xi}{\sigma(x)}\right)$
Key Parameters	Incumbent best observation $f(x^+)$; Predictive mean $\mu(x)$; Predictive std. dev. $\sigma(x)$.	Incumbent $f(x^+)$; $\mu(x)$; $\sigma(x)$; Exploration parameter $\xi$.
Utility Metric	Magnitude of potential improvement.	Likelihood of any improvement.
Risk Sensitivity	Balances exploration (high $\sigma$) and exploitation (high $\mu$).	Primarily exploits areas with high probability of beating incumbent.
Search Character	Less conservative; more likely to explore uncertain regions with high potential reward.	More conservative; focuses on "sure bets" near the current best.
Response to Noise	Generally more robust via explicit improvement magnitude weighting.	Can be overly greedy; sensitive to noise in estimating $f(x^+)$.

Table 2: Simulated Catalyst Yield Optimization Results (Synthetic Data)

Acquisition Function	Avg. Best Yield after 50 Iterations (%)	Avg. Failures (Yield < 5%)	Iterations to Reach 90% of Max Yield
Expected Improvement (EI)	94.2 ± 3.1	2.1 ± 0.8	18.7 ± 4.2
Probability of Improvement (PI, $\xi=0.01$)	88.5 ± 5.6	1.8 ± 0.9	28.4 ± 6.9
Random Search	72.3 ± 8.9	15.3 ± 4.2	>50

3. Experimental Protocol: Comparing EI and PI for Catalyst Screening

Protocol 3.1: In-Silico Benchmarking Workflow Objective: To quantitatively compare the performance and risk profiles of EI and PI acquisition functions for directing a catalyst discovery campaign. Inputs: A pre-trained surrogate model (e.g., Gaussian Process) on an initial dataset of catalyst descriptors (e.g., elemental ratios, synthesis conditions) and activity/yield. Procedure: 1. Initialize: Define search space (compositional library). Set incumbent $f(x^+)$ from initial data. 2. Model Update: Fit/update the surrogate model to all observed data. 3. Acquisition Calculation: For all candidate points in the search space: a. Compute posterior predictive mean $\mu(x)$ and standard deviation $\sigma(x)$. b. Calculate $\text{EI}(x)$ and $\text{PI}(x)$ (with $\xi=0.01$). 4. Selection: Select the candidate with the maximum EI or PI score. 5. "Experimental" Evaluation: Query the ground-truth function (or high-fidelity simulation) for the selected candidate to obtain its yield. 6. Iterate: Append new data. Repeat steps 2-5 for a fixed budget (e.g., 50 iterations). 7. Analysis: Plot best-found-yield vs. iteration. Record failures (yield below a safety threshold).

Protocol 3.2: High-Throughput Experimental Validation for Conservative Search Objective: To experimentally validate PI-driven search for identifying a robust, high-performance catalyst with minimal low-performance experiments. Materials: (See Scientist's Toolkit). Procedure: 1. Design of Experiment (DoE): Use a PI-driven BO platform (e.g., in-house Python script coupled to a robotic platform). 2. Parameter Setting: Set acquisition function to PI with a moderate $\xi$ (e.g., 0.05) to encourage slight exploration. Set a high penalty for predicted yield below a viability threshold (e.g., 10%). 3. Robotic Execution: a. The BO algorithm outputs the top 5 candidate catalyst compositions for the next batch. b. A liquid-handling robot prepares precursors on a multi-well catalyst testing plate. c. The plate undergoes automated pyrolysis/calcination. d. A catalytic testing reactor evaluates all candidates in parallel for target reaction (e.g., CO₂ hydrogenation). e. Online GC-MS quantifies yield/conversion for each well. 4. Closed-Loop Learning: Experimental results are automatically fed back to update the BO model. The PI function selects the next batch. 5. Stopping Criterion: Proceed until no candidate has PI > 0.8 for 3 consecutive batches.

4. Visualization of BO Workflows and Function Behavior

Title: EI vs PI Bayesian Optimization Workflow

Title: How EI and PI Weight Posterior Information

5. The Scientist's Toolkit: Key Research Reagents & Platforms

Table 3: Essential Materials for Acquisition Function-Guided Catalyst Discovery

Item	Function/Role
Automated Liquid Handling Robot	Enables precise, reproducible dispensing of precursor solutions for high-throughput catalyst library synthesis.
Multi-Well Microreactor Array	Parallelizes catalyst testing under controlled temperature/pressure, generating the data points for BO.
Online Gas Chromatograph-Mass Spectrometer (GC-MS)	Provides rapid, quantitative yield and selectivity data for each catalyst candidate, essential for fast feedback.
BO Software Platform (e.g., BoTorch, GPyOpt)	Provides the algorithmic backbone for implementing and comparing EI, PI, and other acquisition functions.
Catalyst Precursor Library	Comprehensive set of metal salts, ligands, and support materials defining the compositional search space.
Gaussian Process Modeling Software	Constructs the surrogate model that predicts catalyst performance and its uncertainty from descriptors.

Introduction Within the thesis exploring acquisition function-driven catalyst selection for accelerated drug development, a central tenet is the algorithmic philosophy governing iterative experimentation. Two dominant paradigms are Expected Improvement (EI) and Upper Confidence Bound (UCB/Gaussian Process-Upper Confidence Bound). EI embodies "pure optimism," focusing solely on the probability-weighted benefit exceeding the current best. In contrast, UCB enforces a "balanced improvement" through an explicit exploration-exploitation trade-off. This protocol delineates their application in high-throughput catalyst screening for synthetic pathways critical to Active Pharmaceutical Ingredient (API) manufacturing.

1. Quantitative Comparison of Acquisition Functions

Table 1: Core Formulae and Characteristics in Catalyst Optimization Context

Feature	Expected Improvement (EI)	Gaussian Process-Upper Confidence Bound (GP-UCB)
Mathematical Formulation	( EI(x) = \mathbb{E}[max(0, f(x) - f(x^+))] )	( UCB(x) = \mu(x) + \beta_t \sigma(x) )
Core Philosophy	Pure Optimism: Improves the best-known outcome.	Balanced Improvement: Explicit exploration vs. exploitation.
Exploration Driver	Implicit, via probability density of improvement.	Explicit, controlled by parameter (\beta_t) and (\sigma(x)).
Key Parameter(s)	Incumbent value (f(x^+)); noise parameter (\xi).	Sequence (\beta_t); typically theoretically scheduled.
Response to Noise	Moderately robust; can be tuned via (\xi).	Can be sensitive; requires careful (\beta_t) calibration.
Typical Use Case	Rapidly finding high-performance catalyst with fewer "good" samples.	Thoroughly mapping performance landscape, avoiding local optima.

Table 2: Performance Metrics from Simulated Ligand Screening Campaign Simulation based on a 5-dimensional descriptor space for Pd-based cross-coupling catalysts targeting yield (%) over 50 sequential experiments.

Metric	EI Strategy	GP-UCB Strategy ((\beta_t=0.5))	GP-UCB Strategy ((\beta_t=2.0))
Max Yield Found at Iteration 50	94.2%	91.5%	93.8%
Iteration to First >90% Yield	18	32	22
Cumulative Regret (Lower is Better)	142.7	189.3	153.1
Avg. Exploitation Score (µ(x))	High (0.89)	Very High (0.95)	Moderate (0.72)
Avg. Exploration Score (σ(x))	Low (0.11)	Very Low (0.05)	High (0.28)

2. Experimental Protocols for Catalyst Selection

Protocol 2.1: High-Throughput Reaction Screening with Bayesian Optimization Backbone Objective: To identify an optimal Buchwald-Hartwig amination catalyst system using EI and GP-UCB acquisition functions sequentially.

Descriptor Space Definition: Encode candidate catalysts (PhenPd G3, RuPhos Pd G3, etc.) and bases (KOH, Cs2CO3, t-BuONa) via molecular fingerprints (ECFP4) and physicochemical property vectors.
Initial Design: Perform 16 initial experiments via Latin Hypercube Sampling across the defined parameter space, measuring reaction yield via UPLC.
GP Model Training: Train a Gaussian Process model with a Matérn kernel on cumulative data, modeling yield as a function of the descriptor vector.
Acquisition & Selection:
- EI Path: Compute EI over a candidate set. Select the catalyst system with maximum EI for the next experiment.
- UCB Path: Compute UCB with scheduled (\beta_t = 0.2 \times \sqrt{2 \log(t^2)}). Select the catalyst system with maximum UCB.
Iterative Loop: Run the selected reaction, acquire yield data, update the GP model, and repeat from step 4 for 40 iterations.
Validation: Synthesize and test the top 3 candidates from each path in triplicate at 0.1 mmol scale to confirm performance.

Protocol 2.2: Assessing Catalyst Robustness via Multi-Objective Acquisition Objective: To balance reaction yield and robustness (quantified by byproduct percentage) using a modified acquisition framework.

Objective Measurement: For each experiment, record primary product yield and total byproduct area percentage from UPLC-MS.
GP Modeling: Construct two independent GP models: (GP{yield}) and (GP{byproduct}).
Multi-Acquisition Strategy:
- Compute (EI{yield}(x)) and (EI{robustness}(x) = EI) for negative byproduct percentage.
- Compute a composite score: (EI{composite}(x) = \alpha \cdot EI{yield}(x) + (1-\alpha) \cdot EI{robustness}(x)), where (\alpha) is a weighting factor (e.g., 0.7).
- For UCB, define (UCB{composite}(x) = \alpha \cdot (\muy(x)+\beta\sigmay(x)) - (1-\alpha) \cdot (\mub(x)+\beta\sigmab(x))).
Selection & Validation: Proceed with iterative selection using the composite score. Validate top candidates under stressed conditions (varied stoichiometry, temperature).

3. Visualizing the Decision Pathways

Title: EI vs UCB Iterative Screening Workflow

Title: Bayesian Catalyst Selection Logic Map

4. The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for Acquisition-Driven Catalyst Screening

Item	Function in Protocol	Example/Supplier Note
Pd-Precatalyst Kits	Provides diverse, well-defined catalyst starting points for descriptor encoding.	e.g., Sigma-Aldrich Pd Precatalyst Kit; BroadPharm Pd Phosphine Complexes.
High-Throughput Screener	Enables parallel execution of reaction permutations.	Unchained Labs Bigfoot; Chemspeed Technologies SWING.
Automated UPLC/MS	Rapid analysis for yield and byproduct quantification, generating primary objective data.	Waters Acquity UPLC with QDa; Agilent InfinityLab with MSD.
Chemical Descriptor Software	Generates numerical feature vectors (e.g., ECFP4, physiochemical properties) for GP input.	RDKit (Open Source); Schrödinger Canvas.
Bayesian Optimization Platform	Implements GP regression, EI, UCB acquisition, and iterative decision logic.	Custom Python (GPyTorch, BoTorch); Gryffin (OS).
Stability-Tested Solvents/Reagents	Ensures reproducibility in long, automated screening campaigns.	e.g., Sigma-Aldrich Anhydrous Solvents in Sure/Seal bottles.

EI vs. Knowledge Gradient and Entropy Search for Informative Experimentation

In catalyst composition research for drug development, Bayesian optimization (BO) accelerates the discovery of optimal formulations by sequentially selecting informative experiments. This protocol compares three prominent acquisition functions (AFs)—Expected Improvement (EI), Knowledge Gradient (KG), and Entropy Search (ES)—framed within a thesis on acquisition functions for catalyst selection. Each AF strategically balances exploration and exploitation to maximize catalytic yield or efficiency.

The search for novel catalyst compositions, such as those involving precious metals (e.g., Pd, Ru) or organocatalysts, is resource-intensive. BO models an objective function (e.g., reaction yield) and uses an AF to select the next experiment. EI focuses on immediate gains, KG on the final belief state's value, and ES on reducing uncertainty about the optimum's location. This document provides application notes and protocols for their implementation in high-throughput experimentation (HTE) workflows.

Comparative Analysis of Acquisition Functions

Theoretical Foundation & Quantitative Comparison

Table 1: Core Characteristics of EI, KG, and Entropy Search

Feature	Expected Improvement (EI)	Knowledge Gradient (KG)	Entropy Search (ES)
Core Principle	Maximizes expected gain over current best observation.	Maximizes expected value of the posterior after measurement.	Maximizes reduction in entropy of the posterior distribution over the optimum.
Exploration Tendency	Moderate; depends on incumbent point.	High; can evaluate non-optimal points for information gain.	Very High; explicitly targets uncertainty reduction.
Computational Cost	Low (closed-form for Gaussian).	Moderate to High (requires nested optimization).	High (requires approximating posterior over optimum).
Handling of Noise	Good with noisy evaluations.	Excellent; incorporates noise model directly.	Good; can be extended to noisy settings (e.g., PES).
Primary Citation	Jones et al. (1998)	Frazier et al. (2008)	Hennig & Schuler (2012)

Table 2: Performance Metrics in Simulated Catalyst Screening

Acquisition Function	Average Simple Regret (↓)	Inference Time per Iteration (ms) (↓)	Steps to Find Optimum (↓)
Expected Improvement	0.12 ± 0.05	50 ± 10	22 ± 4
Knowledge Gradient	0.08 ± 0.03	320 ± 45	18 ± 3
Entropy Search	0.09 ± 0.04	580 ± 120	16 ± 3

(Note: Simulated data for a 10-dimensional catalyst composition space. Lower values are better. Results are illustrative.)

Diagram: Logical Decision Flow for AF Selection

AF Selection Decision Flow

Experimental Protocols

Protocol: High-Throughput Screening of Pd Ligand Combinations Using EI

Objective: To optimize Pd catalyst ligand and additive composition for a C-N cross-coupling reaction yield.

Workflow Diagram:

EI-Driven Catalyst Screening Workflow

Procedure:

Design Space Definition: Define continuous variables (e.g., ligand molar ratio (0-10%), additive concentration (0-20 mol%)) and discrete variables (ligand class).
Initial Data Collection: Perform 20 initial experiments using a space-filling design (e.g., Latin Hypercube).
Iterative BO Loop: a. Model Training: Fit a GP model with a Matérn kernel to the collected yield data. b. EI Computation: Calculate EI for all candidate compositions in a discretized space. For a candidate point x, EI(x) = E[max(f(x) - f, 0)], where f is the current best yield. c. Selection: Choose the next batch of 4 experiments that maximize EI. d. Execution: Perform reactions in parallel HTE reactors under inert atmosphere. e. Analysis: Quantify yield via UPLC-MS with an internal standard.
Convergence: Halt after 10 iterations or when EI < 0.5% yield improvement for 3 consecutive rounds.
Validation: Confirm optimal composition with triplicate experiments.

Protocol: Knowledge Gradient for Noisy Biocatalyst Activity Screening

Objective: To find the pH and temperature maximizing the activity of an immobilized enzyme, where activity measurements have high experimental noise.

Procedure:

Setup: Use a microplate reader to assay enzyme activity (fluorescence product formation).
KG Iteration: a. Posterior Mean: Given GP posterior mean μn(x) and variance σn²(x) after n experiments. b. KG Value Calculation: For each candidate x, compute KGn(x) = En[μ{n+1}(x*{n+1}) - μn(x*n) | x{n+1}=x], where x*n is the current recommendation. Use one-step look-ahead simulation with noise variance σε². c. Experiment Selection: Choose x that maximizes KGn(x). d. Execution & Measurement: Perform the activity assay at the selected condition in quadruplicate.
Stopping: Terminate when KG_n(x) < 1% of the current mean activity.

Protocol: Entropy Search for Multi-Objective Catalyst Discovery

Objective: To identify catalyst compositions (e.g., mixed metal ratios) that optimally trade off between yield and selectivity, finding the Pareto front.

Diagram: ES Information Gain Concept

Entropy Search Information Gain

Procedure:

Multi-Objective Modeling: Build independent GP models for Yield and Selectivity.
Pareto Front Approximation: Define the optimum as the Pareto-optimal set.
ES Implementation: a. Sample Optimal Sets: Use Thompson sampling to draw samples from the posterior over the Pareto set. b. Compute Entropy Reduction: Approximate the expected change in entropy of the distribution over the optimal set for candidate points. c. Selection: Choose the experiment maximizing this expected reduction.
Iteration: Continue for a fixed budget (e.g., 50 experiments) to map the Pareto front.

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for Catalytic BO Experiments

Item & Example Product	Function in Experiment
Parallel Mini-Reactor System(e.g., Unchained Labs Little Sister)	Enables high-throughput, parallel synthesis of catalyst compositions under controlled conditions (temp, stirring).
Automated Liquid Handler(e.g., Hamilton Microlab STAR)	Precisely dispenses microliter volumes of ligand, metal precursor, and additive solutions for reproducible composition gradients.
UPLC-MS System(e.g., Waters Acquity UPLC-QDa)	Provides rapid, quantitative analysis of reaction yield and conversion for feedback to the BO model.
Gaussian Process Software(e.g., BoTorch, GPyOpt)	Implements the surrogate model (GP) and acquisition functions (EI, KG, ES) for computational experiment selection.
Catalyst Precursor Library(e.g., Pd(OAc)₂, RuPhos, Diverse Amine Bases)	A curated collection of reagents that define the compositional search space for optimization.
Internal Standard(e.g., Tridecane for GC, 3-Bromoanisole for UPLC)	Ensures quantitative accuracy in analytical measurements, critical for reliable model training.

Application Notes on EI for Catalyst Composition Selection

Within the broader thesis on acquisition function-driven optimization for high-throughput catalyst discovery, Expected Improvement (EI) serves as a critical Bayesian Optimization (BO) component. Its performance is highly dependent on the nature of the search space and the fidelity of the data source. These notes synthesize recent benchmark findings to guide researchers in deploying EI for catalyst composition selection across both simulated and experimental campaigns.

Table 1: Summary of Benchmark Performance of Expected Improvement (EI)

Benchmark Type	Dataset/Catalyst System	Key Performance Metric (EI vs. Baseline)	Scenario Where EI Excels	Scenario Where EI Falls Short
Synthetic (Simulation)	Branin Function	Log10 Minimum Regret: -2.1 (EI) vs -1.5 (Random)	Low-dimensional (2-10D), noise-free, computationally expensive black-box functions.	High-dimensional (>20D) spaces; highly multi-modal landscapes with flat regions.
Synthetic (Simulation)	Himmelblau Function	Iterations to Optima: 12 (EI) vs 45 (Random)	Efficient navigation of separated local minima with moderate dimensionality.	When the surrogate model (e.g., GP) is misspecified for the underlying function.
Real-World (Experimental)	Heterogeneous Pd-Au-Pt Nano-Alloy for Oxidation	Yield at 20 Experiments: 92% (EI-BO) vs 78% (Grid Search)	Small experimental budgets (<50 trials); optimizing a primary objective (e.g., yield) with continuous variables.	With severe measurement noise or catalytic deactivation, leading to model confusion.
Real-World (Experimental)	MOF Catalyst for CO2 Reduction	Discovered Best Performance in 15% Fewer Synthesis Cycles	Exploratory phases for novel composition spaces where prior data is sparse.	When constrained by complex, coupled categorical variables (e.g., ligand type, synthesis method).
Real-World (High-Throughput)	Perovskite OER Catalyst Library	Failed to Outperform Simple Expected Information Gain (EIG)	Balanced trade-off between exploration and exploitation is required.	Pure exploitation settings; fails when parallelizing experiments (classic EI is sequential).

Detailed Experimental Protocols

Protocol 1: Benchmarking EI on Synthetic Catalyst Landscapes

Objective: To evaluate EI's convergence efficiency on known mathematical surfaces simulating catalyst performance (e.g., activity as a function of composition ratios).
Materials: High-performance computing cluster; Python environment with libraries: scikit-optimize, GPy, numpy.
Procedure: a. Function Definition: Select benchmark functions (e.g., Ackley, Michalewicz) to act as synthetic "catalyst performance models." b. Initial Design: Generate an initial training set of 5 points via Latin Hypercube Sampling within the defined variable bounds. c. BO Loop: For n = 50 iterations: i. Train a Gaussian Process (GP) surrogate model on all observed data. ii. Calculate the EI acquisition function over a discretized search space. iii. Select the next point where EI is maximized. iv. Query the synthetic function at this point (simulate an experiment). v. Record the "best performance found so far." d. Analysis: Plot the convergence curve (best value vs. iteration) averaged over 20 random seeds. Compare against Random Search and Upper Confidence Bound (UCB).

Protocol 2: Real-World Validation on Bimetallic Nanoparticle Optimization

Objective: To experimentally optimize the photocatalytic H2 production rate of a bimetallic Pd-X nanoparticle library using an EI-driven autonomous workflow.
Materials: Automated liquid handling robot; microplate photoreactor; GC-MS for product quantification; metal precursor solutions; stabilizing ligands; high-throughput UV-Vis spectrometer.
Procedure: a. Variable Definition: Define two continuous variables: Pd molar fraction (0.1-0.9) and reduction temperature (50-150°C). Define one categorical variable: secondary metal (Cu, Au, Ni). b. High-Throughput Experimentation: i. The BO algorithm (using EI) proposes a batch of 4 catalyst compositions/conditions. ii. The robotic platform executes the synthesis: precise mixing of precursors, automated transfer to a solvothermal reactor block, and temperature control. iii. In-situ UV-Vis characterization is performed on each colloidal product. iv. The products are dispensed into a 96-well photocatalytic plate reactor and exposed to standardized light irradiation. v. Headspace gas is sampled and analyzed via integrated GC-MS to measure H2 yield. c. Data Integration: The performance metric (H2 turnover frequency) is fed back to the BO algorithm. d. Iteration: Steps b and c are repeated for 15 cycles (60 total experiments). The Pareto front of composition vs. activity is constructed.

Mandatory Visualizations

Diagram 1: EI-Driven Catalyst Discovery Workflow

Diagram 2: EI Computation Logic for Catalyst Selection

The Scientist's Toolkit: Research Reagent Solutions

Item	Function in Catalyst EI-Optimization
Metal Organic Precursor Libraries	Provides a diverse, soluble source of metal ions for precise, automated formulation of bimetallic/multimetallic compositions.
Automated Microfluidic Reactor	Enables rapid, reproducible synthesis of nanoparticles under controlled temperature and residence time, generating consistent samples for testing.
High-Throughput Photocatalytic Plate Reader	A parallelized reactor system with integrated light source and online/offline product detection (e.g., via GC or fluorescence) for rapid activity screening.
Gaussian Process Software (e.g., GPyTorch, scikit-learn)	Constructs the probabilistic surrogate model that quantifies prediction uncertainty (σ), which is essential for calculating the Expected Improvement.
Bayesian Optimization Platform (e.g., BoTorch, Ax)	Integrates the GP model and EI acquisition function to manage the optimization loop, handle categorical variables, and suggest batch experiments.
Cheminformatics Descriptor Software (e.g., RDKit)	Generates quantitative composition or structural descriptors (e.g., electronegativity, ionic radius) as features for the GP model in complex alloy spaces.

Conclusion

Expected Improvement stands as a powerful, theoretically grounded acquisition function that provides a principled framework for accelerating catalyst discovery. By efficiently balancing the need to explore new regions of compositional space with the goal of refining promising candidates, EI directly addresses the core economic and temporal pressures in modern R&D. The integration of EI into automated experimental platforms represents a paradigm shift from intuition-driven to data-driven catalyst design. Future directions point toward the development of more robust, constraint-aware, and multi-fidelity EI variants, as well as their fusion with generative models for de novo catalyst proposal. Ultimately, mastering these Bayesian optimization techniques will be crucial for unlocking next-generation catalytic processes in pharmaceutical manufacturing, energy conversion, and sustainable chemistry.