Stochastic Optimization

Optimization Algorithms

Solution Landscape Example

Optimization forms the basis of machine learning and planning algorithms, e.g., to minimize a loss function or to maximize the expected cumulative reward in a sequential decision problem.

We focus on stochastic optimization based on evolutionary [1] or quantum [2] computing to solve complex problems in planning and (polymatrix) game theory.

Publications:
[1] Productive Fitness
[2] Quantum Annealing for Nash Equilibria Search
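
As a rough illustration of stochastic optimization in the evolutionary style, the sketch below minimizes a toy loss with Gaussian mutation and elitist selection. It is a generic textbook-style scheme, not the method of [1] or [2]; the names `sphere` and `evolve` and all parameter values are assumptions chosen for this example.

```python
import random


def sphere(x):
    """Toy loss with its minimum of 0 at the origin (illustrative only)."""
    return sum(v * v for v in x)


def evolve(loss, dim=3, pop_size=20, generations=50, sigma=0.3, seed=0):
    """Minimize `loss` with a simple elitist evolutionary algorithm.

    Returns the best individual found and the best loss per generation.
    All defaults are hypothetical values for this sketch.
    """
    rng = random.Random(seed)
    population = [[rng.uniform(-5, 5) for _ in range(dim)]
                  for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        # Variation: each parent produces one Gaussian-mutated offspring.
        offspring = [[v + rng.gauss(0, sigma) for v in parent]
                     for parent in population]
        # Selection: keep the best pop_size of parents and offspring,
        # so the best loss can never get worse between generations.
        population = sorted(population + offspring, key=loss)[:pop_size]
        history.append(loss(population[0]))
    return population[0], history


if __name__ == "__main__":
    best, history = evolve(sphere)
    print("best loss per generation (first, last):", history[0], history[-1])
```

Because selection keeps parents alongside offspring, the recorded best loss is non-increasing; schemes without this elitism trade that guarantee for more exploration.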

Monte Carlo Planning

Closed-Loop and Open-Loop Planning

Monte Carlo Planning (MCP) is a sampling-based approach to sequential decision making that is suitable for domains with enormous branching factors. MCP can be used for online planning, where the agent alternates between reasoning and acting at each time step. MCP requires only limited domain knowledge in the form of a black-box simulator, plus a computation budget for the stochastic optimization of decisions.
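
The online planning loop described above can be sketched as follows. This is a minimal flat Monte Carlo planner under stated assumptions, not one of the algorithms in the publications below: `simulate` is a hypothetical black-box simulator for a noisy one-dimensional walk, and `GOAL`, `ACTIONS`, the budget, and the horizon are made-up values for illustration.

```python
import random

GOAL = 5            # hypothetical target position
ACTIONS = (-1, +1)  # move left or right


def simulate(state, action, rng):
    """Hypothetical black-box simulator: returns (next_state, reward).

    With probability 0.1 the move is reversed, so outcomes are stochastic.
    """
    if rng.random() < 0.1:
        action = -action
    next_state = state + action
    return next_state, -abs(next_state - GOAL)


def rollout_return(state, first_action, horizon, rng):
    """Sample one return: take first_action, then act uniformly at random."""
    total, action = 0.0, first_action
    for _ in range(horizon):
        state, reward = simulate(state, action, rng)
        total += reward
        action = rng.choice(ACTIONS)
    return total


def plan(state, budget=200, horizon=5, rng=None):
    """Spend `budget` simulated rollouts and return the best first action."""
    if rng is None:
        rng = random.Random()
    per_action = budget // len(ACTIONS)

    def mean_return(action):
        samples = (rollout_return(state, action, horizon, rng)
                   for _ in range(per_action))
        return sum(samples) / per_action

    return max(ACTIONS, key=mean_return)


def run_online(start=0, steps=15, seed=0):
    """Alternate between planning and acting at every time step."""
    rng = random.Random(seed)
    state = start
    for _ in range(steps):
        action = plan(state, rng=rng)
        # In this sketch the simulator also stands in for the real environment.
        state, _ = simulate(state, action, rng)
    return state


if __name__ == "__main__":
    print("final state:", run_online())
```

The planner only ever calls `simulate`, so it needs no model of the transition or reward structure; the budget caps how many simulated rollouts are spent per decision.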

We devise algorithms that address various real-world challenges of MCP, such as resource restrictions [1,2], temporal abstraction [3], and counterfactual reasoning [4]. Our algorithms are open-loop to reduce the search space while still being able to make good decisions in complex domains.

Publications:
[1] Memory Bounded Open-Loop Planning
[2] Adaptive Thompson Sampling Stacks
[3] Subgoal-Based Monte Carlo Planning
[4] Counterfactual Monte Carlo Planning

Thomy Phan
