Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement Learning

Publication date

2025-06-05

Authors

Röpke, Willem
Reymond, Mathieu
Mannion, Patrick
Roijers, Diederik M.
Nowé, Ann
Radulescu, RoxanaORCID 0000-0003-1446-5514ISNI 0000000524689348

Editors

Vorobeychik, Yevgeniy
Das, Sanmay
Nowe, Ann

Advisors

Supervisors

Document Type

Part of book
Open Access logo

License

cc_by

Abstract

An important challenge in multi-objective reinforcement learning is obtaining a Pareto front of policies to attain optimal performance under different preferences. We introduce Iterated Pareto Referent Optimisation (IPRO), which decomposes finding the Pareto front into a sequence of constrained single-objective problems. This enables us to guarantee convergence while providing an upper bound on the distance to undiscovered Pareto optimal solutions at each step. We evaluate IPRO using utility-based metrics and its hypervolume and find that it matches or outperforms methods that require additional assumptions. By leveraging problem-specific single-objective solvers, our approach also holds promise for applications beyond multi-objective reinforcement learning, such as planning and path finding.

Keywords

Multi-objective, Pareto front, Reinforcement learning, Artificial Intelligence, Software, Control and Systems Engineering

Citation

Röpke, W, Reymond, M, Mannion, P, Roijers, D M, Nowé, A & Rădulescu, R 2025, Divide and Conquer : Provably Unveiling the Pareto Front with Multi-Objective Reinforcement Learning. in Y Vorobeychik, S Das & A Nowe (eds), Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025. Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), pp. 1774-1783, 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025, Detroit, United States, 19/05/25. https://doi.org/10.5555/3709347.3743813, conference