Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement Learning
Publication date
2025-06-05
Editors
Vorobeychik, Yevgeniy
Das, Sanmay
Nowe, Ann
Advisors
Supervisors
Document Type
Part of book
Metadata
Show full item recordCollections
License
cc_by
Abstract
An important challenge in multi-objective reinforcement learning is obtaining a Pareto front of policies to attain optimal performance under different preferences. We introduce Iterated Pareto Referent Optimisation (IPRO), which decomposes finding the Pareto front into a sequence of constrained single-objective problems. This enables us to guarantee convergence while providing an upper bound on the distance to undiscovered Pareto optimal solutions at each step. We evaluate IPRO using utility-based metrics and its hypervolume and find that it matches or outperforms methods that require additional assumptions. By leveraging problem-specific single-objective solvers, our approach also holds promise for applications beyond multi-objective reinforcement learning, such as planning and path finding.
Keywords
Multi-objective, Pareto front, Reinforcement learning, Artificial Intelligence, Software, Control and Systems Engineering
Citation
Röpke, W, Reymond, M, Mannion, P, Roijers, D M, Nowé, A & Rădulescu, R 2025, Divide and Conquer : Provably Unveiling the Pareto Front with Multi-Objective Reinforcement Learning. in Y Vorobeychik, S Das & A Nowe (eds), Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025. Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), pp. 1774-1783, 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025, Detroit, United States, 19/05/25. https://doi.org/10.5555/3709347.3743813, conference