A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis

doi:10.48550/arXiv.2302.04179

Amidzadeh, Mohsen

Many sequential decision-making problems require optimizing several objectives that may conflict with one another. The conventional way to handle such a multi-task problem is to build a scalar objective function as a linear combination of the individual objectives. However, when the conflicting objectives have different scales, this method requires trial and error to find suitable weights for the combination. As such, in most cases, this approach cannot guarantee a Pareto-optimal solution. In this paper, we develop a single-agent, scale-independent multi-objective reinforcement learning algorithm based on the Advantage Actor-Critic (A2C) algorithm. We then carry out a convergence analysis of the devised multi-objective algorithm, which provides a convergence-in-mean guarantee. Finally, we run experiments on a multi-task problem to evaluate the performance of the proposed algorithm. Simulation results show that the developed multi-objective A2C approach outperforms the single-objective algorithm.
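
The scale sensitivity of linear scalarization described in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the policy names, return values, and weights below are hypothetical numbers chosen only to show that re-expressing one objective in different units can change which candidate a fixed weighted sum prefers.

```python
def scalarize(returns, weights):
    """Linear scalarization: weighted sum of per-objective returns."""
    return sum(w * r for w, r in zip(weights, returns))


def best_policy(candidates, weights):
    """Return the name of the candidate with the largest weighted-sum score."""
    return max(candidates, key=lambda name: scalarize(candidates[name], weights))


# Hypothetical (objective 1, objective 2) returns for two candidate policies.
candidates = {"policy_a": (1.0, 0.2), "policy_b": (0.4, 0.9)}
weights = (0.5, 0.5)  # assumed equal weights

choice_original = best_policy(candidates, weights)

# Same problem, but objective 1 is now expressed in units 100 times larger.
rescaled = {name: (100.0 * r1, r2) for name, (r1, r2) in candidates.items()}
choice_rescaled = best_policy(rescaled, weights)

print(choice_original, choice_rescaled)
```

With these assumed numbers the preferred policy changes after rescaling even though the underlying trade-off is the same, so the weights would have to be retuned whenever the units of an objective change.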

Publication:

arXiv e-prints

Pub Date:

February 2023

DOI:

10.48550/arXiv.2302.04179

arXiv:

arXiv:2302.04179

Bibcode:

2023arXiv230204179A

Keywords:

Computer Science - Machine Learning
