Can clustering methods improve reward shaping in reinforcement learning?

SiteDiane · April 10, 2025, 8:01am

Hey everyone,

I’ve been playing around with some small reinforcement learning environments, and I started wondering if there’s any practical benefit to using unsupervised clustering (like K-Means or DBSCAN) to shape rewards or better define states.

For example, could clustering be used to detect patterns in state trajectories and somehow influence the reward function dynamically? Or maybe as a preprocessing step to reduce state complexity?

Thanks!

conscell · April 11, 2025, 3:01am

Hi @SiteDiane,

Excellent question!
Clustering can help identify structure in the state space that isn’t obvious from the raw state features. The Clustered Reinforcement Learning method, for example, divides the collected states into several clusters and then uses those clusters to give the agent a bonus reward tied to the cluster neighboring its current state.
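
To make the idea concrete, here is a minimal sketch of a cluster-based bonus. It is not the paper's actual algorithm: it assumes a simple count-based bonus in which rarely visited clusters pay more, and the names `kmeans`, `nearest`, `shaped_reward`, and `beta` are hypothetical, chosen only for illustration.

```python
import math
import random

def nearest(point, centroids):
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(point, centroids[i])),
    )

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means on a list of equal-length tuples; returns centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        # Keep the old centroid when a cluster ends up empty.
        centroids = [
            tuple(sum(dim) / len(g) for dim in zip(*g)) if g else c
            for g, c in zip(groups, centroids)
        ]
    return centroids

def shaped_reward(env_reward, state, centroids, visit_counts, beta=0.1):
    """Environment reward plus a bonus that shrinks as the state's cluster is revisited."""
    c = nearest(state, centroids)
    visit_counts[c] += 1
    return env_reward + beta / math.sqrt(visit_counts[c])

# Toy data (assumed): previously collected 2-D states forming two separated blobs.
rng = random.Random(1)
states = [(rng.gauss(0, 0.1), rng.gauss(0, 0.1)) for _ in range(50)]
states += [(rng.gauss(5, 0.1), rng.gauss(5, 0.1)) for _ in range(50)]

centroids = kmeans(states, k=2)
counts = [0] * len(centroids)

# Two visits to the same region: the second visit earns a smaller bonus.
first = shaped_reward(0.0, (0.0, 0.0), centroids, counts)
second = shaped_reward(0.0, (0.05, 0.0), centroids, counts)
```

In a real setup you would probably re-cluster periodically as the agent collects new states, and tune `beta` so the bonus does not swamp the environment reward.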

This paper discusses a problem with applying clustering directly to reinforcement learning: states grouped into the same cluster may follow different transition processes under the same action, which results in poor policy performance.
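
A rough way to see this issue on your own data is to check whether states in the same cluster end up in different clusters after the same action. The sketch below is only an illustrative diagnostic under assumed toy conditions (a 1-D corridor, clusters of width 2); `ambiguous_pairs` and `cluster_of` are hypothetical names and are not taken from the paper.

```python
from collections import defaultdict

def ambiguous_pairs(transitions, cluster_of):
    """Return (cluster, action) pairs whose member states reach different next clusters."""
    outcomes = defaultdict(set)
    for state, action, next_state in transitions:
        outcomes[(cluster_of(state), action)].add(cluster_of(next_state))
    return {pair: nxt for pair, nxt in outcomes.items() if len(nxt) > 1}

# Toy corridor (assumed): positions 0..5, and "right" always moves one step.
transitions = [(s, "right", s + 1) for s in range(5)]

def cluster_of(s):
    # Assumed clustering: positions {0,1}, {2,3}, {4,5} share a cluster.
    return s // 2

conflicts = ambiguous_pairs(transitions, cluster_of)
```

Here positions 0 and 1 share a cluster, but "right" keeps position 0 inside it and moves position 1 out of it, so a policy that only sees the cluster id cannot tell the two cases apart.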

SiteDiane · April 11, 2025, 5:36am

Thank you so much!
