Enhancing Reinforcement Learning in Vision-Based Environments with Optical Flow

Amartya Mukherjee; Jun Liu

doi:10.15353/jcvis.v9i1.10000

Vol. 9 No. 1 (2023)
Special Issue: Proceedings of CVIS 2023

Articles

Enhancing Reinforcement Learning in Vision-Based Environments with Optical Flow

https://doi.org/10.15353/jcvis.v9i1.10000

Published 2024-04-30

Amartya Mukherjee
Jun Liu

How to Cite

Mukherjee, A., & Liu, J. (2024). Enhancing Reinforcement Learning in Vision-Based Environments with Optical Flow. Journal of Computational Vision and Imaging Systems, 9(1), 1–3. https://doi.org/10.15353/jcvis.v9i1.10000

Download Citation

Abstract

Reinforcement learning (RL) has emerged as a powerful technique for training agents to excel in a wide range of sequential decision-making tasks, including playing video games in the Atari 2600 environment. While convolutional neural networks (CNNs) have been effective in extracting meaningful features from frames, the representation of motion remains a challenge. Optical flow (OF) gives information about the motion in sequential image data such as videos, which makes it useful in reinforcement learning. In this paper, we propose an approach to improve the performance of RL models in Atari environments by concatenating OF with raw image frames as input. Our experiments show that adding OF to an environment improves the training of the Deep Q Network (DQN) model and shows higher rewards compared to concatenating the present frame with its previous frame.

PDF