Reinforcement Learning: Navigating mazes using SARSA
Sign up for access to the world's latest research
Abstract
AI
AI
This paper explores the application of the SARSA reinforcement learning algorithm to navigate mazes and identify the shortest path. The agent learns through a reward system that encourages exploration while penalizing sub-optimal actions. Experiments demonstrate that after sufficient episodes, the agent successfully reduces both the number of steps taken and the error rate in reaching the destination, highlighting the efficiency of the SARSA algorithm combined with an epsilon-greedy strategy.
FAQs
AI
What are the key results of using SARSA in maze navigation?add
The agent learned to find the shortest path, achieving a 0% error rate after approximately 80 episodes in smaller mazes.
How does the ϵ-greedy strategy enhance the SARSA algorithm?add
The ϵ-greedy strategy allows the agent to explore alternative paths, preventing it from getting trapped in local minima.
What impact do maze sizes have on the learning process?add
Larger mazes, such as 10x10, required significantly more episodes to stabilize performance, with 400 episodes needed.
How are Q-values updated in the SARSA algorithm?add
Q-values are updated based on the actual actions taken by the agent and the accompanying rewards received.
What metrics were used to measure the agent's performance in the mazes?add
The study measured rewards, steps required to reach the destination, and error rates over defined episode intervals.






Related papers
Pathfinding algorithm addresses the problem of finding the shortest path from source to destination and avoiding obstacles. One of the greatest challenges in the design of realistic Artificial Intelligence (AI) in computer games is agent movement. Pathfinding strategies are usually employed as the core of any AI movement system. In this work, A* search algorithm is used to find the shortest path between the source and destination on image that represents a map or a maze. Finding a path through a maze is a basic computer science problem that can take many forms. The A* algorithm is widely used in pathfinding and graph traversal. Different map and maze images are used to test the system performance (100 images for each map and maze). The system overall performance is acceptable and able to find the shortest path between two points on the images. More than 85% images can find the shortest path between the selected two points.
Lecture Notes in Computer Science
An emergence of intelligent behavior within a simple robotic agent is studied in this paper. Two control mechanisms for an agent are considereda radial basis function neural network trained by evolutionary algorithm, and a traditional reinforcement learning algorithm over a finite agent state space. A comparison of these two approaches is presented on the maze exploration problem.
TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES, 2016
In recent decades, reinforcement learning (RL) has been widely used in different research fields ranging from psychology to computer science. The unfeasibility of sampling all possibilities for continuous-state problems and the absence of an explicit teacher make RL algorithms preferable for supervised learning in the machine learning area, as the optimal control problem has become a popular subject of research. In this study, a system is proposed to solve mobile robot navigation by opting for the most popular two RL algorithms, Sarsa( λ) and Q( λ) . The proposed system, developed in MATLAB, uses state and action sets, defined in a novel way, to increase performance. The system can guide the mobile robot to a desired goal by avoiding obstacles with a high success rate in both simulated and real environments.
International Journal of Mechanical Engineering and Robotics Research
In robotics, autonomous movement is an important feature that enables the robot to move independently from one location to another. Autonomous movement within an unknown area requires the robot to carry out investigations. The concept of solving a maze has an important place in the field of robotics, and is based on one of the most important areas of robotics, the Decision-Making Algorithm. In this paper, we discuss and analyse existing maze solving algorithms, and investigate the recent development of autonomous maze solving robotic systems. In addition, the work presented in this paper guides the researcher and developer for choosing an adequate maze solving algorithm to develop an efficient maze solving robotic system for a certain scenario. Index Terms-maze, autonomous robot, maze solving, solver robot, maze solving algorithms II. ROBOTIC APPLICATIONS The possible applications for maze solving vehicles range from simple tasks such as transferring goods through factories, office buildings, classrooms and other 668
2019 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC), 2019
An approach to the problem of autonomous mobile robot obstacle avoidance using Reinforcement Learning, more precisely Q-Learning, is presented in this paper. Reinforcement Learning in Robotics has been a challenging topic for the past few years. The ability to equip a robot with a powerful enough tool to allow an autonomous discovery of an optimal behavior through trial-and-error interactions with its environment has been a reason for numerous deep research projects. In this paper, two different Q-Learning approaches are presented as well as an extensive hyperparameter study. These algorithms were developed for a simplistically simulated Bot'n Roll ONE A (Fig. 1). The simulated robot communicates with the control script via ROS. The robot must surpass three levels of iterative complexity mazes similar to the ones presented on RoboParty [1] educational event challenge. For both algorithms, an extensive hyperparameter search was taken into account by testing hundreds of simulations with different parameters. Both Q-Learning solutions develop different strategies trying to solve the three labyrinths, enhancing its learning ability as well as discovering different approaches to certain situations, and finishing the task in complex environments.
Maze solving problem is a very old problem, but still now it is considered as an important field of robotics. This field is based on decision making algorithms. The main aim of this project is to make an Arduino based efficient autonomous maze solver robot. Two simple mazes solving algorithms “Wall following algorithm” and “Flood fill algorithm” are used to make this robot. In this project Hardware development, software development and maze construction had been done. For performance testing, the robot will implement to solve 4×4 maze. Capability of finding the shortest path is also verified.
Robotics is very important now days, especially due to its increasing practice in many industries. It was recently observed that there is a great difficulty being faced in the separation of the articles in the industries, like Textiles. Being a manual job it was also time consuming and boring. Therefore, for giving this problem a technological perspective, automation in such field was required. When a robot having its own sense of judgment to the path which it follows, would be introduced then a high efficiency in performance could be achieved along with increase in reliability and affordability of the manufacturers could be seen. The robot would be self-sufficient to take a note the paths through which it is moving, hence executing some complex maze-solving algorithms in its CPU core and taking its own decision on turnings and reaching its goal. It would certainly be a proof of a robot having its own " Brain-like " structural methodology having an access to the real-time inputs, making the prototype an Artificially Intelligent Robot.
In this paper, the design of maze solving robot which has the ability to navigate automatically in an unknown area based on its own decision is presented. For the proposed design algorithm, a wall following technique based on LSRB and RSLB algorithm is used. The designed robot obtains input from ultrasonic sensor, Infra-red sensor and wheel rotation encoders and then make decision for solving maze. It has the capability to solve the maze by taking the shortest distant path and it stores the details for the further reference also. The designed robot has the ability to learn any arbitrary maze and find the possible shortest route for solving it. The best application of this designed robot could be for navigational purposes
2022
The following document describes the submission to the All The Agents Challenge of a system that inter-acts with the Autonomous Maze Environment Explorer Project (AMEE) 1 . Our solution is a collaborative route finding service for Semantically Defined Mazes. Agents with Reinforcement Learning tools are used to discover paths through the maze. These agents collaborate by sharing schematic knowledge of the maze (information about the maze layout). The code repository and video demonstration locations are detailed in the Online Resources section at the end of the document.

Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.