BAHADIR ARABACI – Medium

BAHADIR ARABACI

Reinforcement Learning — Mini Glossary — EN/TR

Agent (Ajan): An agent acquires decision-making skills through trial and error, guided by rewards and punishments from its surroundings.

Jun 9, 2023

Jun 9, 2023

The “Deep” in Reinforcement Learning

Deep Reinforcement Learning incorporates deep neural networks into the framework of Reinforcement Learning. This integration allows for…

May 31, 2023

The “Deep” in Reinforcement Learning

May 31, 2023

Two main approaches for solving RL problems: Policy-Based Methods/Value-Based Methods

Policy-Based Methods

May 31, 2023

Two main approaches for solving RL problems: Policy-Based Methods/Value-Based Methods

May 31, 2023

The Exploration/Exploitation trade-off

The exploration/exploitation trade-off is a fundamental concept in reinforcement learning that refers to the dilemma of choosing between…

May 31, 2023

The Exploration/Exploitation trade-off

May 31, 2023

Two Types of Tasks: Episodic and Continuing

Episodic Task

May 31, 2023

May 31, 2023

Identifying reward functions and the concept of discounted rewards

In reinforcement learning (RL), the reward serves as the fundamental feedback for the agent’s actions.

May 31, 2023

Identifying reward functions and the concept of discounted rewards

May 31, 2023

Observations/States Space

Observations: Observations refer to the information that an agent receives from the environment. In the context of reinforcement learning…

May 31, 2023

Observations/States Space

May 31, 2023

Markov Property

The Markov Property in Markov Decision Processes (MDPs) is a fundamental concept that significantly impacts the agent’s decision-making…

May 31, 2023

Markov Property

May 31, 2023

The reward hypothesis

In reinforcement learning, the learning process typically follows a loop that generates a sequence of state-action-reward-next state…

May 31, 2023

The reward hypothesis

May 31, 2023

How does Reinforcement Learning work?

The agent receives the initial state, denoted as S₀, from the environment. In this case, the state represents the first frame of a game.

May 31, 2023

How does Reinforcement Learning work?

May 31, 2023

BAHADIR ARABACI

BAHADIR ARABACI

https://github.com/arabacibahadir

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech