yonaka

About Website

This website is a devlog for my personal project with the goal of letting Yonaka play games.

I will be using the posts as a way to log my experiments and experience, and methods I develop while working on the project.

The project will mostly consist of working with Reinforcement Learning, specifically Q learning instead of any of the “advanced” policy gradient methods.

This is because I believe value based methods are more extendable and higher potential, once you properly understand them. That is why I’m starting by extending Q learning to continuous actions and multiple actions first, as those is seen as the major limiting factor of Q learning.