Home » Posts

Part II - From AlphaGo to MuZero
^[draft]

A paper review of Mastering the game of Go without human knowledge and Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm, as well as an introduction of AlphaGo Zero and AlphaZero

March 4, 2021 · 1 min · SY Chou

Mastering the game of Go without human knowledge

The paper propose AlphaGo Zero which is known as self-playing without human knowledge.

Reinforcement learning in AlphaGo Zero

$$ (p, v) = f_{\theta} $$

$$ l = (z - v)^2 - \pi^T log(p) + c||\theta||^2 $$

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

The paper propose AlphaZero which is known as self-playing to compete any kinds of board game.

Part II - From AlphaGo to MuZero
^[draft]

Mastering the game of Go without human knowledge

Reinforcement learning in AlphaGo Zero

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

COMMENTS

Your comments will encouage me to share more~~

Mastering the game of Go without human knowledge#

Reinforcement learning in AlphaGo Zero#

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm#

COMMENTS

Your comments will encouage me to share more~~

Mastering the game of Go without human knowledge

Reinforcement learning in AlphaGo Zero

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm