@mittens

Virgil King

@mittens

Joined November 2024

About

Nothing here yet.

Available for

Nothing here yet.

Virgil King's blogs

Exploration Biasexplorationbias.hashnode.dev4 posts

Articles Threads Comments

Recently published

explorationbias.hashnode.dev

Batch MCTS, Part 2

The previous post described a batch-inference MCTS implementation for self-play. Batches consisted of one state from each of N concurrent episodes. Unfortunately that approach doesn’t work for playing a competitive episode since in that case there’s ...

Dec 1, 20242 min read

explorationbias.hashnode.dev

Batch MCTS, Part 1

It’s not straightforward to get good performance out of AlphaZero’s Monte Carlo Tree Search. Search wants to perform inference for one game state at a time — the new leaf state visited at the end of each game simulation — and the results of that infe...

Dec 1, 20242 min read

explorationbias.hashnode.dev

AlphaZero modifications

I had to make a couple of changes to AlphaZero to apply it to games with randomness and more than two players. More than two players Handling more than two players is the easy one. AlphaZero uses values in the range 0-1 to represent state values for ...

Nov 29, 20243 min read

explorationbias.hashnode.dev

Introduction

Hi. I plan to use this blog to document my personal project to build a platform for playtesting board games against AI opponents, starting with the game Kingdomino. A primary motivation for this project is to learn a bunch of new-to-me tools like Typ...

Nov 28, 20243 min read

Virgil King

About

Available for

Virgil King's blogs

Recently published

Batch MCTS, Part 2

Batch MCTS, Part 1

AlphaZero modifications

Introduction

Search Hashnode

Virgil King

About

Available for

Virgil King's blogs

Recently published

Batch MCTS, Part 2

Batch MCTS, Part 1

AlphaZero modifications

Introduction