Evaluating open source LLMs on Autonomous Codenames Simulations
The code for this experiment is available at https://github.com/shukantpal/codewords.
1. Introduction
The next step in the ascension of AI agents' capabilities is to perform long-range, complex tasks
shukantpal.hashnode.dev5 min read