Feed
Pro
Search

Author

Write
Drafts

Bug0 - The AI-native e2e QA regression testing Passmark - The open-source AI framework for regression testing Hackathons Changelog Brand Hashnode gql skill - let your AI agent publish to your Hashnode blog The Foreword by Hashnode - official blog from the Hashnode team @hashnode on X Hashnode on LinkedIn Support - hello+support@hashnode.com Code of Conduct Terms Privacy Sitemap
Sign in

Search Hashnode

Search posts, tags, users, and pages

Discussion on "Evaluating open source LLMs on Autonomous Codenames Simulations" | Hashnode

FeedDiscussion

Shukant Pal

Infra @ Meta

Jun 2

Evaluating open source LLMs on Autonomous Codenames Simulations

The code for this experiment is available at https://github.com/shukantpal/codewords. 1. Introduction The next step in the ascension of AI agents' capabilities is to perform long-range, complex tasks

shukantpal.hashnode.dev5 min read

Responses

No responses yet.