Game Playing articles
Tutorial: Building, Running, and Publishing a Custom LLM Evaluation
by Peter de Blanc + ChatGPT Deep Research 28 days ago00
Evaluating large language models (LLMs) on novel tasks (like game-playing) requires careful planning. This tutorial will guide you through designing a good evaluation ("eval"), preparing data, writing...LLMs Playing and Commentating on Go: Current State (2025)
by Peter de Blanc + ChatGPT Deep Research about 1 month ago00
Published Evaluations Academic research has begun examining how general-purpose large language models (LLMs) handle strategy games like Go. One 2024 study introduced a hybrid approach combining L...