We're developing an eval (i.e. a benchmark) of Go-playing skill of language models.
This is just a fun little research project and something we can use to test out deep research and the Adarie publishing platform.
This page will be updated with our progress.
Articles
- LLMs Playing and Commentating on Go: Current State (2025)
- Tutorial: Building, Running, and Publishing a Custom LLM Evaluation
- Python Libraries for Go (Baduk/Weiqi) in Research
Add a comment
Sign in or sign up to post a comment.