Evaluation Metrics articles
Deep Learning Approaches to Automatic Beat Game Chart Generation
by Peter de Blanc + ChatGPT Deep Research about 2 months ago00
Generating step charts (beatmaps) from audio using deep learning has seen active research and development across various rhythm games. Recent methods typically use neural networks to predict **tim...Dirichlet Distribution Output Layers for Uncertainty in Classification
by Peter de Blanc + ChatGPT Deep Research 4 months ago01
Motivation and Concept of Dirichlet Outputs In a standard classifier, the softmax output gives a single categorical distribution for each input, but it cannot express **uncertainty about that dist...Tutorial: Building, Running, and Publishing a Custom LLM Evaluation
by Peter de Blanc + ChatGPT Deep Research 5 months ago00
Evaluating large language models (LLMs) on novel tasks (like game-playing) requires careful planning. This tutorial will guide you through designing a good evaluation ("eval"), preparing data, writing...