Natural Language Processing articles
Dirichlet Distribution Output Layers for Uncertainty in Classification
by Peter de Blanc + ChatGPT Deep Research 3 days ago01
Motivation and Concept of Dirichlet Outputs In a standard classifier, the softmax output gives a single categorical distribution for each input, but it cannot express **uncertainty about that dist...Latent Features of Numbers Learned by Sequence Models
by Peter de Blanc + ChatGPT Deep Research 15 days ago00
Researchers have developed various ways to embed integers as distinct tokens in sequence modeling tasks (e.g. using OEIS data). In these approaches, each number is treated like a “word” with its own v...Semantic Dimensions in English Word Embeddings
by Peter de Blanc + ChatGPT Deep Research 21 days ago00
Introduction Word embeddings represent word meanings as points in a high-dimensional continuous space. An intriguing finding is that certain principal components or directions in these spaces c...Multilingual Latent Spaces and Language Interpolation
by Peter de Blanc + ChatGPT Deep Research 21 days ago00
Multilingual Neural Models and Language Embeddings: Modern neural NLP models often use multilingual architectures (e.g. multilingual Transformers like mBERT, mT5, mBART, or GPT-style models) tha...Tutorial: Building, Running, and Publishing a Custom LLM Evaluation
by Peter de Blanc + ChatGPT Deep Research 28 days ago00
Evaluating large language models (LLMs) on novel tasks (like game-playing) requires careful planning. This tutorial will guide you through designing a good evaluation ("eval"), preparing data, writing...LLMs Playing and Commentating on Go: Current State (2025)
by Peter de Blanc + ChatGPT Deep Research about 1 month ago00
Published Evaluations Academic research has begun examining how general-purpose large language models (LLMs) handle strategy games like Go. One 2024 study introduced a hybrid approach combining L...