Peter de Blanc

But maybe an even more important application could be in fine-tuning or online learning. When training on a new observation, we should increase its pseudocount by 1, which we might achieve by doing binary search over gradient descent step sizes.

Peter de Blanc posted a new article

Dirichlet Distribution Output Layers for Uncertainty in Classification

by Peter de Blanc + ChatGPT Deep Research 3 months ago

#Neural Networks #epistemic uncertainty #multi-label classification more...

Motivation and Concept of Dirichlet Outputs In a standard classifier, the softmax output gives a single categorical distribution for each input, but it cannot express **uncertainty about that dist...

Peter de Blanc posted a new article

Balancing Strength and Surprise in Adversarial AI

by Peter de Blanc + ChatGPT Deep Research 3 months ago

#Temperature Sampling #Adversarial AI #Nash Equilibrium more...

Mixed Strategies and Exploitability (Game-Theoretic Foundations) In classical game theory, randomized (mixed) strategies are often essential to avoid exploitation. A deterministic (pure) stra...

Peter de Blanc commented on Risk-Adjusted Performance Metrics for Investment Portfolios

Peter de Blanc

4 months ago

It looks like there was an issue with the formatting. I think there should be a "copy" button in the Gemini frontend that copies the markdown code. That should work better than "select all." You can still edit your article after posting to fix it.

Peter de Blanc posted a new article

Adarie Go Eval Project

by Peter de Blanc + ChatGPT Deep Research 4 months ago

#Research Projects #Language Models #Artificial Intelligence more...

We're developing an eval (i.e. a benchmark) of Go-playing skill of language models. This is just a fun little research project and something we can use to test out deep research and the Adarie publis...

Peter de Blanc posted a new article

Latent Features of Numbers Learned by Sequence Models

by Peter de Blanc + ChatGPT Deep Research 4 months ago

#Machine Learning #Divisibility #Neural Networks more...

Researchers have developed various ways to embed integers as distinct tokens in sequence modeling tasks (e.g. using OEIS data). In these approaches, each number is treated like a “word” with its own v...

Peter de Blanc posted a new article

Semantic Dimensions in English Word Embeddings

by Peter de Blanc + ChatGPT Deep Research 4 months ago

#Cognitive Science #Semantic Dimensions #Natural Language Processing more...

Introduction Word embeddings represent word meanings as points in a high-dimensional continuous space. An intriguing finding is that certain principal components or directions in these spaces c...

Peter de Blanc posted a new article

Multilingual Latent Spaces and Language Interpolation

by Peter de Blanc + ChatGPT Deep Research 4 months ago

#Variational Autoencoders #Language Interpolation #Zero-Shot Translation more...

Multilingual Neural Models and Language Embeddings: Modern neural NLP models often use multilingual architectures (e.g. multilingual Transformers like mBERT, mT5, mBART, or GPT-style models) tha...

Peter de Blanc commented on Python Libraries for Go (Baduk/Weiqi) in Research

Peter de Blanc

4 months ago

I decided to go with KaTrain for now. We can pin the version, so even if the API is unstable it's still kinda usable.

If we're feeling ambitious in the future, we might consider developing our own library, or we might fork the KaTrain repo and delete all the nonessential (GUI) code as a starting point.

Peter de Blanc posted a new article

Python Libraries for Go (Baduk/Weiqi) in Research

by Peter de Blanc + ChatGPT Deep Research 4 months ago

#Research tools #KataGo #Game theory more...

Introduction Researchers working with the game of Go often need tools to simulate games, parse game records, and leverage AI engines. Key requirements include: Board State Simulation: Giv...

Peter de Blanc posted a new article

Tutorial: Building, Running, and Publishing a Custom LLM Evaluation

by Peter de Blanc + ChatGPT Deep Research 4 months ago

#Large Language Models #Machine Learning #Model Performance more...

Evaluating large language models (LLMs) on novel tasks (like game-playing) requires careful planning. This tutorial will guide you through designing a good evaluation ("eval"), preparing data, writing...

Peter de Blanc posted a new article

Introduction to Japanese and Korean Grammar: A Comparative Overview

by Peter de Blanc + Gemini 2.5 Pro 4 months ago

#Korean Grammar #Syntax #Japanese Grammar more...

Japanese and Korean, while geographically close and sharing significant cultural exchange and loanwords, have a disputed genetic linguistic relationship (often debated within the context of the contro...

Peter de Blanc posted a new article

LLMs Playing and Commentating on Go: Current State (2025)

by Peter de Blanc + ChatGPT Deep Research 4 months ago

#Large Language Models #Monte Carlo Tree Search #Artificial Intelligence more...

Published Evaluations Academic research has begun examining how general-purpose large language models (LLMs) handle strategy games like Go. One 2024 study introduced a hybrid approach combining L...

Peter de Blanc commented on Perturbation Theory in Classical Mechanics: An Introduction

Peter de Blanc

4 months ago

In regular perturbation theory, we assume the solution can be expressed as a regular power series in the small parameter ε for ε sufficiently small.

This statement confused me a bit, and I wonder if it's a mistake. I think it should just say "...expressed as a power series..." rather than as a "regular" power series.

Peter de Blanc posted a new article