Blog | Kyoung Whan Choe

2026

Less is More: When Agents Learn Not Because But Despite My Doings

January 25, 2026 • ai, reinforcement-learning

A humbling story of how debugging led to a massive simplification that worked twice as well. Yes, RL is complicated.

2025

Scaffolding to Superhuman: How Curriculum Learning Solved 2048 and Tetris

December 29, 2025 • ai, reinforcement-learning

Training gaming agents that beat massive search-based solutions on 2048 using a 15MB policy, and discovering that bugs can be features in Tetris.

Shedding Expertise: What Matters?

December 23, 2025 • ai, thoughts

I'm living this, questioning what I thought was my expertise. AI models provide internet-scale possibilities, and motivated humans can crystallize one thread with their own preference and taste.

The US–China AI Bet: Monopoly vs. Commodity

August 26, 2025 • ai, geopolitics

Deep insights on why China is building open-weight models and their bet on energy and action over intelligence alone.

If Robot Hands Can Make Robot Hands Can Make Robot Hands Can

August 05, 2025 • robotics, ai

Robot hand assembly could serve as an excellent benchmark for robot manipulation, similar to how coding benchmarks work for LLMs. Who knew LLMs would code so well?

Reinforcement Learning Is Hitting Its Inflection Point

July 19, 2025 • ai, reinforcement-learning

Reflections on technology maturity cycles, from missing the eye-tracking wave to catching the RL wave with PufferLib.

Puffer-PHC: Simplifying Perpeptual Humanoid Control with PufferLib

March 24, 2025 • robotics, reinforcement-learning

Making the Perceptual Humanoid Control repo simpler and training 60k+ SPS on a single 4090 with PufferLib.

2024

Bring Me a Beer: A Turing Test for Home Robots

December 05, 2024 • robotics

Starting to hack the Hiwonder Armpi Pro robot arm. The 'bring me a beer' task is still hard.

Building Card Table with Claude Artifacts

November 15, 2024 • projects, ai

I had never used React before, but with Claude Artifacts I built a web app for literature reviews. A strange but increasingly common experience.

Thoughts on Picking Games Worth Playing

November 08, 2024 • research, thoughts

An interesting analysis of how luck and skill influence rankings across chess, sports, video games, and hierarchies. Makes me think about which games I'm playing.

mg2hfbot: Testing LeRobot on MimicGen Tasks

November 06, 2024 • robotics, reinforcement-learning

Converting MimicGen datasets to train and evaluate LeRobot & RoboMimic policies on complex manipulation tasks without a physical robot.

Spatial Intelligence: Research or Product?

September 20, 2024 • ai, thoughts

Thoughts on Fei-Fei Li's spatial intelligence venture and whether transformers are the right architecture for this problem.

Reflections from Actuate 2024

September 18, 2024 • robotics, ai

Key reflections from the 2024 Actuate Robotics & Embodied AI Conference on unified models vs specialized collaboration and interpretability.

Crafting Review Criteria for RL Environments, with AI

June 26, 2024 • reinforcement-learning, research

Developing review criteria for RL environment papers with Claude's help. Emphasizing reproducibility and usability over novelty.

Meta MMO: A Massive Update to Neural MMO

June 07, 2024 • reinforcement-learning, research

Introducing Meta MMO - 3x faster training, diverse minigames, and successfully trained generalist agents. Open-source under MIT license.

2023

Neural MMO Talk at NeurIPS 2023

December 10, 2023 • reinforcement-learning, research

Excited to speak about Neural MMO at NeurIPS and discuss how NMMO can support open-ended multi-agent RL research.

NeurIPS 2023 Neural MMO Challenge

November 15, 2023 • reinforcement-learning, research

The NeurIPS 2023 Neural MMO Challenge is live on AICrowd with $20K in prizes. Train agents that generalize to new tasks, maps, and opponents.

In Support of Libraries as AI Research

October 02, 2023 • research, thoughts

Supporting Joseph Suarez's open letter to NeurIPS D&B organizers about the importance of libraries and infrastructure in AI research.

2020

Web-based Cognitive Tasks: Interactive jsPsych Experiments

April 26, 2020 • research, cognitive-neuroscience

A collection of web-based cognitive tasks built with jsPsych for research. Try them yourself - from decision-making to working memory tasks!

2019

Math Anxiety & Avoidance: We Can't Even Pay People to Do Hard Math

November 24, 2019 • research, cognitive-neuroscience

Higher levels of math anxiety were associated with a tendency to select easier, low-reward problems over harder, high-reward math problems, suggesting that we cannot even pay math-anxious people to do hard math.