Topics

Curated reading paths for the recurring themes on this site.

LLM Agents
Tool use, agent runtime design, evaluation, context management, and production patterns for systems that act across tools and environments.
Evaluation
Practical approaches to measuring model and agent capability with deterministic checks, rubrics, trajectories, and verifiable outcomes.
Post-Training
SFT, RLHF, preference optimization, instruction following, reasoning traces, and data pipelines for shaping model behavior after pretraining.
RLHF and Preference Optimization
Engineering notes and research synthesis on PPO, DPO, GRPO, reward modeling, preference data, and model behavior optimization.
Generative UI
How AI systems can produce, steer, and execute user interfaces with structured representations and practical product constraints.