Topics
Curated reading paths for the recurring themes on this site.
- LLM Agents
Tool use, agent runtime design, evaluation, context, and production patterns for systems that act across tools and environments.
- Evaluation
Practical approaches to measuring model and agent capability with deterministic checks, rubrics, trajectories, and verifiable outcomes.
- Post-Training
SFT, RLHF, preference optimization, instruction following, reasoning traces, and data pipelines for shaping model behavior after pretraining.
- RLHF and Preference Optimization
Engineering notes and research synthesis on PPO, DPO, GRPO, reward modeling, preference data, and model behavior optimization.
- Generative UI
How AI systems can produce, steer, and execute user interfaces with structured representations and practical product constraints.