Skip to content

Start Here

A short map of the main themes on this site.

This site is about the practical side of ML and AI systems: how models, agents, training pipelines, evaluations, and product interfaces behave when they leave the paper and enter a real engineering loop.

The fastest entry point is to pick a path below, then follow the linked topic hub when you want the full archive.

Reading Paths

LLM Agents

Tool use, agent runtime design, evaluation, context, and production patterns for systems that act across tools and environments.

Evaluation

Practical approaches to measuring model and agent capability with deterministic checks, rubrics, trajectories, and verifiable outcomes.

Post-Training

SFT, RLHF, preference optimization, instruction following, reasoning traces, and data pipelines for shaping model behavior after pretraining.

RLHF and Preference Optimization

Engineering notes and research synthesis on PPO, DPO, GRPO, reward modeling, preference data, and model behavior optimization.

Generative UI

How AI systems can produce, steer, and execute user interfaces with structured representations and practical product constraints.