Tag: ML Engineering
All articles tagged "ML Engineering".
-
Tool Selection Optimization for LLM Agents at Scale
· 14 min read
A deep technical dive into tool selection: retrieval strategies, context optimization, learned selection, and the engineering trade-offs that matter when scaling to hundreds of tools.
-
RLHF from an Engineering Perspective: PPO, GRPO, DPO, and Tool-Use Implementation
· 12 min read
A practical engineering guide to RLHF implementation, covering PPO, GRPO, DPO, and tool-use training, with code snippets and debugging tips.
-
Post-Training Is Not 'One Algorithm': Objective Functions and Implementation Essentials for PPO / DPO / GRPO
· 12 min read
Reading notes on RLHF covering PPO, DPO, and GRPO, treating post-training as an engineering pipeline rather than a single algorithm.