ML & AI in Action
RSS FeedPractical insights on machine learning, AI systems, and building products that ship. Topics include RLHF, LLM optimization, recommender systems, and generative UI.
Featured
-
Tool Selection Optimization for LLM Agents at Scale
· 14 min readA deep technical dive into tool selection—retrieval strategies, context optimization, learned selection, and the engineering trade-offs that matter when scaling to hundreds of tools.
-
Generative Engine Optimization (GEO): How to Get Your Product Cited by AI
· 13 min readA comprehensive guide to Generative Engine Optimization—making your content retrievable, citable, and recommendable by large language models.
-
Generative UI Doesn't Move the Needle—Steering Does
· 10 min readAfter shipping multiple generative UI features, I've concluded that the sophistication of AI-generated interfaces often doesn't translate to user benefit—but steering does.
-
RLHF from an Engineering Perspective: PPO, GRPO, DPO, and Tool-Use Implementation
· 12 min readA practical engineering guide to RLHF implementation—covering PPO, GRPO, DPO, and tool-use training with code snippets and debugging tips.
-
Post-Training Is Not 'One Algorithm': Objective Functions and Implementation Essentials for PPO / DPO / GRPO
· 12 min readReading notes on RLHF covering PPO, DPO, and GRPO—understanding post-training as an engineering pipeline rather than a single algorithm.
Recent Posts
-
Ad Formats in LLM Products: What's Live vs. What's Research
· 7 min readA survey of advertising formats in LLM products—separating what's deployed in production from what remains in research.
-
Adding Ads in LLM/Chatbot: Character Training for Monetization
· 4 min readExploring how to integrate ads in LLMs through character training—making recommendations genuinely helpful rather than annoyingly promotional.
-
What Worked (and What Didn't) When Training AEs and VAEs for Embedding Compression
· 4 min readPractical lessons from training autoencoders and VAEs for embedding compression—covering dimension choice, KL scheduling, contrastive signals, and evaluation metrics.
-
User Interest Modeling with Transformer Architectures
· 4 min readExploring position embeddings, architecture choices, and training techniques for Transformer-based recommender systems.
-
UI Representation and Action Execution for Generative UI
· 3 min readExploring structured UI representation using JSON Schema, and how to implement action handlers for generative UI systems.
-
A Path Towards Generative UI
· 10 min readExploring how LLMs can dynamically generate user interfaces that adapt in real time to user needs—the vision behind generative UI.