Posts
All the articles I've posted.
-
Post-Training Is Not 'One Algorithm': Objective Functions and Implementation Essentials for PPO / DPO / GRPO
· 12 min readReading notes on RLHF covering PPO, DPO, and GRPO—understanding post-training as an engineering pipeline rather than a single algorithm.
-
What Worked (and What Didn't) When Training AEs and VAEs for Embedding Compression
· 4 min readPractical lessons from training autoencoders and VAEs for embedding compression—covering dimension choice, KL scheduling, contrastive signals, and evaluation metrics.
-
User Interest Modeling with Transformer Architectures
· 4 min readExploring position embeddings, architecture choices, and training techniques for Transformer-based recommender systems.
-
UI Representation and Action Execution for Generative UI
· 3 min readExploring structured UI representation using JSON Schema, and how to implement action handlers for generative UI systems.
-
A Path Towards Generative UI
· 10 min readExploring how LLMs can dynamically generate user interfaces that adapt in real time to user needs—the vision behind generative UI.
-
How to Make LLM Inference Faster
· 4 min readAn overview of LLM inference optimization techniques including KV cache, FlashAttention, and memory management strategies.