Tag: ML
All the articles with the tag "ML".
-
What Worked (and What Didn't) When Training AEs and VAEs for Embedding Compression
· 4 min readPractical lessons from training autoencoders and VAEs for embedding compression—covering dimension choice, KL scheduling, contrastive signals, and evaluation metrics.
-
User Interest Modeling with Transformer Architectures
· 4 min readExploring position embeddings, architecture choices, and training techniques for Transformer-based recommender systems.
-
How to Make LLM Inference Faster
· 4 min readAn overview of LLM inference optimization techniques including KV cache, FlashAttention, and memory management strategies.