Tag: ML

All the articles with the tag "ML".

What Worked (and What Didn't) When Training AEs and VAEs for Embedding Compression

6 Sep, 2025 · 4 min read

Practical lessons from training autoencoders and VAEs for embedding compression—covering dimension choice, KL scheduling, contrastive signals, and evaluation metrics.
User Interest Modeling with Transformer Architectures

6 Sep, 2025 · 4 min read

Exploring position embeddings, architecture choices, and training techniques for Transformer-based recommender systems.
How to Make LLM Inference Faster

19 Aug, 2023 · 4 min read

An overview of LLM inference optimization techniques including KV cache, FlashAttention, and memory management strategies.

What Worked (and What Didn't) When Training AEs and VAEs for Embedding Compression