Tag: Post-Training

All the articles with the tag "Post-Training".

Instruction Following: What Models Get Wrong and How to Fix It with Better Post-Training Data

4 Mar, 2026 · 36 min read

LLMs can write poetry and solve math, but ask them to 'respond in exactly 3 bullet points using only lowercase' and they stumble. This post dissects the taxonomy of instruction-following failures and provides a practical playbook for building post-training data that actually fixes them.

Instruction Following: What Models Get Wrong and How to Fix It with Better Post-Training Data