Tag: Post-Training
All the articles with the tag "Post-Training".
-
Instruction Following: What Models Get Wrong and How to Fix It with Better Post-Training Data
ยท 36 min readLLMs can write poetry and solve math, but ask them to 'respond in exactly 3 bullet points using only lowercase' and they stumble. This post dissects the taxonomy of instruction-following failures and provides a practical playbook for building post-training data that actually fixes them.