https://ajing.github.io/ https://ajing.github.io/about/ https://ajing.github.io/archives/ https://ajing.github.io/posts/ https://ajing.github.io/start-here/ https://ajing.github.io/topics/ https://ajing.github.io/topics/agents/ https://ajing.github.io/topics/evaluation/ https://ajing.github.io/topics/post-training/ https://ajing.github.io/topics/rlhf/ https://ajing.github.io/topics/generative-ui/ https://ajing.github.io/posts/2026-05-28-agent-eval-difficulty-trajectory-constraints/ https://ajing.github.io/posts/2026-04-13-mercor-breach-what-leaked-about-ai-training/ https://ajing.github.io/posts/2026-03-11-improving-llm-i18n-tool-use-agency/ https://ajing.github.io/posts/2026-03-08-unverifiable-rewards-rl-frontier/ https://ajing.github.io/posts/2026-03-05-instruction-following-post-training-data/ https://ajing.github.io/posts/2026-03-01-experience-augmented-icl-complement-to-rl-post-training/ https://ajing.github.io/posts/2026-01-10-tool-selection-optimization-llm-agents-at-scale/ https://ajing.github.io/posts/2026-01-03-generative-engine-optimization-geo/ https://ajing.github.io/posts/2026-01-02-ads-formats-in-llm-products/ https://ajing.github.io/posts/2026-01-02-ads-in-llm-chatbot/ https://ajing.github.io/posts/2026-01-01-generative-ui-doesnt-move-the-needle-steering-does/ https://ajing.github.io/posts/2025-12-31-rlhf-engineering-implementation/ https://ajing.github.io/posts/2025-12-31-rlhf-ppo-dpo-grpo-notes/ https://ajing.github.io/posts/2025-09-07-ae-vae-training-learnings/ https://ajing.github.io/posts/2025-09-07-user-interest-modeling-with-transformer-architectures/ https://ajing.github.io/posts/2025-05-03-ui-representation-action-execution/ https://ajing.github.io/posts/2025-04-01-generative-ui/ https://ajing.github.io/posts/2023-08-20-how-to-make-llm-serving-faster/