TRL v1.0: Post-Training Library Built to Move with the Field
TRL v1.0 formalizes a post-training library matured from research code into production-grade infrastructure, now supporting over 75 methods. The release embraces a shifting field where objectives and architectures evolve rapidly, and prioritizes stable, adaptable design so methods remain usable in practice.