2026.01.30Chapter 8: Implementing Basic RLHF Workflows with TunixTunix RLHF LLMLearn how to implement basic RLHF workflows with Tunix for creating helpful and aligned Language Models.ACCESS_FILE >>
2026.01.30Chapter 12: Advanced RLHF Strategies and Proximal Policy Optimization (PPO)Tunix JAX LLMLearn advanced RLHF strategies, focusing on Proximal Policy Optimization (PPO) with Tunix.ACCESS_FILE >>
2026.01.30Chapter 14: Project 2: Aligning an LLM for Factual AccuracyTunix LLM JAXLearn to align an LLM for factual accuracy using Tunix, a JAX-native framework.ACCESS_FILE >>