# Example: Demonstrates RLCER-style rubric validity filtering and CoT reward shaping. """Tutorial 01: RLCER rubric validity filter on toy rollouts.""" # Pearson ...