The benchmark tests LLMs on their ability to play Fizzbuzz correctly. This is the system prompt: You are playing FizzBuzz with the following rules: If a number is divisible by {fizz_num}, say 'fizz' ...
This Python program iterates the integers from 1 to 50. For multiples of three, it prints "Fizz" instead of the number and for the multiples of five, it prints "Buzz". For numbers which are multiples ...