The benchmark tests LLMs on their ability to play FizzBuzz correctly. This is the system prompt: You are playing FizzBuzz with the following rules: If a number is divisible by {fizz_num}, say 'fizz' ...
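To make the rule in the prompt concrete, here is a minimal sketch of how a reference answer for one turn could be computed and compared with a model's reply. Only `{fizz_num}` and the word 'fizz' are visible in the prompt excerpt. The `buzz_num` parameter, the 'buzz' word, the combined 'fizzbuzz' reply, the default values, and both function names are assumptions made for illustration, not the benchmark's actual implementation.

```python
def expected_reply(n: int, fizz_num: int = 3, buzz_num: int = 5) -> str:
    """Return the reply a correct player would give for the number n.

    Assumption: besides the visible fizz_num rule, there is a second
    divisor (buzz_num, hypothetical) and the two words concatenate
    when both divisors apply.
    """
    reply = ""
    if n % fizz_num == 0:
        reply += "fizz"
    if n % buzz_num == 0:
        reply += "buzz"
    # Fall back to the number itself when neither rule applies.
    return reply or str(n)


def is_correct(model_reply: str, n: int, fizz_num: int = 3, buzz_num: int = 5) -> bool:
    """Compare a model's reply with the reference, ignoring case and outer whitespace."""
    return model_reply.strip().lower() == expected_reply(n, fizz_num, buzz_num)
```

Passing the divisors as parameters mirrors the `{fizz_num}` placeholder: a benchmark that varies the divisors cannot be solved by recalling the familiar 3-and-5 sequence.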
This Python program iterates over the integers from 1 to 50. For multiples of three, it prints "Fizz" instead of the number, and for multiples of five, it prints "Buzz". For numbers which are multiples ...
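The program described above could look roughly like the following sketch. The description is cut off before the last rule, so the handling of numbers divisible by both three and five (printing "FizzBuzz") is assumed from the usual convention. The function name `fizzbuzz_lines` is hypothetical.

```python
def fizzbuzz_lines(limit: int = 50) -> list[str]:
    """Build the FizzBuzz output for the integers 1..limit, one string per number."""
    lines = []
    for n in range(1, limit + 1):
        if n % 15 == 0:
            # Assumed rule: multiples of both three and five.
            lines.append("FizzBuzz")
        elif n % 3 == 0:
            lines.append("Fizz")
        elif n % 5 == 0:
            lines.append("Buzz")
        else:
            lines.append(str(n))
    return lines


if __name__ == "__main__":
    # Print one entry per line, as the described program does.
    for line in fizzbuzz_lines():
        print(line)
```

The combined case is checked first because a number divisible by 15 would otherwise be caught by the "Fizz" branch.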