LLMike: Exploring Large Language Models’ Abilities in Wheel of Fortune Riddles

Ejdis Gjinika; Nicola Arici; Andrea Loreggia; Luca Putelli; Ivan Serina; Alfonso Emilio Gerevini
2025-01-01

Abstract

A riddle from the game show “Wheel of Fortune” consists of a hidden sentence that the player uncovers by starting from a simple clue and iteratively guessing letters. Although the game is popular and intuitive, solving one of these riddles is not trivial. To interpret the clue, identify the most probable letters, and exploit the game’s mechanics effectively, a player needs linguistic ability, world knowledge, and some form of strategic thinking. The goal of this study is to assess whether Large Language Models (LLMs) possess the abilities needed to solve Wheel of Fortune riddles. We propose a software framework called LLMike in which an algorithmic Game Master interacts with an LLM: it prompts the model, enforces the game’s rules, updates the hidden sentence based on the model’s guesses, and evaluates the correctness of those guesses. We study several models of different sizes, evaluating their performance, behavioural patterns, and common types of errors. Our dataset and code are available at https://github.com/ejdisgjinika/LLMike.
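The Game Master's bookkeeping described in the abstract (enforcing the rules, updating the hidden sentence, and checking correctness) can be illustrated with a minimal sketch. This is not the LLMike implementation: the function names `mask`, `apply_guess`, and `is_solved`, the one-letter-per-turn rule, and the underscore masking convention are all assumptions made here for illustration, and prompting the LLM is left out entirely.

```python
def mask(sentence: str, guessed: set[str]) -> str:
    """Hide every letter not yet guessed; spaces and punctuation stay visible."""
    return "".join(
        ch if (not ch.isalpha() or ch.upper() in guessed) else "_"
        for ch in sentence
    )


def apply_guess(sentence: str, guessed: set[str], guess: str) -> tuple[bool, str]:
    """Validate a one-letter guess, record it, and return (hit, updated view).

    Hypothetical rules: a guess must be a single letter that has not
    already been played; anything else is rejected with ValueError.
    """
    letter = guess.strip().upper()
    if len(letter) != 1 or not letter.isalpha():
        raise ValueError("a guess must be a single letter")
    if letter in guessed:
        raise ValueError("letter already guessed")
    guessed.add(letter)
    hit = letter in sentence.upper()
    return hit, mask(sentence, guessed)


def is_solved(sentence: str, attempt: str) -> bool:
    """Compare a full-sentence attempt, ignoring case and extra whitespace."""
    def normalise(text: str) -> str:
        return " ".join(text.upper().split())
    return normalise(sentence) == normalise(attempt)
```

In a full loop, the masked view returned by `apply_guess` would be placed in the next prompt, and a rejected guess would be reported back to the model instead of being silently ignored.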
Files in this record:
46_main_long.pdf (open access)
Type: Full Text
License: Publisher's copyright
Size: 1.63 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11379/635286