Evaluating the Ability of Large Language Models to Reason About Cardinal Directions (Short Paper)

Authors Anthony G Cohn, Robert E Blackwell

Anthony G Cohn
  • School of Computing, University of Leeds, UK
Robert E Blackwell
  • Alan Turing Institute, London, UK


We thank the anonymous referees for their helpful comments. We also thank the Microsoft Research Accelerating Foundation Models Research program for the provision of Azure resources to access GPT, which were used in the early stages of the work.

Anthony G Cohn and Robert E Blackwell. Evaluating the Ability of Large Language Models to Reason About Cardinal Directions (Short Paper). In 16th International Conference on Spatial Information Theory (COSIT 2024). Leibniz International Proceedings in Informatics (LIPIcs), Volume 315, pp. 28:1-28:9, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024)


We investigate the abilities of a representative set of Large Language Models (LLMs) to reason about cardinal directions (CDs). To do so, we create two datasets: the first, co-created with ChatGPT, focuses largely on recall of world knowledge about CDs; the second is generated from a set of templates, comprehensively testing an LLM’s ability to determine the correct CD given a particular scenario. The templates allow for a number of degrees of variation, such as the means of locomotion of the agent involved and whether the scenario is set in the first, second or third person. Our experiments show that although LLMs are able to perform well on the simpler dataset, on the second, more complex dataset no LLM is able to reliably determine the correct CD, even with a temperature setting of zero.
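
The template-based generation described above can be illustrated with a minimal sketch. The template wording, the locomotion list, and the names `TEMPLATE`, `PERSONS`, `LOCOMOTION`, `LEFT_OF` and `generate_questions` are all hypothetical and chosen only for illustration; they are not the templates or code used in the paper. The sketch shows how varying grammatical person, means of locomotion and initial heading expands one template into many questions, each with a known correct CD.

```python
from itertools import product

# Hypothetical template, NOT taken from the paper.
TEMPLATE = (
    "{subject} {locomotion} {heading}. After a 90 degree turn to the left, "
    "which direction {question} facing?"
)

# Grammatical person: (statement form, question form). Illustrative only.
PERSONS = {
    "first": ("I am", "am I"),
    "second": ("You are", "are you"),
    "third": ("She is", "is she"),
}

# Illustrative means of locomotion (assumed, not the paper's list).
LOCOMOTION = ["walking", "cycling", "driving"]

# Heading after a 90 degree left turn, keyed by the initial heading.
LEFT_OF = {"north": "west", "west": "south", "south": "east", "east": "north"}


def generate_questions():
    """Expand the template over every combination of the variation axes.

    Returns a list of (prompt, expected_answer) pairs.
    """
    questions = []
    for (subject, question), locomotion, heading in product(
        PERSONS.values(), LOCOMOTION, LEFT_OF
    ):
        prompt = TEMPLATE.format(
            subject=subject,
            locomotion=locomotion,
            heading=heading,
            question=question,
        )
        questions.append((prompt, LEFT_OF[heading]))
    return questions


if __name__ == "__main__":
    for prompt, answer in generate_questions()[:3]:
        print(prompt, "->", answer)
```

Because each prompt is paired with an answer computed from the same parameters, a model's responses can be scored automatically across all variations.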

Subject Classification

ACM Subject Classification
  • Computing methodologies → Spatial and physical reasoning
Keywords
  • Large Language Models
  • Spatial Reasoning
  • Cardinal Directions


