AG-2024.02-1439·cs.CL·cross-listed: quant-ph
Developments in Sheaf-Theoretic Models of Natural Language Ambiguities
Authors
- Kin Ian Lo
- Mehrnoosh Sadrzadeh
- Shane Mansfield
Abstract
Sheaves are mathematical objects consisting of a base which constitutes a topological space and the data associated with each open set thereof, e.g. continuous functions defined on the open sets. Sheaves have originally been used in algebraic topology and logic. Recently, they have also modelled events such as physical experiments and natural language disambiguation processes. We extend the latter models from lexical ambiguities to discourse ambiguities arising from anaphora. To begin, we calculated a new measure of contextuality for a dataset of basic anaphoric discourses, resulting in a higher proportion of contextual models-82.9%-compared to previous work which only yielded 3.17% contextual models. Then, we show how an extension of the natural language processing challenge, known as the Winograd Schema, which involves anaphoric ambiguities can be modelled on the Bell-CHSH scenario with a contextual fraction of 0.096.
Submitted
7 February 20242 years ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2402.04505
Summary
Researchers use sheaves—mathematical structures from topology—to model how pronouns and other discourse ambiguities get resolved by context, finding that context plays a much larger role (83%) than previously measured.
- Sheaves, borrowed from pure mathematics, can represent how meaning depends on surrounding context, treating language ambiguity similarly to how quantum mechanics handles context-dependent properties.
- On a dataset of pronoun-resolution puzzles, the new approach identifies context as essential 83% of the time versus only 3% in prior work, suggesting previous models were missing something fundamental.
- The framework connects linguistic ambiguity to physics (via the Bell-CHSH inequality), hinting that natural language might exploit context in ways mathematically parallel to quantum entanglement.
curious · generated by claude-haiku-4-5
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.