PhD Thesis

Download it and have a wee look!
Lilybank Gardens, Glasgow, Scotland - May 2018

Modelling Search and Stopping in Interactive Information Retrieval

University of Glasgow Crest David Martin Maxwell
School of Computing Science
University of Glasgow
April 2019
Download PDF Here Or click/tap here for a chapter summary

Searching for information when using a computerised retrieval system is a complex and inherently interactive process. Individuals during a search session may issue multiple queries, and examine a varying number of result summaries and documents per query. Searchers must also decide when to stop assessing content for relevance – or decide when to stop their search session altogether. Despite being such a fundamental activity, only a limited number of studies have explored stopping behaviours in detail, with a majority reporting that searchers stop because they decide that what they have found feels "good enough". Notwithstanding the limited exploration of stopping during search, the phenomenon is central to the study of Information Retrieval, playing a role in the models and measures that we employ. However, the current de facto assumption considers that searchers will examine k documents – examining up to a fixed depth.

In this thesis, we examine searcher stopping behaviours under a number of different search contexts. We conduct and report on two user studies, examining how result summary lengths and a variation of search tasks and goals affect such behaviours. Interaction data from these studies are then used to ground extensive simulations of interaction, exploring a number of different stopping heuristics (operationalised as twelve stopping strategies). We consider how well the proposed strategies perform and match up with real-world stopping behaviours. As part of our contribution, we also propose the Complex Searcher Model (see below), a high-level conceptual searcher model that encodes stopping behaviours at different points throughout the search process. Within the Complex Searcher Model, we also propose a new results page stopping decision point. From this new stopping decision point, searchers can obtain an impression of the page before deciding to enter or abandon it.

The Complex Searcher Model The Complex Searcher Model, a key contribution of this thesis.

Results presented and discussed demonstrate that searchers employ a range of different stopping strategies, with no strategy standing out in terms of performance and approximations offered. Stopping behaviours are clearly not fixed, but are rather adaptive in nature. This complex picture reinforces the idea that modelling stopping behaviour is difficult. However, simplistic stopping strategies do offer good performance and approximations, such as the frustration-based stopping strategy. This strategy considers a searcher's tolerance to non-relevance. We also find that combination strategies – such as those combining a searcher's satisfaction with finding relevant material, and their frustration towards observing non-relevant material – also consistently offer good approximations and performance. In addition, we also demonstrate that the inclusion of the additional stopping decision point within the Complex Searcher Model provides significant improvements to performance over our baseline implementation. It also offers improvements to the approximations of real-world searcher stopping behaviours.

This work motivates a revision of how we currently model the search process and demonstrates that different stopping heuristics need to be considered within the models and measures that we use in Information Retrieval. Measures should be reformed according to the stopping behaviours of searchers. A number of potential avenues for future exploration can also be considered, such as modelling the stopping behaviours of searchers individually (rather than as a population), and to explore and consider a wider variety of different stopping heuristics under different search contexts. Despite the inherently difficult task that understanding and modelling the stopping behaviours of searchers represents, potential benefits of further exploration in this area will undoubtedly aid the searchers of future retrieval systems – with further work bringing about improved interfaces and experiences.

Links and Facts

Copyright & Attribution

  • Thesis Copyright© David Maxwell, 2019.
  • Funding

Funding for my PhD (fees & stipend) was provided by the EPSRC, under grant number 1367507.

  • Artwork, Images & Fonts

Several illustrations within this thesis (and the footer image of this page) incorporate artworks from freepik.com. These assist in explaining concepts. You can view a list of all the third-party artworks used on my GitHub. Artworks are used under the assumption that no commercial gain will be made from their inclusion.

The IKEA assembly man introduced Part II of this thesis. Permission from Inter IKEA Holding S.A. was granted to incorporate the image. Thanks, IKEA!

The typeface Foundry Sterling was used for titles, headers and illustrations within this thesis. A licence permitting use of this typeface to be embedded within the PDF was acquired by the author.

Finally, everyone shown in Picture 1 in The PhD Journey granted permission for the image to be used. Thanks, everybody — I love you all!

Icons above from Font Awesome under CC 4.0 Attribution