Filtering, limiting, and sorting: Managing large search results lists

Making large sets of search results manageable — whether that is Google Scholar, PubMed, Web of Science, or any other large database — can seem overwhelming, mostly due to the sheer amount of information stored in these databases. Fragmentation of content, indexing and algorithmic complexity, and lack of transparency also contribute to the challenges of processing information found in these databases. The option of filtering, limiting, or sorting offers a pragmatic approach to handling information overload by creating subsets of search results and arranging them based on specific rules, criteria, or attributes. Most importantly, using these features increases efficiency and minimizes the likelihood of missing critical associations.

Filtering, Limiting, and Sorting 

The filtering technique enables us to maximize the efficiency of examining search results from different angles without compromising the integrity of the search itself. Filtering acts as a lens to magnify and illuminate otherwise invisible but potentially meaningful associations when conducting literature-based discovery. Although often confused with filtering, limiting serves a different purpose. It allows effective reduction of results to discover a known fact or study. Using the sorting and re-sorting option supports better browsing of search results by providing ordering options, based on attributes specific to each database.

The nuances among filtering, limiting, and sorting play a pivotal role in extracting meaningful insights from search results. 

Filtering 

Filtering refers to the process of creating and highlighting subsets of the search results based on specific rules or criteria. Contemporary databases and search engines have built-in relevancy filters, like PubMed’s "best match" default display. This same PubMed filter creates a list of “similar articles” for each record, as well as a “cited by” list. Similarly, Web of Science has filters that automatically create “relevant articles” subsets, and it offers a range of filters to analyze search results in many different ways. "Clinical queries" in PubMed is a collection of semantic filters that when applied produce highly relevant results based on strategically built search strings operating in the background. All databases provide readily available filters to slice and dice search results by most frequently used parameters. Finally, advanced searchers can create custom semantic filters.

Limiting

While filtering involves highlighting results based on specific criteria, limiting involves setting boundaries on the scope of the search. Limits might include specifying a particular language, timeframe, or gender and age of the population in question. Limiting helps users manage the breadth of information, ensuring that the results remain within defined parameters.

Sorting 

In bibliographic databases, sorting involves arranging search results based on attributes, such as publication date, relevance, or journal titles and author names. While sorting is a common feature, some databases offer unique options, such as the "times cited" or "usage" sorts found in Web of Science. 

Was this article helpful?
What made the article not helpful?