Bug #1152863 “Support for traditional Boolean operators” : Bugs : Evergreen

Revision history for this message

Dan Scott (denials) wrote on 2013-03-09:

#1

After a quick read of the code, _prepare_biblio_search_boolean() concerns me; it's globbing and parsing files on disk for each and every search? Typically that would happen once, at startup time, and the results would be cached thereafter.

The location of the boolean localization files in the "/location/" directory seems a bit strange. Is that a placeholder for a better name?

Also, if I understand correctly, searching for "To be or not to be" is no longer going to result in highly relevant results for that phrase (as just a quick example that comes to mind). That concerns me, too, as a major change in behaviour.

Revision history for this message

Kathy Lussier (klussier) wrote on 2013-03-09:

#2

Hey Dan,

Speaking to your last point, our expectation is that the user who clicks on the Boolean search tab would be a person more knowledgeable of search strategy than the typical user who is entering searches on the Basic or Advanced search pages. If they know enough about search to use those Boolean operators, I would expect they know enough to enclose a phrase like "to be or not to be" in quotation marks. When the final coding is complete, those quoted searches will continue to search as a phrase without any conversion of "or" to ||

I would never suggest replacing Evergreen's handling of those search terms in our current search interfaces. The ability to use "and," "or", "not" as search terms is something we value and would not want to lose.

Revision history for this message

Thomas Berezansky (tsbere) wrote on 2013-03-09:

#3

I am concerned with the fact that the branch appears to be missing any and all new files? At least one template file seems to be outright missing right now.

I am also not sure that having different limiter options for advanced and boolean searches makes sense. Re-using the advanced limiter set seems like it would make the most sense there.

Revision history for this message

Dan Scott (denials) wrote on 2013-03-09:

#4

Kathy: Right, thanks for the clarification. In the light of morning it's clear that the changes are meant to be limited to one interface. But in some ways I think that the goal is even weirder: the information literacy people think it's better to teach to one special interface that accepts a particular dialect of && / || / - instead of just teaching AND = && / OR = || / NOT = - which can be used almost everywhere else in Evergreen? Do they expect users to grok the Venn diagrams that are undoubtedly part of the Boolean logic lessons, but they don't think that those same users will be capable of understanding symbols as Boolean operators? I fear tears and recriminations as the users subsequently attempt to apply their lessons to the basic search box / advanced search box and encounter failure.

It seems to me that solving the root problems your academics identified (giving the advanced search box the ability to nest Boolean queries & providing contextual help that OR = || and NOT = - in this particular system) might be a better area to focus energies.

Thomas: Right; I noticed that last night but thought that perhaps the templates were still on their way. I echo your concern that introducing more potential interface inconsistency / configuration complexity via separate limiter options is not desirable.

Other notes:

* I don't think we really want the new $logger->info() calls in Search.pm.
* (nit) s/substatutes/substitutes/ in comments
* (nit) s/l18n/i18n/ (I think?) in comments
* (super minor nit) Two empty lines were removed in Search.pm -- arguably making the code harder to read because there is no separation between logical blocks of code--but in any case whitespace changes are typically kept in separate commits.

Jason Stephenson (jstephenson) on 2013-03-11

Changed in evergreen:
importance:	Undecided → Wishlist
status:	New → Triaged

Revision history for this message

Kathy Lussier (klussier) wrote on 2013-03-11:

#5

Download full text (3.3 KiB)

Thanks Dan and Thomas!

I'll share the coding concerns with the developers, if they haven't seen them already.

Dan, you asked the question of whether we could just teach students AND = && / OR = || / NOT = -

The answer I've received repeatedly from our academics is 'no', and I can see their point. If it were just a matter of teaching NOT = -, there would be no problem as this has become a de facto standard on the Web. Using && and || is just not intuitive. In the case of the latter, you might need to first start the lesson by making sure everyone knows where the pipe key lives on the keyboard.

AND and OR (or whatever their equivalents are in other languages) are intuitive because they have real meaning in the language that the user speaks. You make the valid point that there might be frustration because the same search strategy doesn't work in the basic and advanced search interfaces. However, on the flip side, there might also be the frustration that the more intuitive operators can be used in all of the electronic resources provided by their library except the catalog.

You also suggested that we focus on giving the advanced search box the ability to nest Boolean queries. This focus was actually how we started thinking about the project. I looked through other search interfaces for examples on how we could support more complex nesting without adding further complication to the interface. While we had some ideas of ways to support more complexity in the nesting, each of these ideas seemed to further complicate the advanced search interface, making it more confusing not only for those who might want the complex nesting, but also for those who might just use the advanced search interface to add more limiters to the search.

Ultimately, my team here believed that manually entering complex Boolean queries with the more intuitive operators was more straightforward than improving the GUI to support the more complex nesting.

I'm not saying it's a perfect solution, and we certainly were open to working with the community to support the best implementation. However, when I posted a message about the project to the general list last month, the only feedback I received was a question regarding the release that would see this new feature and an e-mail about ongoing work for an Evergreen add-on search option that would also support traditional Boolean operators - http://markmail.org/message/n7j2s363jsj6asgp.

My hope was that if there were any concerns about this approach being "weird," that I would have heard it at that time BEFORE the coding had begun. If I had received this type of feedback at that time, I could have delayed the start of coding and then explored alternate approaches with the community. Given the feedback that I received, I assumed that people either a) liked the approach, b) didn't care or c) saw the bit about there being a setting to disable it and mentally made a note to disable it when the time came.

I hope you can understand that, at this point, we are limited in our ability to make major changes to those larger implementation details. If the additional config.tt2 options to identify limiters on the Boolean search tab are...