periods should be normalized to empty string for search
Bug #965430 reported by
Galen Charlton
This bug affects 7 people
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Evergreen |
Confirmed
|
Medium
|
Unassigned |
Bug Description
At present, search_normalizer() and naco_normalizer() map periods to blanks, so using the default index definitions, strings that contain periods would get normalized like this:
"U.S.S.R." => "U S S R"
"USSR" => "USSR"
It would be desirable to allow "U.S.S.R" and "USSR" to retrieve the same sets of records, and one way of implementing this would be to tweak search_normalizer() to either collapse all periods to empty strings.
So, as a discussion item: does anybody see any pitfalls?
Evergreen: 2.0 and later
Changed in evergreen: | |
importance: | Undecided → Wishlist |
milestone: | none → 2.2.0beta1 |
tags: | added: indexing search |
Changed in evergreen: | |
milestone: | 2.2.0beta1 → 2.2.0rc1 |
Changed in evergreen: | |
milestone: | 2.2.0rc1 → 2.2.0 |
Changed in evergreen: | |
milestone: | 2.2.0 → 2.2.1 |
Changed in evergreen: | |
milestone: | 2.2.1 → 2.3.0-alpha2 |
Changed in evergreen: | |
milestone: | 2.3.0-alpha2 → 2.3.0-beta1 |
Changed in evergreen: | |
milestone: | 2.3.0-beta1 → none |
Changed in evergreen: | |
status: | New → Incomplete |
Changed in evergreen: | |
status: | Incomplete → Triaged |
Changed in evergreen: | |
status: | Triaged → Confirmed |
importance: | Wishlist → Medium |
To post a comment you must log in.
I wonder if we should instead add a collapse_periods normalizer and put that before search_normalize on the appropriate fields?