searching for exact title prefix gets no matches (text search issue)

Bug #768685 reported by pablomme
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Launchpad itself
Triaged
High
Unassigned
apport (Ubuntu)
Invalid
Undecided
Unassigned

Bug Description

Problem
=======

Searching for "package blcr-dkms 0.8.2-15ubuntu1 failed to install/upgrade: blcr kernel module failed to build" does not find bug 700036 which at the time of writing has tyle "package blcr-dkms 0.8.2-15ubuntu1 failed to install/upgrade: blcr kernel module failed to build - error: ‘struct signal_struct’ has no member named ‘count’ "

Analysis
========

Probably a fti vectorisation issue; need to determine the search terms being looked up and those present in the fti vector for the bug.

Tags: search
Revision history for this message
Brian Murray (brian-murray) wrote :

This is actually a problem with Launchpad itself and its duplicate finder feature. I've made sure that the duplicate finder is turned on for blcr - it is.

Additionally, I've written an apport bug pattern for bug 760036 so if people encounter this bug they'll be redirected to that bug number rather than going through the process of filing another bug report.

Changed in apport (Ubuntu):
status: New → Invalid
Revision history for this message
pablomme (pablomme) wrote :

Apport now finds the bug, thanks.

summary: - Launchpad does not find that bug is duplicate of #700036
+ searching for exact title prefix gets no matches (text search issue)
description: updated
Changed in launchpad:
status: New → Triaged
importance: Undecided → High
tags: added: search
Revision history for this message
Curtis Hovey (sinzui) wrote :

the fti (free text index) is very clever when it handles word stems and punctuation. Too clever in some cases because exact matches are not found or are not ranked as a top match :(

Revision history for this message
Abel Deuring (adeuring) wrote :

Searching for the "versioned package names" like "blcr-dkms 0.8.2-15ubuntu1" should work, once the fix for ug 29713 lands. The main issue was probably the bad processing of '-' in ftq().

select to_tsvector('blcr-dkms 0.8.2-15ubuntu1');
                          to_tsvector
---------------------------------------------------------------
 '-15':5 '0.8.2':4 'blcr':2 'blcr-dkms':1 'dkms':3 'ubuntu1':6
(1 row)

select ftq('blcr-dkms 0.8.2-15ubuntu1');
                             ftq
-------------------------------------------------------------
 'blcr-dkms' & 'blcr' & 'dkms' & '0.8.2' & '-15' & 'ubuntu1'
(1 row)

select to_tsvector('blcr-dkms 0.8.2-15ubuntu1') @@ ftq('blcr-dkms 0.8.2-15ubuntu1');
 ?column?
----------
 t
(1 row)

So, the version string is cut into three parts: '0.8.2', '-15', 'ubuntu1'. It might make sense to add a special tokenizer for these version strings.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.