No Package(s) for Language-specific Stemming Dictionary and Affix Files
Bug #301770 reported by Duncan McGreggor
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Hardy Backports | Won't Fix | Undecided | Unassigned |
postgresql-8.3 (Ubuntu) | Fix Released | Undecided | Martin Pitt |
postgresql-common (Ubuntu) | Fix Released | Medium | Martin Pitt |
Bug Description
Binary package hint: postgresql-common
Currently, PostgreSQL 8.3 full text search only provides simple stemming support by default. postgresql-common does not install files needed to support full stemming. In order for Ubuntu-packaged PostgreSQL to support full stemming, ispell (or myspell or hunspell) dictionary and affix files for the desired languages need to be installed. They need to be UTF-8 files, and as of now, they need to be installed in the postgres "tsearch_data" directory.
If packaging support were provided, full text search with improved stemming could be used in environments that require Ubuntu packages for all software and source code installations.
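To illustrate what the installed files are for: once UTF-8 dictionary and affix files are present in tsearch_data, a dictionary is registered in PostgreSQL with CREATE TEXT SEARCH DICTIONARY using the ispell template, which looks for files ending in .dict and .affix. The following is a minimal sketch that builds such a statement; the helper name, the dictionary name "german_ispell" and the file stem "de_de" are assumptions for illustration and are not taken from this report.

```python
import re
from typing import Optional

# Hypothetical helper: only plain lowercase identifiers are accepted so that
# the generated SQL needs no quoting.
_IDENT = re.compile(r"^[a-z_][a-z0-9_]*$")


def create_ispell_dictionary_sql(dict_name: str, file_stem: str,
                                 stopwords: Optional[str] = None) -> str:
    """Build a CREATE TEXT SEARCH DICTIONARY statement for the ispell template.

    file_stem refers to <file_stem>.dict and <file_stem>.affix in tsearch_data.
    stopwords, if given, refers to <stopwords>.stop in the same directory.
    """
    for value in (dict_name, file_stem, stopwords):
        if value is not None and not _IDENT.match(value):
            raise ValueError(f"unsupported identifier: {value!r}")
    options = [
        "TEMPLATE = ispell",
        f"DictFile = {file_stem}",
        f"AffFile = {file_stem}",
    ]
    if stopwords is not None:
        options.append(f"StopWords = {stopwords}")
    body = ",\n".join("    " + option for option in options)
    return f"CREATE TEXT SEARCH DICTIONARY {dict_name} (\n{body}\n);"


if __name__ == "__main__":
    print(create_ispell_dictionary_sql("german_ispell", "de_de"))
```

After running the generated statement, a call such as SELECT ts_lexize('german_ispell', 'some_word'); can be used to check whether the dictionary recognises a word.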
Changed in postgresql-8.3:
assignee: nobody → pitti
status: New → In Progress
Changed in hardy-backports:
status: New → Won't Fix
As per our email discussion:
- We can't directly use the hunspell dictionaries in /usr/share/myspell/dicts/, since they are often not UTF-8 encoded, which PostgreSQL requires.
- p-common gets a dpkg trigger which iconvs the available hunspell dictionaries to /var/lib/postgresql/dicts/.
- p-common gets test cases based on Duncan's Launchpad-private scripts.
- p-8.3 gets a patch which falls back to /var/lib/postgresql/dicts/ if the requested dictionary is not found in the postgres tsearch_data/ directory.
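
The conversion and fallback steps above can be sketched roughly as follows. This is not the actual dpkg trigger or the p-8.3 patch; all function names are hypothetical. It assumes that each hunspell .aff file declares its encoding on a SET line (defaulting to ISO8859-1 when none is present), that the declared name is one Python's codecs recognise (otherwise a LookupError is raised), and that the output should use the .dict and .affix extensions that PostgreSQL's ispell template looks for.

```python
import re
from pathlib import Path
from typing import Optional

# Matches the encoding declaration in a hunspell .aff file, e.g. "SET ISO8859-1".
SET_LINE = re.compile(rb"^SET\s+(\S+)", re.MULTILINE)


def declared_encoding(aff_bytes: bytes, default: str = "ISO8859-1") -> str:
    """Return the encoding named on the SET line, or the assumed default."""
    match = SET_LINE.search(aff_bytes)
    return match.group(1).decode("ascii") if match else default


def convert_dictionary(src_dir: Path, dst_dir: Path, name: str) -> None:
    """Re-encode <name>.aff and <name>.dic as UTF-8 <name>.affix and <name>.dict."""
    aff_bytes = (src_dir / f"{name}.aff").read_bytes()
    dic_bytes = (src_dir / f"{name}.dic").read_bytes()
    encoding = declared_encoding(aff_bytes)
    dst_dir.mkdir(parents=True, exist_ok=True)
    # Decode with the declared encoding, then write the same text back as UTF-8.
    (dst_dir / f"{name}.affix").write_bytes(aff_bytes.decode(encoding).encode("utf-8"))
    (dst_dir / f"{name}.dict").write_bytes(dic_bytes.decode(encoding).encode("utf-8"))


def find_dictionary(name: str, tsearch_dir: Path, fallback_dir: Path) -> Optional[Path]:
    """Look in tsearch_data first, then in the fallback directory."""
    for directory in (tsearch_dir, fallback_dir):
        candidate = directory / f"{name}.dict"
        if candidate.exists():
            return candidate
    return None
```

With the directories named in this comment, a call might look like convert_dictionary(Path("/usr/share/myspell/dicts"), Path("/var/lib/postgresql/dicts"), "de_de"), where "de_de" is only an example stem.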