DeeTextAnalyzer feature checklist

Bug #885600 reported by Mikkel Kamstrup Erlandsen on 2011-11-03
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Unity
Triaged
Undecided
Unassigned
dee
High
Mikkel Kamstrup Erlandsen
dee (Ubuntu)
Undecided
Unassigned
unity (Ubuntu)
Undecided
Unassigned

Bug Description

This is a tracker bug to help me remember which features I want in DeeTextAnalyzer:

 - Detect numeric sub sequences. Fx "Foo125" -> "foo", "125"
 - Split on "CamelCase" -> "camel", "case"
 - Detect and create CJK n-grams (and tokenize CJK subsequences when embedded in non-CJK text)

Changed in dee:
status: New → Triaged
importance: Undecided → High
assignee: nobody → Mikkel Kamstrup Erlandsen (kamstrup)
milestone: none → 1.0.0
Didier Roche (didrocks) on 2011-11-22
Changed in unity:
status: New → Triaged
Changed in dee (Ubuntu):
status: New → Triaged
Changed in dee:
milestone: 1.0.0 → none
Changed in unity (Ubuntu):
status: New → Confirmed
To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers