Regex behavior in Falko metadata is incorrect

Bug #967096 reported by Amir Zeldes
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
ANNIS
Fix Committed
Undecided
Thomas Krause

Bug Description

When searching for regex in annotations, it is assumed the user means an initially and finally anchored string, i.e. the query

tok=/[Uu]nd/

is actually looking for the expression /^[Uu]nd$/. As a consequence, this query does not find "Hund". This is correct.

In metadata, this does not happen correctly. Looking in FalkoEssaysL2, this query:

tok & meta::l1_1=/[^e].+/

finds texts with l1_1="eng", because "eng" contains [^e].+ as a substring: the "ng" part. This query should also be treated as looking for l1_1=/^[^e].+$/, so that metadata and annotations behave in the same way.

Tags: bug

Related branches

Thomas Krause (krause)
Changed in annis:
milestone: none → 3.0.0
status: New → In Progress
Thomas Krause (krause)
Changed in annis:
status: In Progress → Fix Committed
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.