Use re.UNICODE

Bug #545452 reported by Max Rabkin
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Ibid
Triaged
Low
Unassigned

Bug Description

The regex flag re.U causes \b, \w, \s, etc. to match all appropriate characters, not just the ones in ASCII. In conjunction with re.I, it does correct (or at least better) case-insensitive matching for non-ASCII characters. We should probably use it everywhere.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.