read-aloud and unicode

Bug #688258 reported by raj
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Internet Archive BookReader
New
Undecided
raj

Bug Description

When readaloud encounters unicode characters, it sometimes speaks out the unicode codepoint.

For example, on this page: http://www.archive.org/stream/dictionaryofpain01brya#page/6/mode/1up

The characters in djvu.xml look OK, but the gettext wrapper returns what seems to be unicode codepoints as ascii:
http://ia700308.us.archive.org/BookReader/BookReaderGetTextWrapper.php?path=/3/items/dictionaryofpain01brya/dictionaryofpain01brya_djvu.xml&page=21&callback=ttsStartCB

Em dashes get turned into \u2014, which then gets read aloud as percent-you-2-0-1-4...

Also, some doublequotes get turned into \u201e.

raj (raj-archive)
Changed in bookreader:
milestone: none → morebooksinbrowsers
assignee: nobody → raj (raj-archive)
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.