Feeds with non-UTF8 characters can't be parsed
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
PlanetFilter |
Fix Released
|
Medium
|
François Marier |
Bug Description
The attached feed contains a bad UTF-8 character (according to "file", it's an ISO-8859-1 file) and fails to parse with the following error:
Warning: 'http://
70)
Traceback (most recent call last):
File "/usr/bin/
if main():
File "/usr/bin/
return process_
File "/usr/bin/
document = parse_feed(
File "/usr/bin/
noentities = remove_
File "/usr/bin/
ret = contents.
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd2 in position 457: invalid continuation byte
Changed in planetfilter: | |
status: | Fix Committed → Fix Released |