htmldiff creates erroneous diff
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
lxml |
New
|
Undecided
|
Unassigned |
Bug Description
I reported that previously in the Github issue tracker, but this has been deactivated, so I'll add it in here again.
I'm creating a new issue instead of adding information to bug 315511 as this is a much simpler example, which doesn't involve nesting and only uses one tag type and thus might be easier to fix:
Htmldiff does the following:
>>> htmldiff(
u'<
However the correct result would be - there's no need to touch the `<div>H</div>`.
>>> htmldiff(
u'<
A similar example from the original Github issue is the following:
>>> htmldiff(
u'<
instead of
u'<
This happens in lxml 3.0.1 as well as 3.2.1.
Thanks for looking into that stuff again at some time!