tidy removes important whitespace

Bug #23304 reported by David Mandelberg
4
Affects Status Importance Assigned to Milestone
HTML Tidy
Unknown
Unknown
tidy (Debian)
Unknown
Unknown
tidy (Ubuntu)
Incomplete
Medium
Unassigned

Bug Description

When I run tidy (full command: "tidy -xml -access 3 --indent auto
--input-encoding utf8 --output-encoding ascii --quote-marks yes") on an xhtml
file, it removes whitespace between words when the first word is a link.

Revision history for this message
David Mandelberg (dseomn) wrote :

Created an attachment (id=4373)
xhtml file before running tidy

line 54: notice the space in 'bash</a> script'

Revision history for this message
David Mandelberg (dseomn) wrote :

Created an attachment (id=4374)
xhtml file after running tidy

line 57: notice the lack of a space in 'bash</a>script'

Revision history for this message
David Mandelberg (dseomn) wrote :

Note: if you're up for a ~15MB download from my ADSL connection and need more
info about how I'm calling tidy or something simillar, you can download one of
the source tarballs from http://eth0.is-a-geek.org/

Revision history for this message
David Mandelberg (dseomn) wrote :

It's not just links, but other tag(s) too.

On http://eth0.is-a-geek.org/about-me/ 'Mandelberg</em>and' should be
'Mandelberg</em> and'.

Revision history for this message
Ben (bugzilla-ubuntu-com-inelegant) wrote :

I can reproduce this with the latest binary from http://tidy.sourceforge.net/ as
well.

A simple test case seems to be:

echo "<p><a href='/'>foo</a> bar</p>" | tidy --quiet yes --show-warnings no
--input-xml yes

This produces:

"<p>
<a href='/'>foo</a>bar</p>"

There are multiple bug reports about whitespace removal/addition in "-xml" mode
on http://sourceforge.net/tracker/?group_id=27659 . For instance:
http://sourceforge.net/tracker/?group_id=27659&atid=390963&func=detail&aid=449348
and
http://sourceforge.net/tracker/?group_id=27659&atid=390963&func=detail&aid=1104229
. I _think_ creating a minimal testcase and reporting it upstream may be a good
next step...

Matt Zimmerman (mdz)
Changed in tidy:
status: Unconfirmed → Confirmed
Revision history for this message
rusivi2 (rusivi2-deactivatedaccount) wrote :

Thank you for posting this bug.

Edgy is in End of Life status. Please update and repost detailed error report.

Changed in tidy (Ubuntu):
status: Confirmed → Incomplete
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.