diff doesn't treat non-breaking space as whitespace

Bug #31738 reported by Andrew Pam
22
This bug affects 1 person
Affects Status Importance Assigned to Milestone
diffutils (Ubuntu)
Confirmed
Wishlist
Unassigned

Bug Description

When "diff" is run with the "-b" or "-w" options, it fails to treat non-breaking space characters (U+00A0) as whitespace.

Revision history for this message
Simon Law (sfllaw) wrote :

From what I understand, the non-breaking space character is a printable graphical punctuation that must not be considered whitespace.

Changed in diffutils:
status: Unconfirmed → Rejected
Revision history for this message
Andrew Pam (xanni) wrote : Re: [Bug 31738] Re: diff doesn't treat non-breaking space as whitespace

On Fri, Apr 28, 2006 at 06:09:42AM -0000, Simon Law wrote:
> >From what I understand, the non-breaking space character is a printable
> graphical punctuation that must not be considered whitespace.

Documentation please?

There are now programming languages such as Perl and Java in which the
code itself can be encoded as UTF-8, and the non-breaking space may
indeed be valid whitespace. Perhaps diff needs a new option to treat
this character as whitespace?

Regards,
 Andrew
--
mailto:<email address hidden> Andrew Pam
http://www.xanadu.com.au/ Chief Scientist, Xanadu
http://www.glasswings.com.au/ Partner, Glass Wings
http://www.sericyb.com.au/ Manager, Serious Cybernetics

Changed in diffutils:
status: Rejected → Unconfirmed
Revision history for this message
Connor Imes (ckimes) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. You reported this bug a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue for you. Can you try with the latest Ubuntu release? Thanks in advance.

Changed in diffutils:
status: New → Incomplete
Revision history for this message
Ralph Janke (txwikinger) wrote :

We'd like to figure out what's causing this bug for you, but we haven't heard back from you in a while. Could you please provide the requested information? Thanks!

Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for diffutils (Ubuntu) because there has been no activity for 60 days.]

Changed in diffutils (Ubuntu):
status: Incomplete → Expired
Revision history for this message
Andrew Pam (xanni) wrote :

This issue still exists with diffutils 1:2.8.1-18 in Ubuntu 10.04 LTS (Lucid Lynx)

$ diff -w -u test1 test2
--- test1 2010-11-19 16:26:59.775179676 +1100
+++ test2 2010-11-19 16:27:11.912635830 +1100
@@ -1,2 +1,2 @@
 space
-non-breaking space
+ non-breaking space

Revision history for this message
Andrew Pam (xanni) wrote :
Changed in diffutils (Ubuntu):
status: Expired → New
Revision history for this message
Andrew Pam (xanni) wrote :

Note: MKS diff correctly handles Unicode.

Changed in diffutils (Ubuntu):
status: New → Confirmed
Revision history for this message
Hans Ginzel (hans-matfyz) wrote :

Bug still exists in Ubuntu 14.04 LTS (trusty).

What about thin space (U+2009) or narrow no-break space (U+202F)?

See http://en.wikipedia.org/wiki/Thin_space, http://en.wikipedia.org/wiki/Whitespace_character.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.