crash on checkout from SVN WC w/ unicode name
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Bazaar Subversion Plugin |
New
|
Undecided
|
Unassigned |
Bug Description
Using bzr-svn 0.4 branch rev 853, I tried the following:
1. svn co file://
2. bzr branch trunk trunk-bzr
This usually works flawlessly. However, this repository has one directory named "I²C" -- I've verified it's correct UTF-8 in SVN, and my locale is (and always has been) UTF-8. The directory name, when checked out with SVN, is definitely UTF-8 as well. See below for an example:
$ ls -d I²C/ | xxd
0000000: 49c2 b243 2f0a I..C/.
Note that 0xc2 0xb2 is U+00B2 ("²") in UTF-8.
Anyway, the error I get is the following:
$ bzr branch trunk trunk-bzr
bzr: ERROR: libsvn.
Traceback (most recent call last):
File "/usr/lib/
return run_bzr(argv)
File "/usr/lib/
ret = run(*run_argv)
File "/usr/lib/
return self.run(
File "/usr/lib/
accelerator
File "/usr/lib/
return bzrdir.
File "/usr/lib/
tree = self.open_
File "/home/
return SvnWorkingTree(
File "/home/
status = svn.wc.
File "/var/lib/
return apply(_
SubversionExcep
bzr 1.1.0.candidate.1 on python 2.4.4.final.0 (linux2)
arguments: ['/usr/bin/bzr', 'branch', 'trunk', 'trunk-bzr']
encoding: 'UTF-8', fsenc: 'UTF-8', lang: 'en_US.UTF-8'
plugins:
bzrtools /usr/lib/
email /usr/lib/
gtk /usr/lib/
launchpad /usr/lib/
multiparent /usr/lib/
rebase /usr/lib/
svn /home/wjlanda/
*** Bazaar has encountered an internal error.
Please report a bug at https:/
including this traceback, and a description of what you
were doing when the error occurred.
Please let me know if there is other information I can provide to help fix this.
This may or may not be related, but I'll mention it. If I try to do a branch directly from the svn repo (which also normally works great), I get the following error, after about 5 minutes:
$ bzr branch svn+file: ///path/ to/repo trunk-bzr
bzr: ERROR: Path "I²C" is not unicode normalized
I tried running with -vv but didn't get any extra output. The path seems encoded just fine, looking at it this way:
$ bzr branch svn+file: ///path/ to/repo trunk-bzr 2<&1 | xxd
0000000: 627a 723a 2045 5252 4f52 3a20 5061 7468 bzr: ERROR: Path
0000010: 2022 49c2 b243 2220 6973 206e 6f74 2075 "I..C" is not u
0000020: 6e69 636f 6465 206e 6f72 6d61 6c69 7a65 nicode normalize
0000030: 640a
Again note that the 0xc2 0xb2, which is correct UTF-8 for U+00B2 ("²").
(Using svn+https vs. svn+file doesn't make any difference here, BTW.)
Anyway, if this ends up being unrelated and the previous error gets fixed, I'll go ahead and file this part as a separate bug. I think these might be related because the "I²C" name is the only non-ASCII name in the whole repository.