unicodedecode still kills things
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
GNU Mailman | Fix Released | Undecided | Mark Sapiro |
Bug Description
Smells like: 265976 is incurable...
# pkg_info | egrep "(python|mailman)"
mailman-2.1.11 A mailing list manager (MLM) with a user-friendly web front
python25-2.5.2_3 An interpreted object-oriented programming language
# uname -a
FreeBSD thurkler 6.2-RELEASE FreeBSD 6.2-RELEASE #0: Wed Jan 31 11:14:12 JST 2007
Dec 15 19:35:20 2008 (16881) Uncaught runner exception: 'ascii' codec can't decode byte 0xa1 in position 1: ordinal not in range(128)
Dec 15 19:35:20 2008 (16881) Traceback (most recent call last):
File "/usr/local/
self.
File "/usr/local/
keepqueued = self._dispose(
File "/usr/local/
mlist.
File "/usr/local/
h.processUn
File "/usr/local/
self.
File "/usr/local/
filename))
File "/usr/local/
f.write(
File "/usr/local/
atmark = unicode(_(' at '), cset)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xa1 in position 1: ordinal not in range(128)
Dec 15 19:35:20 2008 (16881) SHUNTING: 1229337318.
Date: Mon, 15 Dec 2008 19:36:04 +0900
From: Don <x>
User-Agent: Thunderbird 2.0.0.18 (Windows/20081105)
MIME-Version: 1.0
To: test@x
Content-Type: text/plain; charset="us-ascii"
Content-
Subject: [test] test 16 - shift JIS
...
HTML▒▒ź▒
(looks like it really was shiftjis...)
Changed in mailman:
milestone: none → 2.1.12
status: Fix Committed → Fix Released
This is only related to (non) Bug 265976 in that the exception is the same.
It is not clear to me if this is a bug or not. The code in HyperArch.py is suspicious. The existing code fragment is
    if mm_cfg.ARCHIVER_OBSCURES_EMAILADDRS:
        otrans = i18n.get_translation()
        try:
            atmark = unicode(_(' at '), cset)
            i18n.set_language(self._lang)
            body = re.sub(r'([-+,.\w]+)@([-+.\w]+)',
                          '\g<1> ' + atmark + '\g<2>', body)
        finally:
            i18n.set_translation(otrans)
And it seems it possibly should be
    if mm_cfg.ARCHIVER_OBSCURES_EMAILADDRS:
        otrans = i18n.get_translation()
        try:
            i18n.set_language(self._lang)
            atmark = unicode(_(' at '), cset)
            body = re.sub(r'([-+,.\w]+)@([-+.\w]+)',
                          '\g<1> ' + atmark + '\g<2>', body)
        finally:
            i18n.set_translation(otrans)
but I'm not sure. I may be misunderstanding it.
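To make the ordering concern concrete, here is a small stand-alone sketch. It is not Mailman code: `CATALOGS`, `CHARSETS`, `set_language` and `_` are hypothetical stand-ins for the i18n machinery, the Japanese bytes for ' at ' are an assumed EUC-JP string, and it assumes cset is derived from the article's language. It is written for Python 3, where `s.decode(cset)` plays the role of Python 2's `unicode(s, cset)`. It only illustrates one way the two orderings could differ; it does not confirm that this is what happened here.

```python
# Stand-alone sketch; none of these names are Mailman's real API.

# Hypothetical catalogs: the translated bytes for ' at ' in each language.
# The 'ja' entry is an assumed EUC-JP string, chosen purely for illustration.
CATALOGS = {
    'en': b' at ',
    'ja': b' \xa1\xf7 ',
}
# Hypothetical charset associated with each language.
CHARSETS = {'en': 'us-ascii', 'ja': 'euc-jp'}

# Process-wide "current translation" state.
_current = {'lang': 'en'}


def set_language(lang):
    """Switch the active translation language."""
    _current['lang'] = lang


def _(msgid):
    """Return the translation of msgid in the currently active language."""
    assert msgid == ' at '
    return CATALOGS[_current['lang']]


def atmark_translate_first(article_lang):
    """Existing order: translate first, switch language afterwards."""
    cset = CHARSETS[article_lang]
    atmark = _(' at ').decode(cset)  # uses whatever language was active
    set_language(article_lang)
    return atmark


def atmark_switch_first(article_lang):
    """Suggested order: switch language first, then translate."""
    cset = CHARSETS[article_lang]
    set_language(article_lang)
    return _(' at ').decode(cset)    # translation and charset now agree


# Active language differs from the article's language.
set_language('ja')
try:
    atmark_translate_first('en')
except UnicodeDecodeError as exc:
    print(exc)  # 'ascii' codec can't decode byte 0xa1 in position 1: ...

set_language('ja')
print(repr(atmark_switch_first('en')))  # ' at '
```

In this sketch the failure only appears when the active language and the article's language differ and the active translation of ' at ' contains non-ASCII bytes, which is why the list's preferred_language and the server default language matter.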
However, I am unable to duplicate this exception or any other with the message fragment posted with either English or Japanese as the list language, so I don't know if the above change would have any effect.
It is clear the message is defective in that it claims the body is us-ascii and it isn't, but I don't *think* that should cause this particular error.
Can you email me a complete message that causes this exception, and also tell me the preferred_language of this list and the mm_cfg.py setting of DEFAULT_SERVER_LANGUAGE if it is set to something other than 'en'?