wget does not handle IDN domains

Bug #129752 reported by Daniel Hahler
2
Affects Status Importance Assigned to Milestone
wget
Unknown
Unknown
wget (Debian)
Fix Released
Unknown
wget (Ubuntu)
Confirmed
Low
Unassigned

Bug Description

Binary package hint: wget

wget does not support IDN domains:

$ LC_ALL=C wget 'http://www.köln.de/'
 --23:49:06-- http://www.k%C3%B6ln.de/
 => `index.html'
 Resolving http://www.k\303\266ln.de... failed: Name or service not known.

Translating the URL to IDN (xn--kln-sna.de) manually works.

Maybe its possible to use GNU libidn http://www.gnu.org/software/libidn/

Upstream bug report (mainly copied above): https://savannah.gnu.org/bugs/?20422

Daniel Hahler (blueyed)
description: updated
Revision history for this message
Áron Sisak (asisak) wrote :

Confirmed on Gutsy x86, wget 1.10.2-3ubuntu1

Changed in wget:
importance: Undecided → Low
status: New → Confirmed
Áron Sisak (asisak)
Changed in wget:
status: Confirmed → Triaged
Daniel Hahler (blueyed)
Changed in wget:
status: New → Unknown
Revision history for this message
Micah Cowan (micahcowan) wrote :

This issue has been addresssed in the recently-released Wget 1.12. It's currently in Debian unstable, so I imagine it will be made available in Lucid Lynx.

Changed in wget (Ubuntu):
status: Triaged → Fix Committed
Changed in wget (Debian):
status: Unknown → Confirmed
Revision history for this message
Andreas Olsson (andol) wrote :

Still not resolved in the current natty wget 1.12-2.1ubuntu1

  andreas@natty:~$ wget 'http://www.köln.de/'
  --2010-11-27 11:33:39-- http://www.k%C3%B6ln.de/
  Resolving www.köln.de... failed: Name or service not known.
  wget: unable to resolve host address `www.köln.de'
  andreas@natty:~$

That observation is by the way also matched by the corresponding Debian Bug. While it was initially closed per release of wget 1.12-1, it was later reopened again.

Changed in wget (Ubuntu):
status: Fix Committed → Confirmed
Revision history for this message
Andreas Olsson (andol) wrote :

I wonder how well this really is fixed upstream. Even when I build against libidn, and wget claims to have +iri the test still fails.

The libidn version being libidn11 1.18-1.

andreas@natty:~$ wget --version
GNU Wget 1.12 built on linux-gnu.

+digest +ipv6 +nls +ntlm +opie +md5/openssl +https -gnutls +openssl
+iri

Wgetrc:
    /etc/wgetrc (system)
Locale: /usr/share/locale
Compile: gcc -DHAVE_CONFIG_H -DSYSTEM_WGETRC="/etc/wgetrc"
    -DLOCALEDIR="/usr/share/locale" -I. -I../lib -Iyes/include -g -O2
    -D_FILE_OFFSET_BITS=64 -O2 -g -Wall
Link: gcc -g -O2 -D_FILE_OFFSET_BITS=64 -O2 -g -Wall -Wl,-Bsymbolic-functions
    -Lyes/lib /usr/lib/libssl.so /usr/lib/libcrypto.so -ldl -lrt -lidn
    ftp-opie.o openssl.o http-ntlm.o gen-md5.o ../lib/libgnu.a

Copyright © 2009 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later
<http://www.gnu.org/licenses/gpl.html>.
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.

Originally written by Hrvoje Nikšić <email address hidden>.
Currently maintained by Micah Cowan <email address hidden>.
Please send bug reports and questions to <email address hidden>.
andreas@natty:~$ wget http://www.köln.de/
--2010-11-27 14:23:58-- http://www.k%C3%B6ln.de/
Resolving www.köln.de (www.köln.de)... failed: Name or service not known.
wget: unable to resolve host address “www.köln.de”
andreas@natty:~$

Revision history for this message
Andreas Olsson (andol) wrote :

Another mention about failure to wget 1.12 to place nice with IDN: http://<email address hidden>/msg12518.html

Posted a question in the upstream bug report in regards to non obvious dependencies etc.

Changed in wget (Debian):
status: Confirmed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.