all imdb-get searches return no results

Bug #1180225 reported by whitis
24
This bug affects 4 people
Affects Status Importance Assigned to Milestone
imdb-tools (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

This is similar to the bug reported in:
https://bugs.launchpad.net/bugs/313260
However, it appears that imdb has changed their content format again. Thus the original fix does not work.

Example:

$ imdb-get -s alien
Select a Film:

ProblemType: Bug
DistroRelease: Ubuntu 12.04
Package: imdb-tools 0.9-1
ProcVersionSignature: Ubuntu 3.5.0-18.29-generic 3.5.7
Uname: Linux 3.5.0-18-generic x86_64
ApportVersion: 2.0.1-0ubuntu17.2
Architecture: amd64
CheckboxSubmission: 1e1a99f31ec2933306456e1f141c45d8
CheckboxSystem: edda5d4f616ca792bf437989cb597002
Date: Wed May 15 02:23:51 2013
InstallationMedia: Ubuntu 11.04 "Natty Narwhal" - Release amd64 (20110427.1)
MarkForUpload: True
PackageArchitecture: all
ProcEnviron:
 LANGUAGE=en_US:en
 TERM=xterm
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
SourcePackage: imdb-tools
UpgradeStatus: Upgraded to precise on 2013-03-03 (72 days ago)

Revision history for this message
whitis (whitis) wrote :
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in imdb-tools (Ubuntu):
status: New → Confirmed
Revision history for this message
Daniel (hackie) wrote :

I attached a patch which partially fixes the problem for version 0.9-1. The problem is a probably a redesign of the imdb homepage. Unfortunately, the developer didn't add any comments about the expected input, which makes it complicated to find out how the new commands should look like.
With the patch, the tool is at least able to list search results again and it can extract the title (i got stuck at the extraction of the director). Personally, I think, bash and regular expressions are just the wrong language for parsing HTML, because it also leads to escaping problems (see below) and can return very surprising result if texts contain trigger words.

Here the result after patching:

$ imdb-get -s alien
1: Alien (1979)
2: Untitled Neill Blomkamp/Alien Project (2017)
3: Alien (2010) (Video)
4: Aliens (1986)
5: Alien³ (1992)
6: Outpost 37 (2014)
7: Alien: Resurrection (1997)
8: Aliens in the Attic (2009)
9: Cowboys & Aliens (2011)
10: Alienator (1990)
Select a Film: 1
Getting details from IMDB
id: tt0078748

$ imdb-get -a -s alien
1: Alien (1979)
2: Untitled Neill Blomkamp/Alien Project (2017)
3: Alien (2010) (Video)
4: Aliens (1986)
5: Alien³ (1992)
6: Outpost 37 (2014)
7: Alien: Resurrection (1997)
8: Aliens in the Attic (2009)
9: Cowboys & Aliens (2011)
10: Alienator (1990)
Select a Film: 1
Getting details from IMDB
id: tt0078748
title: Alien
year: 1979
director: <a href="/name/nm0000631/?ref_=tt_ov_dr" itemprop='url'>Ridley Scott Writers: <a href="/name/nm0639321/?ref_=tt_ov_wr"
writer:
tagline:
plot:
genre:
cast:
coverart:
rating:

Revision history for this message
Ubuntu Foundations Team Bug Bot (crichton) wrote :

The attachment "imdb-patch-unfinished.patch" seems to be a patch. If it isn't, please remove the "patch" flag from the attachment, remove the "patch" tag, and if you are a member of the ~ubuntu-reviewers, unsubscribe the team.

[This is an automated message performed by a Launchpad user owned by ~brian-murray, for any issues please contact him.]

tags: added: patch
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.