Run Publishers through Google Refine

Bug #616007 reported by George
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
Open Library | New | Wishlist | arielb |

Bug Description

I'd be very interested to see whether a quick pass through Gridworks could set us on the path to controlling publisher names.

Tags: publisher
George (george-archive)
Changed in openlibrary:
assignee: nobody → arielb (ariel-archive)
importance: Undecided → Wishlist
Jacob Gest (jakegest)
tags: added: publisher
Edward Betts (edwardbetts) wrote :

What's the problem you're trying to solve? I think the spec for publishers looks like this:

- Publisher pages that look like subject pages.
- Publisher search
- Publisher drop down when editing a book.
- Merge publishers

We can add extra fields about publishers, such as founders, key people, start date, end date, street address, country, number of employees, web page, and Wikipedia links. These could come later.

arielb (ariel-archive) wrote :

Gridworks may be a bit heavy-handed for this. We can probably just use the relevance or recon services:

http://data.labs.freebase.com/recon/query?q={"name": "Penguin", "type": "/book/publishing_company"}

http://api.freebase.com/api/service/search?query=penguin&type=/book/publishing_company
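
As a rough illustration of the two requests above, here is a minimal Python sketch that only builds the URLs with proper escaping and sends nothing over the network. The endpoints and the /book/publishing_company type are taken from this comment; the function names are hypothetical, and whether the services still respond is not something this sketch assumes.

```python
import json
from urllib.parse import urlencode

# Endpoints and type copied from the comment above; availability is not guaranteed.
RECON_ENDPOINT = "http://data.labs.freebase.com/recon/query"
SEARCH_ENDPOINT = "http://api.freebase.com/api/service/search"
PUBLISHER_TYPE = "/book/publishing_company"


def recon_url(name):
    """Build a recon query URL; the q parameter carries a JSON object."""
    query = json.dumps({"name": name, "type": PUBLISHER_TYPE})
    return RECON_ENDPOINT + "?" + urlencode({"q": query})


def search_url(name):
    """Build a search URL restricted to the publishing company type."""
    params = {"query": name, "type": PUBLISHER_TYPE}
    return SEARCH_ENDPOINT + "?" + urlencode(params)


if __name__ == "__main__":
    print(recon_url("Penguin"))
    print(search_url("penguin"))
```

Using urlencode keeps the braces, quotes, and slashes in the query escaped, which the hand-typed URLs above leave raw.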

George (george-archive) wrote :

The problem I'm trying to solve is to do a first pass on reconciling publishers automatically. Since it isn't a controlled field in Library World, there's massive variation in names. If we want pages for publishers, this would be a good first step.
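
To make the idea of an automatic first pass concrete, here is a minimal Python sketch of key-collision clustering in the spirit of the fingerprint method that Refine offers. It is an illustration under assumptions, not the tool's actual implementation: the function names are hypothetical and the sample publisher names are made up.

```python
import re
import unicodedata
from collections import defaultdict


def fingerprint(name):
    """Reduce a publisher name to a normalized key (hypothetical helper)."""
    # Decompose accented characters and drop the combining marks.
    text = unicodedata.normalize("NFKD", name)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    # Lowercase and remove punctuation.
    text = re.sub(r"[^\w\s]", "", text.lower())
    # Split on whitespace, de-duplicate and sort the tokens, then rejoin.
    return " ".join(sorted(set(text.split())))


def cluster(names):
    """Group names sharing a fingerprint; keep groups with more than one variant."""
    groups = defaultdict(list)
    for name in names:
        groups[fingerprint(name)].append(name)
    return [group for group in groups.values() if len(set(group)) > 1]


if __name__ == "__main__":
    # Made-up sample values, for illustration only.
    sample = ["Penguin Books", "penguin books.", "Books, Penguin", "Viking"]
    for group in cluster(sample):
        print(group)
```

Each resulting group is a candidate merge for a person to review. Variants such as abbreviations or imprints would not collide under this key and would need a fuzzier method.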

Also, we might be able to do something slightly different for publisher pages, such as publishing trends, common subjects, common authors, etc. The data/reporting we want might be similar, but I think we should make the page look a bit different so it isn't confusing.

I'd like to explore pages like /publishers/publisher/subjects too, perhaps.

summary: - Run Publishers through Freebase Gridworks
+ Run Publishers through Google Refine