Feature request: download articles into ebooks

Bug #1006650 reported by Dom
68
This bug affects 14 people
Affects Status Importance Assigned to Milestone
calibre
New
Undecided
Unassigned

Bug Description

I would like a feature (if possible) that downloads articles or webpages and converts them into ebooks into your library. This could work like or using the dotepub conversion tool.

It would be great also if this feature could be accessed through a magnet link, for example:

- I am browsing the web and I come across an article like: http://www.inc.com/paul-schoemaker/6-Habits-of-Strategic-Thinkers.html
- I change the web address to: calibreconvert://www.inc.com/paul-schoemaker/6-Habits-of-Strategic-Thinkers.html
- Calibre sends the url to dotepub and saves the resulting ebook in your library

Happy for other ideas and suggestions, but I'm sure this would help anyone who spends a lot of time reading articles on the web from varied sources.

Currently using: version 0.8.52 on Windows 7

Cheers,
Dom

Revision history for this message
Vladimir Nunez (nunezvladimir) wrote :

Dom,
for this feature I use with success the following : http://dotepub.com/

That create an epub that I can directly add into Calibre and read...
Easy!
good luck

Vladimir

Revision history for this message
Dom (dom-the-busker) wrote : Re: [Bug 1006650] Re: Feature request - download articles into ebooks

Vladimir,

Thanks for your reply - I am already aware of dotepub, actually I mentioned
it in the request.

I am actually looking for an intelligent way of linking dotepub with
calibre seamlessly. I currently download many articles (say 50 to 100 per
week) and it is time consuming to convert them and add them to calibre by
hand, a tool to join dotepub and calibre would be wonderful.

And since calibre is already quite good at making things simple with
e-books I thought this would be a good fit.

Anyway I'll see if someone else comes up with something better soon.

Dom

On 4 June 2012 01:20, Vladimir Nunez <email address hidden> wrote:

> Dom,
> for this feature I use with success the following : http://dotepub.com/
>
> That create an epub that I can directly add into Calibre and read...
> Easy!
> good luck
>
> Vladimir
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1006650
>
> Title:
> Feature request - download articles into ebooks
>
> Status in calibre: e-book management:
> New
>
> Bug description:
> I would like a feature (if possible) that downloads articles or
> webpages and converts them into ebooks into your library. This could
> work like or using the dotepub conversion tool.
>
> It would be great also if this feature could be accessed through a
> magnet link, for example:
>
> - I am browsing the web and I come across an article like:
> http://www.inc.com/paul-schoemaker/6-Habits-of-Strategic-Thinkers.html
> - I change the web address to: calibreconvert://
> www.inc.com/paul-schoemaker/6-Habits-of-Strategic-Thinkers.html
> - Calibre sends the url to dotepub and saves the resulting ebook in your
> library
>
> Happy for other ideas and suggestions, but I'm sure this would help
> anyone who spends a lot of time reading articles on the web from
> varied sources.
>
> Currently using: version 0.8.52 on Windows 7
>
> Cheers,
> Dom
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/calibre/+bug/1006650/+subscriptions
>

Revision history for this message
John Card II (jcii) wrote : Re: Feature request - download articles into ebooks

Dom,
If you are able to sort downloaded files in your browser into different directories, you may be able to use the "Automatic Adding" directory in calibre to add .epub files generated by dotepub.com. This file-sort capability is a feature of some download managers such as http://www.freedownloadmanager.org/ , http://www.igetter.net/iGetter.html , etc.

I'm also interested in this feature, and I will now attempt to follow my suggestion.

Currently using version 0.8.55 on macosx 10.7.4
--
jcii

Revision history for this message
Dom (dom-the-busker) wrote : Re: [Bug 1006650] Re: Feature request - download articles into ebooks

John,

That is a great trick, I might see if I can set up something like that. At
the moment I am using Firefox - let me see if I can find a way to move them
to a specific folder.

Once they are uploaded to Calibre do you know if I need to keep the
originals or has it made a copy of the files?

Dom

On 12 June 2012 17:36, John Card II <email address hidden> wrote:

> Dom,
> If you are able to sort downloaded files in your browser into different
> directories, you may be able to use the "Automatic Adding" directory in
> calibre to add .epub files generated by dotepub.com. This file-sort
> capability is a feature of some download managers such as
> http://www.freedownloadmanager.org/ , http://www.igetter.net/iGetter.html, etc.
>
> I'm also interested in this feature, and I will now attempt to follow my
> suggestion.
>
> Currently using version 0.8.55 on macosx 10.7.4
> --
> jcii
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1006650
>
> Title:
> Feature request - download articles into ebooks
>
> Status in calibre: e-book management:
> New
>
> Bug description:
> I would like a feature (if possible) that downloads articles or
> webpages and converts them into ebooks into your library. This could
> work like or using the dotepub conversion tool.
>
> It would be great also if this feature could be accessed through a
> magnet link, for example:
>
> - I am browsing the web and I come across an article like:
> http://www.inc.com/paul-schoemaker/6-Habits-of-Strategic-Thinkers.html
> - I change the web address to: calibreconvert://
> www.inc.com/paul-schoemaker/6-Habits-of-Strategic-Thinkers.html
> - Calibre sends the url to dotepub and saves the resulting ebook in your
> library
>
> Happy for other ideas and suggestions, but I'm sure this would help
> anyone who spends a lot of time reading articles on the web from
> varied sources.
>
> Currently using: version 0.8.52 on Windows 7
>
> Cheers,
> Dom
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/calibre/+bug/1006650/+subscriptions
>

Revision history for this message
John Card II (jcii) wrote :
Download full text (3.9 KiB)

Dom,
For Firefox, check out http://flashgot.net/

Calibre, at least in my setup, copies the file to its managed library, and then deletes from the "Automatic Adding" directory. This seems to be by design, and a documented feature. And the way you want the program to work, unless you want to add the same file many many times.

There may also be something available with calibre's command line tools as well, I am also looking at that.
--
jcii

On Jun 12, 2012, at 6:59 AM, Dom wrote:

> John,
>
> That is a great trick, I might see if I can set up something like that. At
> the moment I am using Firefox - let me see if I can find a way to move them
> to a specific folder.
>
> Once they are uploaded to Calibre do you know if I need to keep the
> originals or has it made a copy of the files?
>
> Dom
>
> On 12 June 2012 17:36, John Card II <email address hidden> wrote:
>
>> Dom,
>> If you are able to sort downloaded files in your browser into different
>> directories, you may be able to use the "Automatic Adding" directory in
>> calibre to add .epub files generated by dotepub.com. This file-sort
>> capability is a feature of some download managers such as
>> http://www.freedownloadmanager.org/ , http://www.igetter.net/iGetter.html, etc.
>>
>> I'm also interested in this feature, and I will now attempt to follow my
>> suggestion.
>>
>> Currently using version 0.8.55 on macosx 10.7.4
>> --
>> jcii
>>
>> --
>> You received this bug notification because you are subscribed to the bug
>> report.
>> https://bugs.launchpad.net/bugs/1006650
>>
>> Title:
>> Feature request - download articles into ebooks
>>
>> Status in calibre: e-book management:
>> New
>>
>> Bug description:
>> I would like a feature (if possible) that downloads articles or
>> webpages and converts them into ebooks into your library. This could
>> work like or using the dotepub conversion tool.
>>
>> It would be great also if this feature could be accessed through a
>> magnet link, for example:
>>
>> - I am browsing the web and I come across an article like:
>> http://www.inc.com/paul-schoemaker/6-Habits-of-Strategic-Thinkers.html
>> - I change the web address to: calibreconvert://
>> www.inc.com/paul-schoemaker/6-Habits-of-Strategic-Thinkers.html
>> - Calibre sends the url to dotepub and saves the resulting ebook in your
>> library
>>
>> Happy for other ideas and suggestions, but I'm sure this would help
>> anyone who spends a lot of time reading articles on the web from
>> varied sources.
>>
>> Currently using: version 0.8.52 on Windows 7
>>
>> Cheers,
>> Dom
>>
>> To manage notifications about this bug go to:
>> https://bugs.launchpad.net/calibre/+bug/1006650/+subscriptions
>>
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1006650
>
> Title:
> Feature request - download articles into ebooks
>
> Status in calibre: e-book management:
> New
>
> Bug description:
> I would like a feature (if possible) that downloads articles or
> webpages and converts them into ebooks into your library. This could
> work like or using the dotepub conversion tool.
>
> It would be great also if this feature...

Read more...

Revision history for this message
Lenny DiBrango (lennyd120) wrote : Re: Feature request - download articles into ebooks

Have you checked out longform.org ? It converts any article on the web to an E-reader format and sends it to your reader through wi-fi. It's free and it works very well.

Revision history for this message
EcLiPsE (softinn72) wrote :

Please check also readlists.com (based on readability algorithms) where you can build your read list from several web articles and lets you export/download/email it as an epub/mobi.

Hope it helps,
Paolo

Revision history for this message
J M (hobohassin) wrote :

Here's a work around:

Print your webpage into a pdf printer (I use CutePdf, http://goo.gl/XKnU).
Then, convert through calibre.

I have not tested this method, so use at your own risk.
If you can simplify the web page (like Article mode on the kindle), then that would make the process much more effective.
Good luck!

Revision history for this message
J M (hobohassin) wrote :

Here is some extra research I just completed (attached).

Revision history for this message
lostinlodos (lostinlodos) wrote :

My choice for easy downloads:
Kindle-it/Push To Kindle
Works in most cases for yanking articles without the advertisements and extra stuff, and turns them into a MOBI, PDF, or EPUB

http://fivefilters.org/kindle-it/

Revision history for this message
Nigel Mackay (nmackay) wrote :

In any browser you specify your download folder. Just make that the watched folder. Then use dotepub. Files handled this way will be loaded into calibre and then deleted from the folder. Other files will be left alone, so you will be able to do with them whatever you want.

Can't get much simpler than that.

Revision history for this message
Simon (simsamsom) wrote :

You can also use a tool like Pocket to save all your articles, and then subscribe to your pocket RSS feed and download on a weekly (daily) schedule.

http://help.getpocket.com/customer/portal/articles/361724-how-to-configure-calibre-with-pocket

You can also use the pocket app on your phones.

Revision history for this message
Alok Singh (alokbeniwal) wrote :

I also recommend the tool Instapaper. It's worth the money and there's a calibre recipe to import articles as a MOBI file, which is easy to convert to EPUB.

Revision history for this message
kojapsid (kojapsid) wrote :

A way would be to convert the Javascript from http://www.grabmybooks.com/ into calibre's python

Leonis (tv-games)
summary: - Feature request - download articles into ebooks
+ Feature request: download articles into ebooks
Revision history for this message
wangzheng (hbyw618) wrote :

I just login to post the same title, IMO this is absolute needed a great feature for reasons below:
1. many browser plugins provided the similar functions, but they lack of unified process result, and many of them just not function well enough.
2. most of them dont support math symbols, such as latex, the layout is messy when I got the result in my kindle.
3. I need to manage several account just for downloading a page, and almost no way to download my articles in other formats.
4. my articles distributed into many sites, very hard to find, search, and manage them.
5. the BIG trouble is, when we those sites, where can I get all my articles back?

so my suggestion is:
1. offline saving is safe.
2. centralized management is good.
3. unified looking will make ur reading fast and efficient.
4. good support of math for scientific articles is a great hit but we need a stable software, and Calibre is the way!
5. searching, convert is base fuction of Calibre, we just need a plugin.

We really need this function, Calibre, plz!!!!!

Revision history for this message
Yunior Rahmawan Usop (yonk-yunior) wrote :

WebToEpub
https://chrome.google.com/webstore/detail/webtoepub/akiljllkbielkidmammnifcnibaigelm

I've manually change the parser to match Blogspot/wordpress

Element with Chapter Content: (filling it after viewed with developer tools)
or manually add URL with Edit Chapter URL

Revision history for this message
Francesco Turco (fturco) wrote :

Wallabag downloads articles in a readable format. You can later export them in EPUB format and add them to Calibre.

Revision history for this message
cmfy (cmfy) wrote :

https://addons.mozilla.org/en-US/firefox/addon/epub-read-the-web-offline/

That one used to work well, it's been a long time I don't play with it though

Revision history for this message
Zbigniew Nowacki (zbign) wrote :

For the site I like to read I implemented the receipt where I can just add the link and title and generate epub. This way I can have sth like magazine. Ofc it would be better to have it automatized.
Still it is better than having tenths of 1-article books.
the recipe includes method like that:
   def parse_index(self):
        section, articles = 'Content', []
        feeds = []

        articles.append({ 'title':'My title',
            'url': <some url>,
            'description': '' })

        feeds.append((section, articles))
        return feeds
So it demands some input but not much. HTH

Revision history for this message
cmfy (cmfy) wrote :

I'm attaching some hopefully useful code in case this feature gets looked into.
It's the open source add-on listed at https://addons.mozilla.org/en-US/firefox/addon/saveasebook/ from Alex Adam, also available at <https://github.com/alexadam/save-as-ebook>!

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.