When using --self-contained option in writing HTML pandoc cannot download materials accessible only via HTTPS

Bug #1610722 reported by Stephen Michael Kellat
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
pandoc (Ubuntu)
Expired
Undecided
Unassigned

Bug Description

When attempting to use pandoc to convert a file to a "self-contained" HTML file, pandoc failed as it was unable to download a graphic that was located in an HTTPS location. It failed out on the first such HTTPS location encountered.

ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: pandoc 1.16.0.2~dfsg-1
ProcVersionSignature: Ubuntu 4.4.0-31.50-generic 4.4.13
Uname: Linux 4.4.0-31-generic x86_64
ApportVersion: 2.20.1-0ubuntu2.1
Architecture: amd64
CurrentDesktop: XFCE
Date: Sun Aug 7 14:27:05 2016
InstallationDate: Installed on 2016-06-26 (41 days ago)
InstallationMedia: Xubuntu 16.04 LTS "Xenial Xerus" - Release amd64 (20160420.1)
SourcePackage: pandoc
UpgradeStatus: No upgrade log present (probably fresh install)

Revision history for this message
Stephen Michael Kellat (skellat) wrote :
Revision history for this message
Stephen Michael Kellat (skellat) wrote :

The situation where this reprocessing fails is when files are taken from the output of the RSS aggregator rawdog. I've attached an example source file here.

$ pandoc -o Feeds-reprocessed.html -f html -t html5 --self-contained Feeds.html

pandoc: Could not fetch https://2.bp.blogspot.com/-Bnny6lL9DkY/V67nsSYFLMI/AAAAAAAABmU/vII8_kOujh8ujsy6OamtcvNtL_E-8vzUACLcB/s640/ADDX.jpg
TlsExceptionHostPort (HandshakeFailed Error_EOF) "2.bp.blogspot.com" 443

In contrast, if I attempt instead:

$ pandoc -o Feeds-reprocessed.epub -f html -t epub --self
-contained Feeds.html
pandoc: Could not find media `https://2.bp.blogspot.com/-Bnny6lL9DkY/V67nsSYFLMI/AAAAAAAABmU/vII8_kOujh8ujsy6OamtcvNtL_E-8vzUACLcB/s640/ADDX.jpg', skipping...
pandoc: Could not find media `https://4.bp.blogspot.com/-18CvlS_4n8o/V66lnPeUbXI/AAAAAAAANew/ZOHUdJ-C9O4_nIMjcXxzF0OzbNfKDg4yQCLcB/s320/samurai.png', skipping...
pandoc: Could not find media `https://1.bp.blogspot.com/-L_wwB-xz9Jw/V66ceT9fyEI/AAAAAAAANeg/MWU1HetVZ_4ubL1rc2FUqcFt0r5EJcOqQCLcB/s320/hillary%2Bwall.png', skipping...
pandoc: Could not find media `https://2.bp.blogspot.com/-MT4dyuHl0nY/V6z92fpQGhI/AAAAAAAAAgo/_5XZQoat46w8_QYHSvPYtsBxBaZA-Io0gCLcB/s1600/clockboyhonda.jpg', skipping...
pandoc: Could not find media `https://2.bp.blogspot.com/--RayZ7bxBQ0/V6z_fUp6JlI/AAAAAAAAAg0/kvPgPYP-kN0ghfgQBct_mH_wUVEcJZ1HwCLcB/s320/balloonboytrump.PNG', skipping...
pandoc: Could not find media `https://2.bp.blogspot.com/-ObMhaEUnAuw/V60CZUnpofI/AAAAAAAAAhQ/tjAq6TBcMwsSga-ewoOZBvWVwN53C8zWgCLcB/s320/batboy_1_2.jpg', skipping...
pandoc: Could not find media `https://1.bp.blogspot.com/-pIqJp7lHVfU/V60Cs1PUkTI/AAAAAAAAAhU/Oe1JQZOAUcAJVU4JL0m5m0bXkmBofOnLwCLcB/s1600/spacealienbill.jpg', skipping...
pandoc: Could not find media `https://4.bp.blogspot.com/-7auuE8vY4sQ/V60CuVJVW4I/AAAAAAAAAhY/e8UNIgHCn44eSQuTC9kHHY8wvtOSl8vdQCLcB/s1600/spacealienhillary.jpg', skipping...
pandoc: Could not find media `https://4.bp.blogspot.com/-0MQp4idRRR4/V60CwBQzapI/AAAAAAAAAhc/Jw8l3jei7wYah33g_lFPylXWmsK6fExwACLcB/s1600/spacealienobama.jpg', skipping...
pandoc: Could not find media `https://1.bp.blogspot.com/-JhSifWBK1pI/V6byCGJV9cI/AAAAAAAAAdY/Tm7ud8FOvpArQPbuht7-HwLGdSK8VleSQCLcB/s1600/icon_neutral.gif', skipping...
pandoc: Could not find media `http://www.blazingcatfur.ca/wp-content/plugins/add-to-any/icons/facebook.png', skipping...
pandoc: Could not find media `http://www.blazingcatfur.ca/wp-content/plugins/add-to-any/icons/twitter.png', skipping...
pandoc: Could not find media `http://www.blazingcatfur.ca/wp-content/plugins/add-to-any/icons/google_plus.png', skipping...
pandoc: Could not find media `http://www.blazingcatfur.ca/wp-content/plugins/add-to-any/icons/reddit.png', skipping...
pandoc: Could not find media `http://www.blazingcatfur.ca/wp-content/plugins/add-to-any/icons/linkedin.png', skipping...
pandoc: Could not find media `http://www.blazingcatfur.ca/wp-content/plugins/add-to-any/icons/tumblr.png', skipping...
pandoc: Could not find media `http://www.blazingcatfur.ca/wp-content/plugins/add-to-any/icons/whatsapp.png', skipping...

And a somewhat usable EPUB is generated.

Changed in pandoc (Ubuntu):
status: New → Incomplete
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for pandoc (Ubuntu) because there has been no activity for 60 days.]

Changed in pandoc (Ubuntu):
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.