search & replace doesnt work with "newlines" in PDF to MOBI conversion

Bug #1917386 reported by Richard Reamer
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
calibre
Invalid
Undecided
Unassigned

Bug Description

(I reported this in detail with screen prints on forum as well: https://www.mobileread.com/forums/showthread.php?t=337710)

But basically...

Hello,
(btw, this program is great, have used it on several conversions without issue)
I am trying to convert a PDF to MOBI for a Kindle keyboard (called a 3rd edition, that does 3G and WiFi built in)

Anyway -- on straight conversion - several "paragraph" or "section" headers in PDF get converted with the text squashed together without spaces.
Meaning in pdf file you would see "first second third" but after conversion to MOBI you get "firstsecondthird"

I tried doing "search & replace" in conversion - to no avail -- in my test (screen prints attached) you can see i tried replacing "newline" with a string of letter "j"s -- which didnt happen.

From the "log file" -- I see:
Conversion options changed from defaults:
read_metadata_from_opf: 'C:\\Users\\whatever\\AppData\\Local\\Temp\\calibr e_pxif23tg\\48a3wg9b.opf'
output_profile: 'kindle_pw3'
search_replace: '[["[a-z]\\\\n", "jjj"]]'
verbose: 2
++AND++
'search_replace': '[["[a-z]\\\\n", "jjj"]]',

No other references to "\n" to remove the newline or linefeed.

See attached screen prints for actual screens, "test" results and ebook viewer.

Is this is bug? Or do you have another regular expression that would match the paragraph/section "newlines" that the converter would except?

Thanks for your Help !

Revision history for this message
Richard Reamer (fred4104) wrote :
Revision history for this message
Richard Reamer (fred4104) wrote :
Revision history for this message
Richard Reamer (fred4104) wrote :
Revision history for this message
Richard Reamer (fred4104) wrote :

this is the source PDF I was converting.

Revision history for this message
Kovid Goyal (kovid) wrote :
  • t.epub Edit (59.4 KiB, application/octet-stream)

Convert your PDF with default settings and the following expressions in search and replace and you will be fine

\u2029
\u0020

Changed in calibre:
status: New → Invalid
Revision history for this message
Richard Reamer (fred4104) wrote :

will do !
thank you - am tired so i'll try that tomorrow.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers