Saturday, May 4, 2024
 Popular · Latest · Hot · Upcoming
26
rated 0 times [  26] [ 0]  / answers: 1 / hits: 116044  / 3 Years ago, tue, july 13, 2021, 5:12:58

I'm using pdftotext (part of poppler-utils) to convert PDF documents to text. It works, for the most part, but one thing I wish it did was to insert blank lines between separate paragraphs instead of mashing them together.



Is there way to get pdftotext to do this? And if not, is there another pdf to text utility that can do this?


More From » pdf

 Answers
1

You could try ebook-convert from Calibre.



If anything, I'd say it errs in the other direction: too many line breaks.



Another thing I'd definitely consider though is converting to HTML using pdfreflow, and then convert the HTML to TXT.


[#44318] Thursday, July 15, 2021, 3 Years  [reply] [flag answer]
Only authorized users can answer the question. Please sign in first, or register a free account.
assionortly

Total Points: 423
Total Questions: 121
Total Answers: 115

Location: Chad
Member since Wed, Sep 30, 2020
4 Years ago
;