Looking for a tool to convert an article from a webpage into a 'clean' reader-view-type epub file.

6 replies [Last post]
Staircase
Offline
Joined: 02/24/2022

Hi everyone, I am looking for a tool to convert an article or a blog post from a webpage into an epub file. The tool should not only convert HTML to epub (Pandoc's job) but also 'clean' the page as so to produce what some call a "reader view" versoin of the same page. I learned that rdrview does that.

I found this script https://tildegit.org/exquisitecorp/bookmobile that does the job but it depends on rdrview which is also not in Trisquel's repository.

I would prefer to rely on tools from the repository.

I would like to know if there is a tool in Trisquel's repository that can convert a webpage into "reader view" and then an epub file.

Any ideas?

Magic Banana

I am a member!

I am a translator!

Offline
Joined: 07/24/2010

You can save the HTML Abrowser displays in Reader View: https://support.mozilla.org/kb/firefox-reader-view-clutter-free-web-pages

For more options, there is that add-on: https://gnuzilla.gnu.org/extension.php?id=994899

Sunny Day
Offline
Joined: 01/05/2023

Thank you Magic Banana for the info and links + more thanks to Staircase for asking such a great question!!

I've been looking for a thing like that for a long time - this is superb!

Staircase
Offline
Joined: 02/24/2022

Thank you, MB. I had not thought of looking at extensions. I found a few on gnuzilla.gnu.org, I tested three of them and this one: ePub Creator, so far, does the job of converting webpages into a "Reader Viewed" epub file I can download.

As with all extensions, I am always wondering whether an extension sends data to a remote server. To check ePub Creator, I downloaded the source code and ran the command 'grep https *' on all files. I found nothing unexpected. Happy to take advice from others on how to check code. Also, the extension is licensed is MPL-2.0.

ePub Creator: https://gnuzilla.gnu.org/extension.php?id=774935

PS: I was surprised to find a proprietary extension on gnuzilla.gnu.org (e.g. dotepub - no source code available).

PS 2: you're welcome Sunny Day. I am glad this thread is useful to others.

Sunny Day
Offline
Joined: 01/05/2023

Great to read all you wrote, thank you Staircase!

Happy to take advice from others on how to check code

Yes please, would be briliant to learn how to check code!! I will be keeping my eye here for the responses.

Avron

I am a translator!

Offline
Joined: 08/18/2020

I was surprised to find a proprietary extension on gnuzilla.gnu.org (e.g. dotepub - no source code available).

The License is MPL-1.1. Just download the .xpi file and run "unzip" on it, you will see the code. I have zero knowledge of .js but I have the impression that it is sending data to a remote server.

Staircase
Offline
Joined: 02/24/2022

> > I was surprised to find a proprietary extension on gnuzilla.gnu.org (e.g. dotepub - no
> > source code available).
> >
> The License is MPL-1.1. Just download the .xpi file and run "unzip" on it, you will see the code.

My mistake. Thanks for catching this.