(github.com)

1. Mkengine ◴[21 Oct 24 15:49 UTC] No.41905419[source]▶

Does it support http://fanfiction.net/ ? I never found an easy solution for that one.

replies(4): >>41905667 #>>41905946 #>>41906438 #>>41908224 #

2. maoserr ◴[21 Oct 24 16:14 UTC] No.41905667[source]▶

>>41905419 (TP) #

You can import a CSV of all the chapter links; it looks like the URLs just use incremental chapter numbering.
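
A rough sketch of generating such a CSV, in case it helps. The URL pattern (`/s/<story id>/<chapter>/`), the story ID, and the one-URL-per-row layout are all assumptions for illustration; check what the import actually expects before relying on it.

```python
import csv
import io


def chapter_urls(story_id, n_chapters, base="https://www.fanfiction.net/s"):
    """Build one URL per chapter, assuming chapters are numbered 1..n in the path."""
    return [f"{base}/{story_id}/{ch}/" for ch in range(1, n_chapters + 1)]


def to_csv(urls):
    """Serialize the URLs as CSV text: one URL per row, no header (assumed layout)."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for url in urls:
        writer.writerow([url])
    return buf.getvalue()


if __name__ == "__main__":
    # 123 is a placeholder story ID, not a real story.
    print(to_csv(chapter_urls(123, 3)), end="")
```

This only builds the list of links; it does nothing about the fetching itself.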

replies(1): >>41906788 #

3. pasc1878 ◴[21 Oct 24 16:46 UTC] No.41905946[source]▶

>>41905419 (TP) #

I use a calibre add-in https://www.mobileread.com/forums/showthread.php?t=259221

It sort of works, i.e. some stories just work, while others only get the first page.

4. seridescent ◴[21 Oct 24 17:41 UTC] No.41906438[source]▶

>>41905419 (TP) #

you can export epubs from https://fichub.net/

5. t-3 ◴[21 Oct 24 18:15 UTC] No.41906788[source]▶

>>41905667 #

The issue is most likely Cloudflare blocking most of the best scraping methods. If the site can be pulled down with e.g. wget or curl without a bunch of options that you definitely aren't writing by hand, pandoc can be used directly to make an epub.
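
For the pandoc step, something like the following might work once the pages are already saved locally. It is only a sketch: the file names, output name, and title are hypothetical, and the helper just assembles the command from pandoc's `--from`, `--metadata`, and `--output` options rather than fetching anything.

```python
import subprocess


def pandoc_epub_command(html_files, output="book.epub", title="Untitled"):
    """Return the argv list for converting saved HTML pages into a single epub."""
    if not html_files:
        raise ValueError("need at least one HTML file")
    return [
        "pandoc",
        "--from", "html",                 # input pages are HTML
        "--metadata", f"title={title}",   # epub title shown by readers
        "--output", output,               # format inferred from the .epub extension
        *html_files,                      # chapters, concatenated in this order
    ]


if __name__ == "__main__":
    # ch1.html and ch2.html are placeholder names for pages saved with wget or curl.
    cmd = pandoc_epub_command(["ch1.html", "ch2.html"], "story.epub", "My Story")
    subprocess.run(cmd, check=True)  # requires pandoc on PATH
```

Passing the arguments as a list avoids shell quoting problems with titles or file names that contain spaces.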

6. kemayo ◴[21 Oct 24 20:30 UTC] No.41908224[source]▶

>>41905419 (TP) #

Fanfiction.net is trivial... apart from it having Cloudflare bot blocking turned up to aggressive levels. I've not seen an approach that works, other than using headless browsers to fetch the content.

replies(1): >>41909356 #

7. theultdev ◴[21 Oct 24 22:45 UTC] No.41909356[source]▶

>>41908224 #

Headless browsers won't work by default against Cloudflare captchas.

Open source stealth plugins don't really work now either.

You have to use real browser fingerprints.


Show HN: Epublifier – scrape pages (books, manuals) for offline reading