←back to thread

262 points maoserr | 7 comments | | HN request time: 0.436s | source | bottom
1. Mkengine ◴[] No.41905419[source]
Does it support http://fanfiction.net/ ? I never found an easy solution for that one.
replies(4): >>41905667 #>>41905946 #>>41906438 #>>41908224 #
2. maoserr ◴[] No.41905667[source]
You can import a csv of all the chapter links, looks like it's just incremental numbering in the url
replies(1): >>41906788 #
3. pasc1878 ◴[] No.41905946[source]
I use a calibre add-in https://www.mobileread.com/forums/showthread.php?t=259221

It sort of works ie some stories just work others just get the first page.

4. seridescent ◴[] No.41906438[source]
you can export epubs from https://fichub.net/
5. t-3 ◴[] No.41906788[source]
The issue is most likely cloudflare blocking most the best scraping methods. If the site can be pulled down with eg. wget or curl without a bunch of options that you definitely aren't writing by hand, pandoc can just be used to directly make an epub.
6. kemayo ◴[] No.41908224[source]
Fanfiction.net is trivial... apart from it having Cloudflare bot blocking turned up to aggressive levels. I've not seen an approach that works, other than using headless browsers to fetch the content.
replies(1): >>41909356 #
7. theultdev ◴[] No.41909356[source]
headless browsers won't work by default for cloudflare captchas.

open source stealth plugins don't really work now either.

you have to use real browser fingerprints.