Show HN: Epublifier – scrape pages (books, manuals) for offline reading
(github.com)
262 points
maoserr
| 2 comments |
21 Oct 24 13:18 UTC
Mkengine [21 Oct 24 15:49 UTC] No. 41905419 >>41903864 (OP)
Does it support http://fanfiction.net/? I never found an easy solution for that one.
replies(4): >>41905667 >>41905946 >>41906438 >>41908224
1.
kemayo [21 Oct 24 20:30 UTC] No. 41908224 >>41905419
Fanfiction.net is trivial... apart from it having Cloudflare bot blocking turned up to aggressive levels. I've not seen an approach that works, other than using headless browsers to fetch the content.
replies(1): >>41909356
2.
theultdev [21 Oct 24 22:45 UTC] No. 41909356 >>41908224 (TP)
Headless browsers won't work by default for Cloudflare captchas.
Open source stealth plugins don't really work now either.
You have to use real browser fingerprints.