[lug] Bulk wget from archive.org

Jed S. Baer blug at jbaer.cotse.net
Thu Jul 20 14:52:43 MDT 2017


On Thu, 20 Jul 2017 14:30:39 -0600
Tyler Cipriani wrote:

> hrm, SNI extension to TLS has been around for a bit. Anyway I think
> archive.org supports plain old http.

Nope, redirects to https:

> Anyway, I can't resist a good shell/archive.org task. I managed to get
> all the pdfs downloaded with the info in this gist:
> https://gist.github.com/thcipriani/bd59e25a5db7c3551bcf5d6041a9a0c3

Well cool. But there are 355 issues, so if you include the "text
overlay" PDFs, there should be something more than 355 PDFs.
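
For a rough sanity check on a finished download, something like the
following could count the PDFs and compare against the issue count. This
is only a sketch: the function names are made up for illustration, and it
assumes everything landed under a single directory and that no filename
contains a newline.

```shell
# Hypothetical helpers, not part of the gist referenced above.

# Print the number of PDF files under the directory given as $1.
# -iname matches both .pdf and .PDF (supported by GNU and BSD find).
count_pdfs() {
    find "$1" -type f -iname '*.pdf' | wc -l | tr -d '[:space:]'
}

# Succeed if directory $1 holds more PDFs than the issue count in $2,
# which is what you'd expect if the "text overlay" PDFs came along too.
has_overlay_pdfs() {
    [ "$(count_pdfs "$1")" -gt "$2" ]
}
```

Usage, assuming the files went into a directory called ./downloads (a
placeholder name):

    has_overlay_pdfs ./downloads 355 && echo "more than 355 PDFs"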

