[lug] Bulk wget from archive.org

Tyler Cipriani tyler at tylercipriani.com
Thu Jul 20 14:30:39 MDT 2017


Hrm, the SNI extension to TLS has been around for a while. Either way, I
think archive.org also supports plain old http.
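
A minimal sketch of the plain-http idea, assuming you already have a file
with one download URL per line. The names urls.txt, to_http and fetch_all
are made up for illustration and are not taken from the gist, and whether
archive.org serves a given file over plain http without redirecting back to
https is an assumption worth checking first:

```shell
#!/bin/sh
# Hypothetical helper: rewrite a leading https:// to http://.
# URLs with any other scheme are passed through unchanged.
to_http() {
    printf '%s\n' "$1" | sed 's|^https://|http://|'
}

# Hypothetical driver: read one URL per line from the file given as $1
# and fetch each one with wget.
#   --continue  resume partially downloaded files
#   --wait=1    pause one second between requests to be polite
fetch_all() {
    while IFS= read -r url; do
        # Skip blank lines in the list.
        [ -n "$url" ] || continue
        wget --continue --wait=1 "$(to_http "$url")"
    done < "$1"
}
```

With that sourced into a shell, `fetch_all urls.txt` would walk the list.
wget does its own TLS and does not go through Python's urllib3, so the
warning quoted below should not show up either way.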

Anyway, I can't resist a good shell/archive.org task. I managed to download
all the PDFs using the info in this gist:
https://gist.github.com/thcipriani/bd59e25a5db7c3551bcf5d6041a9a0c3


On Thu, Jul 20, 2017 at 2:24 PM, Jed S. Baer <blug at jbaer.cotse.net> wrote:

> On Thu, 20 Jul 2017 13:40:49 -0600
> Tyler Cipriani wrote:
>
> > I made a script to do these bulk downloads and (as is my wont) saved it
> > in my dotfiles; it may yet be useful in this situation[1].
>
> Thanks. But it gets all barfy over SSL.
>
> /usr/local/lib/python2.7/dist-packages/urllib3/util/ssl_.py:335:
> SNIMissingWarning: An HTTPS request has been made, but the SNI (Subject
> Name Indication) extension to TLS is not available on this platform. This
> may cause the server to present an incorrect TLS certificate, which can
> cause validation failures. You can upgrade to a newer version of Python
> to solve this.
>
> And Mint doesn't have a newer version. Or maybe they do, but I'd have to
> upgrade to v18.
>
> I guess I'll fall back to awk and wget.
> _______________________________________________
> Web Page:  http://lug.boulder.co.us
> Mailing List: http://lists.lug.boulder.co.us/mailman/listinfo/lug
> Join us on IRC: irc.hackingsociety.org port=6667 channel=#hackingsociety
>