[lug] Wget Behavior Differences on Different Machines

David L. Anselmi anselmi at anselmi.us
Tue Nov 8 19:03:26 MST 2005


Bill Thoen wrote:
[...]
> The place it's hanging on is the line:
> 
> Connecting to catalog.mso.census.gov[148.129.75.139]:443...

That's a link at the bottom of the page (Product Catalog), and that 
machine seems to be down at the moment.  It looks like the older version 
of wget processes the links in a different order than the newer one (all 
robots.txt files first?).  The new version would probably hang too, once 
it reached the bad link.

You can probably use --exclude-domains to skip that host and keep things 
on this server.  You might also like -A.zip if all you want is the zip 
files, and maybe --no-parent as well.
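
Something like the following might be a starting point.  It is only a 
sketch: the start URL is a placeholder (the real one isn't shown in the 
quoted message), the fetch_zips function and DRY_RUN variable are made up 
for illustration, and -r is assumed because the download is recursive.  By 
default it just prints the wget command so you can check it before 
running it for real.

```shell
#!/bin/sh
# Placeholder start URL (assumption): substitute the page you are mirroring.
START_URL="${START_URL:-http://www.example.com/data/}"

# The host that was not answering in the quoted output.
BAD_HOST="catalog.mso.census.gov"

# fetch_zips is a hypothetical helper name, not part of wget.
#   -r                 recurse through links
#   --no-parent        never climb above the starting directory
#   -A.zip             keep only files whose names end in .zip
#   --exclude-domains  never follow links to the listed host(s)
fetch_zips() {
    set -- wget -r --no-parent -A.zip --exclude-domains="$BAD_HOST" "$1"
    if [ "${DRY_RUN:-1}" = 1 ]; then
        # Dry run (the default): show the command without touching the network.
        echo "$@"
    else
        "$@"
    fi
}

fetch_zips "$START_URL"
```

Set DRY_RUN=0 to actually run the download once the printed command 
looks right.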

HTH,
Dave


