[lug] Stupid WGET question
Evelyn Mitchell
efm at tummy.com
Wed Feb 16 13:04:15 MST 2005
I believe this may be what you're looking for:
man wget
HTTP Options
-E
--html-extension
If a file of type application/xhtml+xml or text/html is
downloaded
and the URL does not end with the regexp \.[Hh][Tt][Mm][Ll]?,
this
option will cause the suffix .html to be appended to the local
filename. This is useful, for instance, when you're mirroring a
remote site that uses .asp pages, but you want the mirrored
pages
to be viewable on your stock Apache server. Another good use
for
this is when you're downloading CGI-generated materials. A URL
like http://site.com/article.cgi?25 will be saved as arti-
cle.cgi?25.html.
* On 2005-02-16 20:02 George Sexton <gsexton at mhsoftware.com> wrote:
> Anyone know how to have wget retrieve non-HTML files when it traverses an
> HTML page?
>
> For example, I have an HTML page that has links to iCal files on it. I want
> WGET to retrieve the .HTML file, and all .ICS files referenced from that
> page.
>
> Here's the URL:
>
> http://www.mhsoftware.com/caldemo/iCal.html
>
>
>
> George Sexton
> MH Software, Inc.
> http://www.mhsoftware.com/
> Voice: 303 438 9585
>
>
> _______________________________________________
> Web Page: http://lug.boulder.co.us
> Mailing List: http://lists.lug.boulder.co.us/mailman/listinfo/lug
> Join us on IRC: lug.boulder.co.us port=6667 channel=#colug
--
Regards, tummy.com, ltd
Evelyn Mitchell Linux Consulting since 1995
efm at tummy.com Senior System and Network Administrators
http://www.tummy.com/
More information about the LUG
mailing list