Situation: Say you needed to download all pages linked from http://example.com/page.html and you don't want to do it manually one page at a time. (Additional benefit: the directory structure is kept.)
Solution: Use Backstreet Browser with the following settings:
(Note: There might be a more elegant way using Python.)
Like this post? Please help me pay for my Internet access.
Labels: web scraping
Solution: Use Backstreet Browser with the following settings:
- Project URL: The URL of the page (in our example, http://example.com/page.html)
- Maximal link depth: 2 (so that only page.html and the pages linked from page.html will be downloaded)
- Under the Others tab, clear the Filename for the Starting HTML page (by default index.html)
(Note: There might be a more elegant way using Python.)
Like this post? Please help me pay for my Internet access.
Labels: web scraping

