
How to download all pages linked from a page


Situation: Say you need to download all pages linked from http://example.com/page.html, and you don't want to do it manually one page at a time. (An added benefit of the solution below: the directory structure is kept.)

Solution: Use Backstreet Browser with the following settings:
  • Project URL: The URL of the page (in our example, http://example.com/page.html)
  • Maximal link depth: 2 (so that only page.html and the pages linked from page.html will be downloaded)
  • Under the Others tab, clear the Filename for the Starting HTML page field (index.html by default)
Note 1: All pages linked from the starting page will be downloaded, whether or not they are relevant to your goal.

Note 2: There might be a more elegant way using Python.
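
For what it's worth, here is a rough sketch of what a Python version might look like, using only the standard library. This is not what Backstreet Browser does internally; it is just one possible way to get a similar result. The function names (extract_links, local_path_for, download), the "download" output folder, and the index.html fallback for URLs ending in a slash are all my own assumptions. It fetches the starting page, then every page linked from it (the equivalent of link depth 2), and saves each file under a folder path that mirrors the URL.

```python
import os
import urllib.request
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag, urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(html, base_url):
    """Return absolute http(s) URLs linked from html, without duplicates."""
    parser = LinkCollector()
    parser.feed(html)
    links = []
    for href in parser.hrefs:
        # Resolve relative links and drop any #fragment part.
        absolute, _fragment = urldefrag(urljoin(base_url, href))
        if urlparse(absolute).scheme in ("http", "https") and absolute not in links:
            links.append(absolute)
    return links


def local_path_for(url, root="download"):
    """Map a URL to a local path that mirrors the site's directory structure."""
    parts = urlparse(url)
    path = parts.path
    if not path or path.endswith("/"):
        path += "index.html"  # assumed file name for directory-style URLs
    return os.path.join(root, parts.netloc, *path.lstrip("/").split("/"))


def download(url, root="download"):
    """Fetch url, save it under root, and return the raw bytes."""
    target = local_path_for(url, root)
    os.makedirs(os.path.dirname(target), exist_ok=True)
    with urllib.request.urlopen(url) as response:
        data = response.read()
    with open(target, "wb") as f:
        f.write(data)
    return data


if __name__ == "__main__":
    start = "http://example.com/page.html"
    page = download(start)
    for link in extract_links(page.decode("utf-8", errors="replace"), start):
        try:
            download(link)
        except OSError as err:  # URLError and HTTPError are OSError subclasses
            print(f"skipped {link}: {err}")
```

As with Note 1, this grabs every linked page whether it is relevant or not, including links to other sites. It also ignores query strings when choosing the local file name, so two URLs that differ only in their query string would overwrite each other.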
