I picked up on a tool called WinHTTrack from Dale Meredith’s Pluralsight course on reconnaissance (which I will review in a couple of days). The tool has a simple premise: grab website files based on links.
When I ran it against Catallaxy Services, it pulled back results based on each link. The app handles subdomains separately and presents a reasonable picture of the site. The advantage to using a tool like this is that you can grab a website and browse it locally later. This lets you perform site analysis without actually being on the website.
Also, this tool will preserve external links but by default, it will not grab files from external sites. You don’t want to try to collect the whole internet (right? Right?), so being able to target downloads to one domain, subdomain, or even directory is very helpful.