

Assimilator Web Crawler

Assimilator Web Crawler (AWC) is a handy tool for automatically searching for and downloading files from the web. The approach is like drift-net fishing for files. AWC extracts all the links from a web page and checks each one for files. Any file matching the type and name specifications is queued for downloading. Any link that isn't a file is queued for parsing, and the process repeats with the next page found. The idea is based on a sort of six-degrees-of-separation theory: if you know a site that contains information related to what you're looking for, chances are it links to another related site that may have more of the information or files you are searching for. A search engine results page works well as a starting point.
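
The queueing step described above can be sketched roughly as follows. This is only an illustration in Python, not Assimilator's actual code: the function name `classify_links`, the wanted file types, the name pattern, and the rule that links with no extension or an `.html`/`.htm` extension count as pages are all assumptions made for the example.

```python
from collections import deque
from fnmatch import fnmatch
from posixpath import basename, splitext
from urllib.parse import urljoin, urlparse

# Hypothetical user specification (assumed for this sketch).
WANTED_TYPES = {".mp3", ".zip"}    # file types to download
NAME_PATTERN = "*"                 # file name pattern to match
PAGE_TYPES = {"", ".html", ".htm"} # extensions treated as pages to parse

def classify_links(page_url, links, download_queue, parse_queue, seen):
    """Sort the links found on one page into the two queues."""
    for link in links:
        url = urljoin(page_url, link)      # resolve relative links
        if url in seen:                    # never queue the same URL twice
            continue
        seen.add(url)
        parts = urlparse(url)
        if parts.scheme not in ("http", "https"):
            continue                       # skip mailto:, ftp:, and so on
        name = basename(parts.path)
        ext = splitext(name)[1].lower()
        if ext in WANTED_TYPES and fnmatch(name, NAME_PATTERN):
            download_queue.append(url)     # matching file: download later
        elif ext in PAGE_TYPES:
            parse_queue.append(url)        # looks like a page: parse later
        # anything else is a file the user did not ask for, so it is dropped

# Example: links taken from one starting page.
downloads, pages, seen = deque(), deque(), set()
classify_links("http://example.com/a/index.html",
               ["song.mp3", "../b/page.html", "pic.gif"],
               downloads, pages, seen)
```

A full crawler would then pop the next URL from `pages`, fetch it, extract its links, and call `classify_links` again until the queue is empty or a depth limit is reached.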

Future areas of development

Bugs and Issues

Current Status

The crawler has been completely rewritten, with support for Mac. The Mac version is not posted here due to size constraints, but it is available on request. The crawler is now HTTP/1.1 compliant and supports transparent redirection and resuming of downloads. HTML parsing is greatly improved, but parsing JavaScript for links has yet to be added. The new version searches images, hyperlinks and frames for links and can search for any type of file specified by the user.
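
As a rough illustration of two of the features mentioned above, the sketch below pulls links out of hyperlinks, images and frames, and builds the request header commonly used to resume a partial download over HTTP/1.1. It is written in Python with the standard `html.parser` module and is not taken from Assimilator itself; the names `LinkExtractor`, `extract_links` and `resume_headers`, and the tag-to-attribute mapping, are assumptions made for the example.

```python
from html.parser import HTMLParser

# Tag -> attribute that carries the link (assumed mapping for this sketch).
LINK_ATTRS = {"a": "href", "img": "src", "frame": "src"}

class LinkExtractor(HTMLParser):
    """Collects link targets from hyperlinks, images and frames."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        wanted = LINK_ATTRS.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                self.links.append(value)

def extract_links(html):
    """Return link targets in the order they appear in the HTML text."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links

def resume_headers(bytes_already_saved):
    """Headers asking an HTTP/1.1 server for the remainder of a file."""
    if bytes_already_saved <= 0:
        return {}                          # nothing saved yet: plain request
    return {"Range": f"bytes={bytes_already_saved}-"}
```

A server that honours the `Range` header answers with status 206 (Partial Content) and only the missing bytes; one that ignores it answers 200 with the whole file, in which case the download has to start over.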

Last version 0.030630b


Copyright 2002 Nick Lott & BrokenToaster Software
Generated by Head to Toe at 11:52:23 p.m. on Friday, 10 October 2003.