Related projects, literature and other links
HarvestMan on the web
- Linuxlinks.com Page on Search Utilities
- HarvestMan in FSF directory
- Superdownloads at ubbi.com
- Unalog tag for HarvestMan
- HarvestMan@linuxsoft.cz
- Darwin port of HarvestMan
- Freshmeat project page
- HarvestMan@Python Cheeseshop
Projects/places using HarvestMan
Robets, Crawlers and Gatherers
- Combine - Open system for harvesting
- Crawling in Perl - A Quick Tutorial
- Larbin - Multi-Purpose Web Crawler
- Puf - Parallel URL Fetcher
- SuckMT - Multiconnection NNTP Downloader
- HTTrack - Website copier
- Heritrix - W3C archival quality web crawler
- Mercator web crawler from Compaq
- Ubicrawler - Scalable, fully distributed web crawler
Literature on crawlers, distributed search systems
- Ubicrawler - Scalable, fully distributed web crawler
- Minimizing network distance in distributed crawlers - Odysseas Papapetrou and George Samaras
- Distributed crawling using migrating crawlers - Odysseas Papapetrou, Stavros Papastavrou and George Samaras
- Anatomy of a Search Engine - The original paper on Google
Distributed Computing projects
- The HarvestMan Web Crawler