oBot is the web crawling bot of the Content Security Division of IBM Germany Research & Development GmbH. We use several computers to crawl webpages and a large computer cluster to categorize the content of these pages. The result of this analysis is a compact webfilter database that is made available to our customers in several content filtering products including an SDK for OEM partners. Using several algorithms we can assign more than 65 different categories (http://www.cobion.com/support/techsupport/dbcategories/) to webpages.
http://filterdb.iss.net/crawler/