Mixnode favicon

Mixnode

Mixnode is a fast, flexible, massively scalable platform to extract and analyze data from the web. Mixnode allows you to think of all resources on the web as rows in a database table; a giant database table with billions of rows that you can query using the standard Structured Query Language (SQL). So, rather than running web crawlers/scrapers you can write simple queries in an ultra-flexible language to retrieve all sorts of interesting information from the web.

StormCrawler

StormCrawler

StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm. The project is under Apache licens ...

Darcy Ripper

Darcy Ripper

Darcy Ripper is a powerful pure Java multi-platform web crawler with great work load and speed capabilities. Darcy is a standa ...