- 3 Viable Ways to Extract Data from the Open Web - Mar 11, 2016.
We look at 3 main ways to handle data extraction from the open web, along with some tips on when each one makes the most sense as a solution.
Crawler, import.io, Web Mining, Web services, Webhose.io
- How to get structured data from the web without crawling - Mar 10, 2016.
When you need data from the web, you don't have to build a crawler. Webhose.io does the heavy lifting for you. Its crawlers download and structure millions of posts a day, and store and index the data so all you have to do is to define what data you need.
Crawler, Unstructured data, Web services, Webhose.io
- DIY Crawlers vs. Crawlers as Service - Dec 22, 2014.
Crawling structured data from the web has been made easier with the choice between crawlers as a service, like webhose.io, and do-it-yourself, like import.io.
Big Data Services, Crawler, Data Preparation, import.io, Spider, Webhose.io