How to build a Jobs Aggregation Search Engine with Nutch, Apache Solr and Views 3 in about an hour
Nutch is an open web crawler that lets you do fine grained or Internet wide web crawling. In this session I will introduce you to the Drupal Nutch module, which will help with the setup and control of your crawls. We will combine this with some of the new features in the Apache Solr, Views 3 and Apache Solr views to create hybrid search engine vertical that interleaves your content with supporting web content.
The Agenda will be:
1. An introduction to the Apache Nutch crawler
2. An introduction to the Features of the Drupal Nutch module
3. Technical Design decisions on combining crawled data with your Drupal data in Apache Solr
4. Bringing it all together with a demo of a jobs aggregation search engine
Industry: education, entertainment, library, media