8 Search APIs for Hack Days
I spent most of yesterday at The Guardian's first ever internal Hack Day. I gave one of the short five minute 'lightning' talks that got the event kicked off. I wanted to outline a few of the search related APIs out there that might prove useful as people put their hacks together. Here's the written down version...
Google AJAX Search API
Google AJAX Search API Wizards
The tool takes the 'wizard' approach to building your code, but I can't guarantee it will be a more pleasant user experience than a Microsoft Office wizard.
Hang on....scratch that.
Of course I can guarantee that it will be a better user experience than a Microsoft Office wizard.
Google Custom Search Engine
Another tool from Google is their Custom Search Engine. Here you can build your engine using results from one site, a sub-section of a site, a group of sites, or even just a specific collection of web pages. It is how I built Chipwrapper. There are also additional tweaks you can apply to the results, and you can change the look and feel of the results to match your own site.
Yahoo! Search BOSS
Now a rather less homicidal system called BOSS exists: Yahoo!'s Build your Own Search Service.
This is your gateway into the Yahoo! Search engine, with access not just to their web index, but also their image search and a 30 day index of news stories.
SearchMonkey is a different kind of search API from Yahoo!. Instead of extracting information from the Yahoo! engine, you add value to the results by blending in your own structured data. This allows you to control the appearance and information presented in search results for your content.
Yahoo! SearchMonkey Wizard
Like the Google AJAX API, Yahoo! offer a wizard-style approach to building SearchMonkey hacks. I haven't used it myself, but it has a great big cartoon of a monkey wearing glasses, so how bad can it be?
Yahoo! Term extraction engine
Another useful little tool from Yahoo! for hacks is the term extraction engine. You send it a chunk of text or an article, and in return you get an XML file which includes the key terms and concepts used in the text.
Open Calais is a similar tool to the Yahoo! term extraction engine, but with a more modern spin. Billed as part of 'Web 2.0, Web 3.0, the Semantic Web or the Giant Global Graph', Open Calais also takes text and returns to the user what it thinks the article is about. In this case it identifies entities like people, places, companies, geographies, and returns rich semantic mark-up indicating what it found, and how the entities relate to each other.