I spent most of yesterday at The Guardian's first ever internal Hack Day. I gave one of the short five minute 'lightning' talks that got the event kicked off. I wanted to outline a few of the search related APIs out there that might prove useful as people put their hacks together. Here's the written down version...
I'll start with the big one - the Google AJAX Search API. This lets you put Google Search in your web pages with JavaScript. As well as the web index, you can embed search boxes and results from blog and news search, access the information in Google Local, and search over video content from YouTube.
If, like me, the last bit of JavaScript coding you did was to put a scrolling marquee in the furniture of the Netscape browser back in the late '90s, then you'll be relieved to know that Google offer a second approach to their API. Without having to know anything about how the code works, you can build custom search pages and features for your hack.
The tool takes the 'wizard' approach to building your code, but I can't guarantee it will be a more pleasant user experience than a Microsoft Office wizard.
Hang on....scratch that.
Of course I can guarantee that it will be a better user experience than a Microsoft Office wizard.
Another tool from Google is their Custom Search Engine. Here you can build your engine using results from one site, a sub-section of a site, a group of sites, or even just a specific collection of web pages. It is how I built Chipwrapper. There are also additional tweaks you can apply to the results, and you can change the look and feel of the results to match your own site.
Doctor Who nerds will recognise BOSS as the name of the evil supercomputer in the Jon Pertwee story officially known as "the one with the maggots". [1]
Now a rather less homicidal system called BOSS exists: Yahoo!'s Build your Own Search Service.
This is your gateway into the Yahoo! Search engine, with access not just to their web index, but also their image search and a 30 day index of news stories.
SearchMonkey is a different kind of search API from Yahoo!. Instead of extracting information from the Yahoo! engine, you add value to the results by blending in your own structured data. This allows you to control the appearance and information presented in search results for your content.
Like the Google AJAX API, Yahoo! offer a wizard-style approach to building SearchMonkey hacks. I haven't used it myself, but it has a great big cartoon of a monkey wearing glasses, so how bad can it be?
Another useful little tool from Yahoo! for hacks is the term extraction engine. You send it a chunk of text or an article, and in return you get an XML file which includes the key terms and concepts used in the text.
Open Calais is a similar tool to the Yahoo! term extraction engine, but with a more modern spin. Billed as part of 'Web 2.0, Web 3.0, the Semantic Web or the Giant Global Graph', Open Calais also takes text and returns to the user what it thinks the article is about. In this case it identifies entities like people, places, companies, geographies, and returns rich semantic mark-up indicating what it found, and how the entities relate to each other.
Happy hacking!
[1] The acronym stands for 'Biomorphic Organisational Systems Supervisor', appearing in the story "The Green Death" [Return to article]
Search
Training
"Learn to blog smart: join the conversation" with Martin Belam - London 26 February, 2009
Download e-books
About Martin Belam
I'm a London-based internet consultant and writer, with 8 years experience in product management, information architecture, and user experience design for global brands like Sony, Vodafone, The Guardian and the BBC. I specialise in advising on search, widgets, RSS, online news publishing and bulk email delivery.
Martin Belam CV
email: martin.belam@currybet.net
tel: +44 (0) 7801 828718
twitter: currybet
About Martin Belam and this site
Recent posts
Recent links
Recent comments
Popular posts
Popular categories
BBC, Doctor Who, Ghost Walks, Media, Music, Newspapers, Search, Social media, Web
2 comments so far
Hey, it doesn't matter what the API is like. Anything with a cartoon monkey wearing glasses wins in my book!
Many thanks for including Calais, Martin.
Come January, we'll be returning dereferancable URIs, connecting the the Linked Data cloud. See details at OpenCalais.com/blog.
-Krista from the Calais team