8 Search APIs for Hack Days

Martin Belam
Written by
Published 14 November, 2008
Categories: , , ,

<< previous | next >>
2 comments so far 
Add your comment Add your comment

I spent most of yesterday at The Guardian's first ever internal Hack Day. I gave one of the short five minute 'lightning' talks that got the event kicked off. I wanted to outline a few of the search related APIs out there that might prove useful as people put their hacks together. Here's the written down version...

Google AJAX Search API

I'll start with the big one - the Google AJAX Search API. This lets you put Google Search in your web pages with JavaScript. As well as the web index, you can embed search boxes and results from blog and news search, access the information in Google Local, and search over video content from YouTube.

Google Ajax Api

Google AJAX Search API Wizards

If, like me, the last bit of JavaScript coding you did was to put a scrolling marquee in the furniture of the Netscape browser back in the late '90s, then you'll be relieved to know that Google offer a second approach to their API. Without having to know anything about how the code works, you can build custom search pages and features for your hack.

Google Ajax Wizards

The tool takes the 'wizard' approach to building your code, but I can't guarantee it will be a more pleasant user experience than a Microsoft Office wizard.

Hang on....scratch that.

Of course I can guarantee that it will be a better user experience than a Microsoft Office wizard.

Google Custom Search Engine

Another tool from Google is their Custom Search Engine. Here you can build your engine using results from one site, a sub-section of a site, a group of sites, or even just a specific collection of web pages. It is how I built Chipwrapper. There are also additional tweaks you can apply to the results, and you can change the look and feel of the results to match your own site.

Google custom search engine

Yahoo! Search BOSS

Doctor Who nerds will recognise BOSS as the name of the evil supercomputer in the Jon Pertwee story officially known as "the one with the maggots". [1]

Jon Pertwee Boss

Now a rather less homicidal system called BOSS exists: Yahoo!'s Build your Own Search Service.

Yahoo! Search BOSS

This is your gateway into the Yahoo! Search engine, with access not just to their web index, but also their image search and a 30 day index of news stories.

Yahoo! SearchMonkey

SearchMonkey is a different kind of search API from Yahoo!. Instead of extracting information from the Yahoo! engine, you add value to the results by blending in your own structured data. This allows you to control the appearance and information presented in search results for your content.

Search Monkey Diagram

Yahoo! SearchMonkey Wizard

Like the Google AJAX API, Yahoo! offer a wizard-style approach to building SearchMonkey hacks. I haven't used it myself, but it has a great big cartoon of a monkey wearing glasses, so how bad can it be?

Search Monkey Platform

Yahoo! Term extraction engine

Another useful little tool from Yahoo! for hacks is the term extraction engine. You send it a chunk of text or an article, and in return you get an XML file which includes the key terms and concepts used in the text.

Yahoo Term Extractor

Open Calais

Open Calais is a similar tool to the Yahoo! term extraction engine, but with a more modern spin. Billed as part of 'Web 2.0, Web 3.0, the Semantic Web or the Giant Global Graph', Open Calais also takes text and returns to the user what it thinks the article is about. In this case it identifies entities like people, places, companies, geographies, and returns rich semantic mark-up indicating what it found, and how the entities relate to each other.

Open Calais

Happy hacking!



[1] The acronym stands for 'Biomorphic Organisational Systems Supervisor', appearing in the story "The Green Death" [Return to article]

2 comments so far

Hey, it doesn't matter what the API is like. Anything with a cartoon monkey wearing glasses wins in my book!

Many thanks for including Calais, Martin.

Come January, we'll be returning dereferancable URIs, connecting the the Linked Data cloud. See details at OpenCalais.com/blog.

-Krista from the Calais team

Leave your comment

A limited set of HTML tags are allowed in comments: a href, strong, em, ul, li, blockquote
To protect against spam your comments will not appear on the site until I have manually published them.
Your email address will never appear on the site.

  

  

  


Alan Turing wouldn't be impressed with this crude test, but please prove you are a person and type toothpaste into the box below.

Search

Get updates by Email or RSS


Email icon    RSS icon

Sign up to get free updates by email
  

Training

"Learn to blog smart: join the conversation" with Martin Belam - London 26 February, 2009

About Martin Belam

I'm a London-based internet consultant and writer, with 8 years experience in product management, information architecture, and user experience design for global brands like Sony, Vodafone, The Guardian and the BBC. I specialise in advising on search, widgets, RSS, online news publishing and bulk email delivery.
Martin Belam CV
email: martin.belam@currybet.net
tel: +44 (0) 7801 828718
twitter: currybet
About Martin Belam and this site

Recent posts

Popular categories

BBC, Doctor Who, Ghost Walks, Media, Music, Newspapers, Search, Social media, Web