8 Search APIs for Hack Days

Martin Belam
Written by
Published 14 November, 2008
Categories: , , ,

<< previous | next >>
2 comments so far 
Add your comment Add your comment

I spent most of yesterday at The Guardian's first ever internal Hack Day. I gave one of the short five minute 'lightning' talks that got the event kicked off. I wanted to outline a few of the search related APIs out there that might prove useful as people put their hacks together. Here's the written down version...

Google AJAX Search API

I'll start with the big one - the Google AJAX Search API. This lets you put Google Search in your web pages with JavaScript. As well as the web index, you can embed search boxes and results from blog and news search, access the information in Google Local, and search over video content from YouTube.

Google Ajax Api

Google AJAX Search API Wizards

If, like me, the last bit of JavaScript coding you did was to put a scrolling marquee in the furniture of the Netscape browser back in the late '90s, then you'll be relieved to know that Google offer a second approach to their API. Without having to know anything about how the code works, you can build custom search pages and features for your hack.

Google Ajax Wizards

The tool takes the 'wizard' approach to building your code, but I can't guarantee it will be a more pleasant user experience than a Microsoft Office wizard.

Hang on....scratch that.

Of course I can guarantee that it will be a better user experience than a Microsoft Office wizard.

Google Custom Search Engine

Another tool from Google is their Custom Search Engine. Here you can build your engine using results from one site, a sub-section of a site, a group of sites, or even just a specific collection of web pages. It is how I built Chipwrapper. There are also additional tweaks you can apply to the results, and you can change the look and feel of the results to match your own site.

Google custom search engine

Yahoo! Search BOSS

Doctor Who nerds will recognise BOSS as the name of the evil supercomputer in the Jon Pertwee story officially known as "the one with the maggots". [1]

Jon Pertwee Boss

Now a rather less homicidal system called BOSS exists: Yahoo!'s Build your Own Search Service.

Yahoo! Search BOSS

This is your gateway into the Yahoo! Search engine, with access not just to their web index, but also their image search and a 30 day index of news stories.

Yahoo! SearchMonkey

SearchMonkey is a different kind of search API from Yahoo!. Instead of extracting information from the Yahoo! engine, you add value to the results by blending in your own structured data. This allows you to control the appearance and information presented in search results for your content.

Search Monkey Diagram

Yahoo! SearchMonkey Wizard

Like the Google AJAX API, Yahoo! offer a wizard-style approach to building SearchMonkey hacks. I haven't used it myself, but it has a great big cartoon of a monkey wearing glasses, so how bad can it be?

Search Monkey Platform

Yahoo! Term extraction engine

Another useful little tool from Yahoo! for hacks is the term extraction engine. You send it a chunk of text or an article, and in return you get an XML file which includes the key terms and concepts used in the text.

Yahoo Term Extractor

Open Calais

Open Calais is a similar tool to the Yahoo! term extraction engine, but with a more modern spin. Billed as part of 'Web 2.0, Web 3.0, the Semantic Web or the Giant Global Graph', Open Calais also takes text and returns to the user what it thinks the article is about. In this case it identifies entities like people, places, companies, geographies, and returns rich semantic mark-up indicating what it found, and how the entities relate to each other.

Open Calais

Happy hacking!



[1] The acronym stands for 'Biomorphic Organisational Systems Supervisor', appearing in the story "The Green Death" [Return to article]

2 comments so far

Hey, it doesn't matter what the API is like. Anything with a cartoon monkey wearing glasses wins in my book!

Many thanks for including Calais, Martin.

Come January, we'll be returning dereferancable URIs, connecting the the Linked Data cloud. See details at OpenCalais.com/blog.

-Krista from the Calais team

Leave your comment

A limited set of HTML tags are allowed in comments: a href, strong, em, ul, li, blockquote
To protect against spam your comments will not appear on the site until I have manually published them.
Please don't 'keyword stuff' your name to promote your site.
Your email address will never appear on the site.
To get a picture icon that will appear here, and on many other sites, please visit Gravatar

  

  

  


Alan Turing wouldn't be impressed with this crude test, but please prove you are a person and type toothpaste into the box below.

Search this site

Get free updates


Email icon   RSS icon

Sign up for free email updates
  

Posts of the moment


Michael Jackson headlines

"Michael Jackson's death spares the BBC"
What would have been on Friday's front pages.


Martin Belam presenting at the Social Media Summit

How major publishers are using social media to drive traffic
An article based on my presentation at May's International Social Media Summit.


London IA Mini

"London IA Mini II"
My write-up of the recent London IA Mini Conference at the Sense Loft in London.