BBC Complaints site in RSS

Martin Belam
Written by
Published 11 February, 2005
Categories: ,

<< previous | next >>
1 comment so far 
Add your comment Add your comment

So I've mentioned before that the BBC has launched a new Complaints site. From my point of view the most important thing over the next couple of months is to monitor how much mail the BBC receives, and how much publishing is done on the site, which is acting as a central hub for official responses. Both of those will give me a good idea of how likely the site is to impact on the resources I have available to work on it, and, to be honest, I'm very interested in how the BBC reacts as a corporate entity to having this new channel to communicate with the audience.

Screen grabs from the BBC Complaints site

Being a busy and impatient person I really didn't want to have to check the site every day to see if a new response has been published. At the moment the BBC does not habitually publish on the live site RSS feeds of content, even when they have been generated by a mini-CPS that could do such a thing. So I've written my own script to scrape the site and generate an RSS feed of the BBC's responses to complaints.

Screen grabs of BBC responses to complaints published on bbc.co.uk

The script works by scraping the "Most Recent Responses" section from the Read Our Responses page, then running off to fetch the opening couple of paragraphs of each response to act as the <description> element in the RSS feed. So I can now pop it in as a subscription within my bloglines account, and keep up-to-date on it for work purposes without having to visit the site.

Screen grab of the RSS feed from the BBC Complaints site as displayed in bloglines

Of course, being based on screen-scraping the HTML, it is no doubt going to be very brittle, and a case of break followed by iterate followed by break followed by iterate. I suspect it won't validate as proper RSS 0.91 either, as I can't control which HTML tags might end up included in the <description>. And I so far haven't bothered to extract the date-stamp of the actual posts.

But, if these guys can parse something as difficult as Hansard, surely even my tiny grasp of perl should help enable me to glean the information I need to do my job :-)

1 comment so far

What a disaster giving the BBC monopoly coverage of the Olympics, if medals were awarded for chat and spin plus useless interviews then the BBC would top the table.
They used to be the greatest at cover but its all slipped with continual studio guests and less action than ever.
Could we hope for a return to previous cover with professionals fronting events and the focus being on the competitor rather than the presenter.
Very disappointed sports fan.

Leave your comment


Alan Turing wouldn't be impressed with this crude test,
but please prove you are a person and type toothpaste into this box:
  

A limited set of HTML tags are allowed in comments: a href, strong, em, ul, li, blockquote
To protect against spam your comments will not appear on the site until I have manually published them.
* Your email address will never appear on the site.

Search


Search powered by Google

Subscribe

Subscribe via email or RSS RSS icon
Get updates to currybetdotnet sent to you via email

About Martin Belam

I'm an Internet consultant and writer, with 8 years experience in product management, information architecture, and user experience design for global brands like Sony, Vodafone, The Guardian and the BBC. I specialise in advising on search, widgets, RSS, online news publishing and bulk email delivery.
Martin Belam CV
email: martin.belam@currybet.net
tel: +44 (0) 7801 828718
About Martin Belam and this site

Popular categories

BBC, Doctor Who, Ghost Walks, Media, Music, Newspapers, Search, Web

See all Categories