How many British newspapers use sitemap.xml to help search engine indexing?
" 'a newspaper site search ought to be able to index content directly from a CMS faster than Google can crawl a site' - Perhaps, but this seems kind of irrelevant. Any well built site will be creating a sitemap.xml and pinging Google whenever any updates are made so that it's indexes will be updated immediately... "
For those unfamiliar with the protocol, a Sitemap in this sense is an XML file that should list the URLs of all of the pages a site wants to see included in search engine indexes, with additional information about how regularly they are updated, and how important they are relative to each other. Unlike ACAP, the protocol is actively supported by Google, Microsoft and Yahoo!
Now, I'm guessing that Bruce is not too familiar with newspaper and news organisation content management systems, if he expects that over the last decade or so they have all been built to generate well-formed XML sitemaps and automatically ping Google. Of course, it begged the question, how many British newspapers do have a Sitemap file?
The other newspaper to implement the protocol is The Scotsman, which always seems to have a very forward thinking website. Rather than split their sitemap.xml file up as the Mail does, they've chosen to include the URLs in a single file at the root of the domain.
From the client-side, though, it doesn't appear that any of the other papers are currently taking advantage of the technology.