google book search, info from the source

James Jacobs — the guy from diglet who had been writing to Google to try to get “find in a library” added to ALL Google Book Search results — went to see Daniel Clancy, the Engineering Director for the Google Book Search Project speak at Stanford. While the talk wasn’t to librarians and wasn’t really about the social implications of the book search, James did learn a few things.

– Clancy mentioned that Google was NOT going for archival quality (indeed COULD not) in their scans and were ok with skipped pages, missing content and less than perfect OCR — he mentioned that the OCR process AVERAGED one word error per page of every book scanned
– about 70% of the book project use was coming from India.
– 92% of the world’s books are not generating revenues for copyright holders or publishers

If Googl Book Search really interests you, you might also like to read The Google Library Project: Both Sides of the Story [pdf, today’s library link o’ the day] which discusses some of the misinformation and lawsuits surrounding the Google Library porject and comes down on the side of Google’s fair use position.

Google Books public domain book curiosities

Hey clue club, any Harvard or Boston area librarians want to solve the what the heck is this mystery alluded to on this blog post? It looks like a handwritten version of the poem printed in the book, but without page numbers or any other indication that it’s part of the book. Table of Contents is mum on what’s going on. Anyone know, or want to go check out the book at Harvard and see? [thanks chase]

a small foray into Google Books

You can use the date operator to browse public domain books in Google Books. I’m not entirely sure why the covers of some of these books remain under copyright. Any ideas? I’ve also noticed a few scanning errors and some pretty neat finds like this one which gives the name of every librarian in the US and Canada working in a library holding over 1,000 volumes. Google Books clearly uses keyword indexing to make these books searchable. How great would it be to have this one in a database? You can see a few images that I particularly liked over at Flickr.