code4lib Day 1 morning talks
Feb 27th, 2008 by Karen
Brewster Kahle’s Keynote
How to Build a great digital library together!
We’re paid to give stuff away
Spend our money better!
Time to work together even though we are bulkanized into our own organizations
What is the Internet Archive is trying to do to work towards this?
- Works with libraries
- Lots of copies keep stuff safe
- Mirrors in other places
- Collecting the World Wide Web
- centralize an archive of the web
- Wayback Machine
- 500 hits / second
- Audio
- only 2-3 million volumes of music out there
- lots of niches not served well by the classic publishing industry
- Tradition of tape trading of rock concerts
- Talk to community
- got okay from bands
- community created metadata
- Digitize disks - Open Audio
- Play the role of dumb back-end storage
- Moving Images
- 150,000 - 200,000 feature films
- most are under copyright
- Popular - movies that people couldn’t get any other way
- Government films
- Training films
- Educational films
- Public domain collection
- Put up people’s personal collections
- homemade movies
- support communities
- Television
- what did the world see?
- News comes with a point of view
- record the stuff
- what did the world see?
- Software
The idea of providing universal access to everything ever published is within our grasp
Texts
Journals
- We’ve given it to 2-3 companies and now we have to rent it back!
Books
- 26 million books in LOC - 26 tetrabytes of storage
- One webpage for every book ever published
- scan books
- your own books!
- robots don’t work well enough. People are better
- catalog data
- mass digitization within the public sphere
- scan books
- Need to fight for the right to give stuff away!!
- Support the scanning centers!!
- Scan on Demand
- materials out of copyright
- materials out of print
- Need Open OCR
- need support for lots of languages
- cultural diversity - cultures dying
- need support for lots of languages
- Scan all microfilm
- loan scanner
- library has to pay for the labor only
- Need better selection
- critical mass for books in each subject area
- communities that care
- Libraries see there responsibility only towards digitized materials that they own physically
- Building a catalog
- building library and non-library records
- FRBR is a must!!
- Open Library Project
- Talis is giving their records!!
- Displays
- open APIs
- Great search
- Many views
- Wiki++
- Printing out books again
- Bookmobile
- Study at Harvard can be cheaper to give a book away than loan a book
- Print on demand kiosks
- Library in a mall slot
- One laptop per child as an open e-book reader
How is this going to happen?
- Public or private?
- Open or Proprietary?
- Negotiate contracts for content solely or providing actual services and content
Fundemental Question
What services will library perform and which will patrons get from companies or public institutions that act like companies?
Open Content Alliance
- Build joint repositories of digital materials
Working together we can build a great library
Contribute
- time/code
- digital materials
- catalog records
- selction help
- labor to digitize
- Links to the Open Library
Access drives preservation - Keep things in the flow
Finding Relationships in MARC - Rob Styles from Talis
Rob Styles gave a really cool presentation on mapping and showing relationships in MARC data using RDF. My notes just don’t do it justice because I’m lacking the really excellent graphics he used for explanations. So check out the video on the code4lib site when it goes up later.
MARC data and Authority Data is used to create RDF graph
RDF
- Triples
- something -> title -> Harry Potter and the Prisoner of Azkaban
- subject -> predicate -> object
Relationships built out of the MARC are very complex
Generate URIs from data in MARC record
Makes it easier to merge datasets
Apply this to the authority data as well
- Built relationships to authorized form
What this allows you to do
- Build a Relation Browser
- Visualization of the data
- Get the semantics of the data
There were four other morning sessions that I missed significant pieces of because of dealing with some conference logistics and personal stuff. Luckily all the presentation are being videoed (YAY Noel!) so I can them out later. Maybe on the plane ride home if I’m lucky.
I was most interested in the presentations Delivering Library Services in the Web 2.0 environment: OSU Libraries Publishing System for and by Librarians - Kim Griggs, Margaret, Mellinger, and Jane Nichols and Working with the WorldCat API - David Walker. The tool created by the folks at Oregon State to help librarians publish course web pages is really neat. It has a lot in common with our home grown CMS and I want to download it and check it out in more detail later. I also am interested in spending some time with the WorldCat API because I think it may be useful to integrate this into our CMS and blogs. So I’ll have to watch David Walker’s presentation too!

