Today I spent some time talking with Brad about various things, cleaning up the computer I've been using down in the Spec room (Systems apparently replaced the machine since I was here last - some shortcuts were missing, I couldn't find any documents (including my revised scanning procedure - yikes!) but found them eventually, and all of the browser bookmarks are gone!). So, it took me a bit of time to clean things up as best I could. I made sure all the shortcuts that used to be on the desktop were there, added some shortcuts to the Start menu, I set Gaylord's main page as the home page on each of the browsers, and I started adding some bookmarks to Firefox. I hope no one had anything important bookmarked on this machine - we might be able to get them back from SOS, but I'm not sure...
Connexion was okay except that the authorizations had disappeared so I'll have to bring over that info tomorrow and fix it (unless Mark needs to fix it before I get here). At least version 1.6 is still on it - hooray! Several machines mysteriously migrated back to 1.5 recently...
I also spent some time looking at the DLXS website for more information about BibClass. I found a document that is essentially a simple tree structure for the bib.dtd. I think this will be helpful as we figure out how things are going to be mapped/transformed. I found information about a script that was used to transform MARC records from NOTIS into bib.dtd, but I don't know how useful this would be since a lot of information would need to be changed (a lot of it was specific to NOTIS). It's a nice idea, though. If I knew more about programming, I'd write a script to transform MARC records from III into bib.dtd!
I explored a little bit about Unicode also. Apparently if the data does not come to you in Unicode UTF-8 encoded XML, conversion (one or two steps) will be necessary. I was wondering how diacritics would transfer. I'm not really sure what will be needed in this regard. I'm embarassed to say I'm not even sure what encoding our MARCXML document ended up with - that's something I'll need to look into. It's probably just ASCII...?
I also read some more about broker20, the program that produces XML responses to OAI verbs as dictated by version 2.0 of the OAI protocol. This step seems relatively simple, once everything is living happily in BibClass (knock on wood).
I'm having trouble with some of the BibClass syntax. For example, I'm not sure how "collections" and "groups" relate, except that as far as I can tell, one can have several collections in a group? A collection is essentially a bibliographic index that consists of all bibliographic fields in SGML or XML format, conforming to bib.dtd. I'm not sure I quite follow that, either, now that I think about it...
Reading more about the BibClass DTD it was reiterated that it is not intended to replace the robustness of MARC. It is simple and succinct, and it is a "tight" data model which means that it often constrains which fields may or must be used as well as what sorts of information can go into those fields (in stark contrast to Dublin Core, which is very flexible (fields are repeatable and optional)).
Another useful thing I found is an example of a UMR (Univ. of Michigan Reports, I believe) record in BibClass DTD. It shows the populated data fields as the user would see them and then also as the code source appears. I think this will be very helpful in trying to figure out what's going to go where.
I learned that the basic fields currently include: author, title, entire record, publisher, place of publication, year (of publication), series, notes, collection ID, format, type, language, ID (of the record, like the III .b number I would assume), and dt (OAI-specified date of last update for a record). Apparently it IS possible to define additional fields or substitute values for fields already specified, but one must create a NEW bib.map instead of revising the old one. I'm not entirely clear on what the bib.map is to begin with, but it's good to know that things aren't quite as hard-and-fast as we perhaps initially thought.
Hours today: 3 (4:15-7:15pm)
Hours this week: 3
Total hours completed: 83.5