Today I spent some more time looking at scanning standards and experimenting a bit with different resolutions. Here is a synopsis of what I discovered about other institutions' (a select number and with varying degrees of specificity!) sheet music digitization standards:
1) Duke (Historic American Sheet Music) - all scanned at 150 dpi with 75 dpi access images and thumbnails produced later by an automated process
2) Detroit Public (E. Azalia Hackley) - 300 ppi (ppi is more accurate than dpi, since we're really talking about pixels per inch on a monitor instead of dots per inch when printing, but generally people just use dpi anyway), 24-bit RGB color; access image 72 dpi, 600 pixel width; thumbnail with standard height of 150 pixels generated using
IrfanView (I'd like to look at IrfanView further at some point, just to see what it can do)
3) 19th Century California Sheet Music - 400 dpi
4) Brown African American Sheet Music - 300 dpi master; high-res download (JPEG, 1650 p tall); printing (JPEG, 860 p tall); page-turner (JPEG, 400 p tall).
Brown claims that 300 dpi (as opposed to 150 dpi) made a significant difference in detail and color fidelity, particularly in the printing version.5) LC Music for the Nation - bitonal TIFF first at 300 dpi and then later switched to 400 with Group IV compression; 200 dpi grayscale - "tonal capture was able to suppress some of the print-through" (print-through was one problem I was coming across in scanning the pieces from the bound volume, but I had trouble finding more information about this "tonal capture")
6) Sheet Music from Canada's Past - 300 dpi, 24-bit RGB for color covers; 600 dpi for Line Art (all pages, including cover, scanned as bitonal images) - this one didn't really make sense to me, need to find more info about bitonal images
7) Indiana University Sheet Music Collections - 300 dpi, 24-bit RGB; for interior pages and covers printed in b&w - 8-bit grayscale; IN Harmony: Sheet music from Indiana collection scanned at 400 ppi
8) UC Boulder - 400 dpi, copied as JPEG for web site, music converted to grayscale for web (better image and print copy)
So, the general consensus seems to be that 300 dpi is sufficient for sheet music purposes. In my experimentation, though, I failed to see a huge difference that would make switching to 300 dpi worthwhile. Though I did not experiment with printing the PDF images (I didn't want to waste too much ink!) - doing so might change my mind, but I still can't image that the difference would be THAT remarkable. I did see a big enough difference, with the particular things I'm scanning, between black & white and grayscale that I highly recommend using grayscale for older, discolored items that have some bleed-through. The black and white scans would certainly print better (more quickly and using less ink), but I think the quality was really not acceptable in this case. I saved examples of each type that I can show Brad and we can compare. Again, this is just for one type of sheet music - I don't know that I'll have time to experiment with any other things (like color covers, etc., but I believe the procedures for that already call for scanning at 300 dpi, so I expect that that is quite sufficient). Brad had mentioned that if at all possible he prefers actual music to be scanned in black and white since it will print better, which I agree with. The problems come when you have older paper with lighter ink, bleed-through and such.
I looked again at Howard Besser's (Getty)
Introduction to imaging - he talks a lot about the need to find the right balance between getting a high quality image and the file size. The better the quality, the bigger the file size and the harder it is to manage (store, load, display, print).
Though of course one usually creates a large, high quality, uncompressed master file (usually a tiff file) and then one can create smaller, more manageable access files from that master. One can create access files for viewing, for download, for printing, thumbnails, etc. He goes into great detail about image reproduction and explains resolution and compression quite clearly. The details and issues surrounding color management are very complex.
I like this quote in the section on Image capture:
"Digitizing to the highest possible level of quality practical within the given constraints and priorities is the best method of "future-proofing" images to the furthest extent possible against advances in imaging and delivery technology. Ideally, scanning parameters should be "use-neutral," meaning that master files are created of sufficiently high quality to be used for all potential future purposes. When the image is drawn from the archive to be used for a particular application, it is copied and then optimized for that use (by being compressed and cropped for Web presentation, for instance). Such an approach minimizes the number of times that source material is subjected to the laborious and possibly damaging scanning process, and should emerge in the long term as the most cost-effective and conservation-friendly methodology. "
I also did a bit of cataloging today. Note to myself to remember to pick up aforementioned book at WC tomorrow.
Hours today: 2.5 (4:15-6:45pm)
Hours this week: 2.5
Total hours completed: 65
(I need to get in about 6 hours per week in order to finish by the end of November (shouldn't be a problem), so I'm starting to keep track of hours per week as well as per day and total)