
July 17, 2006

Are Digital Genealogy Libraries Going to Replace Traditional Books?

Is it time to stop the presses?

It seems that every week I report in this newsletter about more and more genealogy books that are being converted to electronic format. Sure, old books have been digitized for several years now. However, even new books are now appearing as electronic publications.

One example is the 5th Edition of The Genealogist's Address Book, by Elizabeth Petty Bentley, published by Genealogical Publishing Company. It is available on a CD-ROM disk or as a traditional (paper) book. The first four editions of The Genealogist's Address Book were printed on paper, but the economics caught up with reference books. Each new edition costs more and more to print. As prices escalated, sales decreased. Many people could not afford the higher prices. The new 800+ page 5th Edition now costs $49.99 for the paper version, but the CD-ROM version costs only $19.99. The CD-ROM version reportedly has sold more copies than has the paper version.

This is only one such example; there are many more. Is this an indication of the end of book publishing as we know it? Will simple economics drive printed books out of existence?

Many bibliophiles cringed when the Internet search engine Google announced plans to digitize the book collections of five major libraries. To be sure, there isn't as much personal "touch and feel" with a plastic disk or an online web site. I have read many comments about this, such as, "no one will ever want to read an entire novel on their computer screen," or, "online books will succeed only when every bathroom has a high-speed Internet connection!" I recently read another statement from a librarian: "There's just a coziness with a book. The smell. Can you smell a laptop?"

I believe that librarian's view is a bit too simplistic. Very few people would suggest that all books should be printed forever on paper.

For the rest of this article, let's divide the topic of books into two major categories: (1) books that are meant to be read from cover to cover (such as a novel) and (2) reference books that typically are read only in small segments at a time (such as an encyclopedia).

Novels and other books that are meant to be read from cover to cover probably will never become popular on today's computers. The glare from the screen is enough to dissuade a reader. The bulky, electronic nature of a computer discourages people from reading novels and other books meant to hold your attention from cover to cover. My guess is that most readers will continue to pay a premium price to read a printed novel in place of an electronic one.

All this will obviously change as computers improve. The computers and electronic "book readers" ten or twenty years from now probably will be wafer-thin, flexible screens the size of a piece of paper that you can roll up and stuff into a pocket or purse. They will produce no more glare than a piece of paper, perhaps even less. They will be easier to read than paper. They will operate on batteries that last for twenty, fifty, or even more hours before needing to be recharged. Today's "book readers" are already about the size of a paperback novel and weigh less than one pound. As technology continues to improve, they will become even smaller and lighter. Until that day arrives, however, nobody will want to read "War and Peace" on a computer screen while sunbathing at the beach.

Reference books are an entirely different matter. Encyclopedias, dictionaries, operators' manuals, and other reference materials are generally read only a few pages at a time. Such reference material seems to be much better suited for online or CD-ROM distribution. The bulk of a computer and the screen glare do not seem like major issues when reading only a few pages. Indeed, online encyclopedias such as Wikipedia and Encarta have seen skyrocketing success even as printed reference books such as the Encyclopedia Britannica report lower sales figures every year.

Think of all the genealogy books you have consulted. Aren't most of them reference books? Didn't you consult only a page or two, or maybe five or ten pages? How many genealogy books have you read from cover to cover? I bet it is very few. The Genealogist's Address Book is an excellent example: it is a reference book, and nobody will ever be spellbound by it as they read it from cover to cover.

The conversion of genealogy books to digital formats would seem to make sense, even when "War and Peace," "Gone with the Wind" or "The Da Vinci Code" probably should remain only on paper.

Google has quickly become the dark horse in this topic. No one knows exactly how Google will handle the paper-to-electronic transition. However, the company will scan at least 15 million public domain titles from Stanford, Harvard, Oxford, the University of Michigan, and the New York Public Library. The project is already well underway but could take years to complete and may cost $10 a book. We can assume that the number of genealogy books scanned in this process will be less than one percent of the total. However, even one percent of 15 million is still a lot of books!
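
For readers who want to check the arithmetic behind those figures, here is a minimal sketch in Python. It uses only the numbers quoted above (15 million titles, a possible $10 per book, and a genealogy share of at most one percent). The function name and the constants are hypothetical labels chosen for illustration, and the result is a rough upper-bound estimate, not a figure reported by Google.

```python
# Figures quoted in the article. The one-percent genealogy share is the
# article's assumed upper bound, not a measured value.
TOTAL_TITLES = 15_000_000
COST_PER_BOOK_USD = 10
GENEALOGY_SHARE_PERCENT = 1


def scanning_estimate(total_titles, cost_per_book_usd, genealogy_share_percent):
    """Return (total scanning cost in USD, upper bound on genealogy titles)."""
    total_cost = total_titles * cost_per_book_usd
    # Integer arithmetic avoids floating-point rounding on the percentage.
    genealogy_titles = total_titles * genealogy_share_percent // 100
    return total_cost, genealogy_titles


total_cost, genealogy_titles = scanning_estimate(
    TOTAL_TITLES, COST_PER_BOOK_USD, GENEALOGY_SHARE_PERCENT
)
print(f"Estimated scanning cost: ${total_cost:,}")
print(f"Genealogy titles (upper bound): {genealogy_titles:,}")
```

Under these assumptions, the project would cost about $150 million and would include at most 150,000 genealogy titles.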

In 1450, technologies developed over the previous thousand years were combined to produce a revolutionary new process of printing. By integrating paper invented by the Chinese, movable type first tested by Koreans, and oil-based ink developed by Italian painters, Gutenberg's printing press supplanted laborious hand copying with mass production. A similar revolutionary change is in the works: the digital revolution.

Computer-aided desktop publishing and digitization processes are finally replacing paper and the printing press, making computers and the Internet the modus operandi for information exchange. Neither the printing press nor any computer technology causes revolutionary changes by itself. However, the processes enabled by new technologies are already changing the way we use information and will eventually make printed books obsolete.

Both the printing press and digitization have similar effects on the economics of information: reducing production costs and making knowledge (and entertainment) more accessible. It is a small step to convert encyclopedias, novels, and magazines into digital format.

The Gutenberg revolution brought more than an improvement in printing processes. Lower costs and the increased availability of books contributed to rising literacy, civic and political participation, and dissemination of news and ideas. Printing technology made daily newspapers and magazines possible, and they, in turn, opened new business opportunities and processes of information exchange and consumption. In short, changes in production technology improved the living conditions of all mankind. I believe we will see a repeat of that improvement as the newest technologies again improve literacy, productivity, and worldwide living conditions.

I suspect that economics will drive the entire topic of reference books. Paper and printing prices continue to rise year after year. Prices of CD-ROM disks and online access charges continue to drop year after year. At some point, the two lines cross: it becomes more cost effective to publish electronically than on paper. In fact, I think those two lines crossed years ago. For a number of years, electronic publishing has been more cost-effective than printed books.

I suspect that many more genealogy publishers will soon follow the Genealogical Publishing Company's lead. It simply makes sense to sell a book for $19.99 instead of $49.99. Electronic publishing allows this. Likewise, old out-of-print books can be republished electronically at far better prices than doing the same in print. Archive CD Books USA and other companies have created thriving businesses by making out-of-print books available to genealogists in electronic format at reasonable prices.

Will printed books disappear? Certainly not. I suspect that non-reference materials will be around in print for many more years. However, genealogy and other reference books will be digitized whenever possible. Within the next decade, I suspect that almost all genealogy publishers will convert to electronic publication, whether it is on CD-ROM disks or on an Internet file server somewhere. In fact, the costs of online publishing and online wireless access are dropping so fast that I believe CD-ROM disks will be obsolete within a decade.

Today's information usage has shown why computer-assisted consumption is superior. Reference materials (dictionaries, encyclopedias, directories, databases, and other collections) have proliferated simply because online access offers better methods of searching, indexing, clipping, and cross-referencing, a lesson well learned by Encyclopedia Britannica. The benefits afforded to consumers plainly favor online reference web sites and even CDs over printed books as a form of information usage.

A related topic is the role of librarians. No matter what the future distribution method is, librarians will still be needed to sort the wheat from the chaff. A good librarian is essential when trying to find a particular source of information. In fact, as more and more material becomes available electronically, the role of the librarian will increase, not decrease. All the Google programmers combined will never be able to replace a good reference librarian.

Genealogy reference librarians are valuable, and we have far too few of them. Those who are available have always been limited to helping only a few genealogists: those who walk in the front door of the library. Even worse, most expert genealogy librarians spend far too much time doing unrelated tasks, such as clearing paper jams in the photocopier or directing patrons to the nearest restroom. Wouldn't it be better to provide access to these experts in an online environment? Instead of having each person respond only to those who walk in the door, how about making it easy for the experts to help people across the country? How about across the world? Access to the librarian might be by e-mail, by keyboard instant messaging, or by some form of voice, such as a telephone. Indeed, if we could make hundreds of genealogy expert librarians available at once, each online "patron" could be electronically routed to the single librarian who is most expert in the particular geographic area and timeframe of interest.

The researcher in Oregon who is researching Vermont ancestry in the early 1800s could ask questions of the Vermont expert reference librarian. That expert, by the way, might live in Vermont or in Oklahoma or in some other distant location. The expert might be sitting in a physical library at the time, or they could be at home. Due to family commitments, he or she might be restricted to only working evenings and weekends or perhaps "mothers' hours." He or she might even be physically handicapped. Physical condition is not important in a virtual library; knowledge and the ability to work with people are the major requirements.

As we move further and further into the brave new world of online information anywhere and everywhere, "information workers" are becoming more valuable than ever. (Information workers are those who use large amounts of information in their jobs in order to make decisions or to share expertise.) Librarians, archivists, and other "information workers" have always been valuable to the average genealogist. I believe they will be equally valuable, or even more so, as we move on to high tech publishing with hundreds of thousands of genealogy books available at our fingertips. In fact, as the volume of available material increases, the need for expert librarians (information workers) will grow, not decrease.

Libraries will undergo radical changes in the next decade or two. Most libraries and librarians will survive and grow; a few will not. Indeed, some libraries may cease to exist as physical buildings. The genealogy reference librarians who are best able to adapt will find their services to be in greater demand than ever before. Library patrons and all other consumers certainly will benefit from increased access to knowledge at reduced prices. 

These are exciting times in which we live.

Comments


Great editorial! In the 60s and every decade since, we've been told that "soon" machines will be able to produce quality translations. That's still nonsense. Since at least the 80s we've been told that offices soon would be paperless. Nonsense. Then both schools and libraries were to be paperless/bookless. They aren't.

You mention production costs but not postage/distribution (perhaps meant to be included). A CD is cheaper to send than a book, a download even cheaper. You mention searching and cross-referencing but not updating, which is at least as great an advantage for reference works.

As usual, your timing is very good. Your prophecies are right on. And neither librarians nor teachers will be out of work.

I'm sure that I'm not alone in wanting to cozy up with a good history of the areas that I'm researching in the quest for ancestors. However, I, like many others, have been burned too many times by purchasing books that one "hopes" will contain information about the surname you are seeking only to find that there is no mention. Now you are stuck with a useless book that costs anywhere from $25.00 to maybe $60 or $70. Most publishers do not give you an index online, so you are shooting in the dark. I now have many hundreds of dollars invested in books that I cannot use. I have sworn off any more blind purchases of either books or CDs. Until publishers start giving you the benefit of a searchable index, I believe that they will lose customers who have been repeatedly disappointed.
Carly Henderson

"All the Google programmers combined will never be able to replace a good reference librarian."

Amen to that!

Keyword searching is a powerful, useful tool -- but you have to know where to search before you can take advantage of it. And unfortunately it seems patrons are less likely to ask for help in the electronic environment. Archivists and librarians are working out how we can best help patrons who are offsite. So please don't hesitate to use the "Live Help" buttons if and when you find them.

The LDS Church is working towards having that "Online Genealogy Reference Librarian". It will be part of the Research Forums on "new FamilySearch". Sure are exciting times ahead.

Electronic books are more difficult to highlight, correct, annotate, and bookmark to read later. I have a few reference books on my PDA but still prefer to use a printed book. I also find it easier to locate what I want in a printed book. And it is easier to pick up a pencil and make notations than to electronically highlight the passage. I think electronic books have a way to go before they will replace printed ones.

If you want a book that will last you for years and that your kids will be able to use when they get older - printed is the way to go. So far, it's the only thing that will be useable in the future. Electronic is okay for books that get updated frequently like encyclopedias.

Dick,

One huge advantage of digitizing that I feel you missed in your article is OCR (Optical Character Recognition) software. Just as you state about other technologies, OCR likewise continues to improve in quality and decrease in price at a rapid rate.

For those who are unfamiliar with this technology, OCR effectively recognizes numbers and letters while scanning. The scanned documents can then be searched just like the internet for surnames, place names, dates, etc. I specifically reference EBSCO's Newspaper Archive Elite and HeritageQuest's PERSI databases as excellent examples of using OCR in conjunction with digitized media. Additionally, Dick has previously mentioned the Google Book Search website http://books.google.com and does so again in this article. That site is another fine example of using OCR to produce digitized images that are also searchable.

> I also find it easier to locate what I want in a printed book.

I completely disagree. In PDFs, the text is searchable and you can highly customize bookmarks. Additionally, a high resolution PDF allows one to zoom in on a document, such as a gazetteer, to a level that would rarely be possible with the original.

We are getting used to seeing records added to the web and are expecting the web to be even better in the future.

With a book, the information stays on the shelf from year to year or may be accessible in some form in different locations.

The thing that scares me about our reliance on databases and websites is the future...
Cities, states, businesses and people are constantly having budgetary problems and change is normal. Profit is the medium businesses operate on. What happens if a business goes out of business or is sold to another company (a la Everton Publishers) that does not have the same vision as the first?

At the turn of the millennium things were really looking up for genealogists. All kinds of records were being posted online. Birth, marriage and death databases were being added frequently. Then someone yelled PRIVACY. Poof! There went those databases. Now states are trying to prevent even the people whose records they are from accessing them.

While the present and near future look great, I fear a good(?) economic depression or privacy scare could wipe all that out. While databases are fabulous, I feel safer if I know the same data is also available in a paper format on an accessible bookshelf.

And to think, I consider myself an optimist?
MIC

Even people like Winston Smith who are employed maintaining databases understand that unless a record is fixed, and not subject to continual tinkering, there can be no real history.

I much prefer books. I love to use them and prefer them over PDF versions of books. But it's awful nice to have books, like those from www.archivecdbooks.com on CD, with me when I am in a library doing research. So, I guess I'd take the best of both worlds.

But nothing, absolutely nothing, beats the book's ability to survive 500 years of changes in technology and still be plucked from a shelf and read. We can't do that with computer documents that are only 25 years old.

To chime in regarding OCR - I have seen far too many OCR projects where the type was too faded or ornate for the software to decipher and so it guessed! Strange results sometimes. OCR has a ways to go before it is the product it is touted to be. As a genealogy librarian I love both formats. If I had to choose, however, I would still choose print. The format is still the most stable and reliable. Electronic materials are only available when you have a power source.

As to OCR: Please keep in mind that most of this entire article talks about NEW books. OCR is not an issue with new books.

Older books bring an entirely different set of circumstances to deal with.

- Dick Eastman

Although old books may be a different topic, they can still be helpful. As Heather said, there are significant problems with OCR'ing old books. One plus is that OCR'd text is searchable. The main problem is that the OCR process is, at BEST, only 99% accurate. Project Gutenberg (www.gutenberg.org) posts out-of-copyright books (pre-1923). In order to increase the accuracy, volunteer proofreaders triple check the copy so that the text matches the original. Books from many places, including some from the French Bibliothèque nationale and Google, have been OCR'd, proofread and posted. I personally have proofread a couple of books in French about French postal regulations which were probably requested by someone interested in philately. If someone knows of an old, pre-1923 text that has been scanned but needs to be OCR'd, or even one that hasn't yet been scanned, I can try to find a project manager so we can get it online for all of us to use.
Vic

I still much prefer to reach for a book in my personal library, over reading the same thing on the monitor. Thanks, but I'll stick with Gutenberg's printing press for anything I want to have true archival quality!

Dick,

Slightly off topic, but, just out of curiosity, what are "mothers' hours"?

On topic -- As for books vs. digital text, I subscribed to Britannica online before Wikipedia was started. Digitizing is definitely the way to go for current reference books. Budgetary problems are a matter of prioritizing governmental expenditures. Some jurisdictions are much better at it: Virginia's Prince William County vs. Rappahannock County come to mind instantly. The former has all kinds of information online and available by ILL. The latter has done zilch.

Mother of 3

Are you aware of the new Sony Reader? It overcomes many of the objections mentioned above and makes eBooks much more desirable. I know of at least one other similar product due to be on the market this summer. I'm sure the price will come down quickly. Take a look at http://www.learningcenter.sony.us/assets/pa/prs/index.html.
