I prepared this forecast for the winter issue of Slice, the newsletter of Potomac Indexing with a view to sharing it with the readers of ForeWord Magazine’’’s blog site. It was of interest to me that one way of looking at the cultural transformation of our industry, is the role of indexing, classifying, and keywording in the Web 2.0 world of today.
As we begin a new year, and the dust has settled on America’’’s watershed election, those of us who have been around awhile can look back with wonder at the remarkable transformations that have taken place in our society as a whole as well as in the publishing industry. We can also look forward with curiosity, if not some concern, at what the future portends. What is the state of our industry, and where is it going?
For indexers, the past, present and future of publishing as an information art and science should hold some comfort and predictability. It could not have escaped your attention that as digital search techniques and venues have expanded, the fundamentals of indexing have increasingly become the touchstones to information access and transfer. Knowing about keywords is not enough – one needs to intuit which words people will select in their searches, and how to incorporate the most effective selection in titles, captions, tables and text. One needs an organizing taxonomy to aggregate content for specific purposes and contexts, to provide a checklist of what has been searched and what has not been searched.
Recently, Google announced its new mobile device’’’s Barcode Scanner application that would enable someone to scan the barcode on a book and call up the work on Google’’’s Book Search, where the full power of keyword search would be available. ““Why would someone want to do that if they are holding the book in hand and can use the index to search the book?”” some have asked. Well, not all books have indexes, and no index, however complete, can reference every keyword on every page. And, ““find”” on the computer is a faster trip to a reference than is flipping to the page and looking for the right sentence in which the reference is lodged.
The concept of ““the wisdom of crowds”” is a major recent insight into mining true contextual keywording by drawing on the actual phrases people use when searching for and identifying information and concepts. Wikipedia is founded on this premise – that with enough people commenting on and contributing to a citation, with accountable mediation, the closest thing to an accurate definition can be arrived at. The now-common practice of blogs and other social networking sites to aggregate the keyword searches that visitors have used for an article, and to rank them by type size so that the most frequently used tags are highlighted, provides indexing cues as to how the ““crowd”” thinks of accessing a subject or theme.
So, what does this mean to the book industry? All of its new directions are based on technologies that are driven by algorithms that can distinguish the structure as well as the definitions of content based on these intelligent search insights. These insights are what make possible all of the developments I will be discussing.
The book, whether in its printed or electronic form, endures as a container that captures narrative, imagination, instruction, compilation – sacred or profane – in words or in pictures and even with embedded sound and animation. So, it would be fair to say in any review of book publishing industry trends that while its content formats and delivery systems are being radically transformed by disruptive technologies, we should not be thrown off balance. The fundamental editorial purpose of the book remains, whatever its form – and authors, artists, editors and publishers (or whatever they might be called – a rose is a rose) will continue to provide and shape that which everyone is busily indexing and keywording in this new age of search. The problem is how to make a living at it.
By the numbers
The Book Industry Study Group describes a $37.3 billion industry in 2007, which moved 3.13 billion net units (books) into the marketplace. Bowker reported more than 400,000 new ISBNs issued to over 80,000 self-identified publishers for the year – clearly a reflection of the explosive growth of independent, self-issued and on-demand publishing in the last five years. Ten years ago, new title output was around 60,000.
Growth in sales volume over the next five years is estimated at an average of around 2.5-3 percent per year – but unit sales will remain practically flat at 0.3 percent per year growth. This data only marginally reflects electronic publishing revenues – especially in the reference market – and only portions of the more than $1 billion audio book market. This year, the BISG has embarked on a major revision of its data reporting to take into account the new media and distribution channels that have emerged in the past ten years and whose dynamics are not measured by traditional unit print sales metrics.
While conventional publishing is projected to remain static or decline as a proportion of our population and gross national product growth each year, the internal dynamics of the industry are bubbling with possibility and invention – and if one looks at cyberspace, the Internet and Web 2.0 social networking as the new marketplace for ideas, information and stories, there are horizons out there yet to be measured.
So, here are ten of the most significant events and trends on the horizon:
1) Self-publishing and print on demand (POD) services, made possible by file-transfer technologies and such online publishers as Lulu, Blurb, Author House, iUniverse and xLibris, and POD services such as Lightning Source and Book Surge.
2) Reinvention of the bookstore through on-site book production and sales in book stores and libraries – one such mechanism is the Expresso Machine, now being installed in bookstores, libraries and airports, which enables paperback titles to be printed to order in minutes. The ABA has recently launched a program under which a bookseller can publish and sell on demand classic, out-of-print books.
3) The Book Rights Center, an ASCAP-like agency resulting from the landmark Google Book Search settlement, to be operated by the AAP and the Authors Guild as a clearing house for payments to publishers and authors by Google and others who are providing search access to copyrighted books. This agreement has also set the pattern for electronic rights business models.
4) Book Industry Study Group’’’s active role in developing and facilitating adoption of data management and transmittal protocols. Examples are the new Start with XML initiative to redefine publishing work-flow standards from a print-based to a digitally based platform, and its new Product Data and Product Label certification program for qualifying publishers.
5) Integration of conventional and electronic distribution of inventoried books with print on demand and electronic format services. The latest such initiative is by Perseus, the largest U.S. independent distributor to provide a digital asset management and POD distribution service. The University of Chicago has operated Bibliovault, a similar service for university presses, for years.
6) Cloud computing, in which data (in the case of publishing, content files) and software applications are stored and provided as an outsourced service through global data storage servers operated by aggregators such as Google, Amazon, and Microsoft, as well as by smaller targeted outsourcers.
7) E-book readers, such as Amazon’’’s Kindle, whose e-ink technology and wireless downloads have advanced from a limited special interest device to a more popular mobile device. Sony and other European firms are building markets for dedicated e-book readers as well.
8) Website widgets, enabling owners of content to plant a window with access to their intellectual property that they can control, on any web site that will accept it.
9) Web 2.0 and social networking tools that create content through feedback to authors and publishers, by anthologizing, customizing, and mashups.
10) Simultaneous multimedia publication of books in print formats, e-book, and audio form, with free online content (chapters or whole books) as promotional tools for sale of print products.
The aforementioned trends have yet to make a significant dent in anyone’’’s revenue or profit margin base. According to industry watchers and early adopters, the future revenue-generating power of these innovations in lies in the hands of the younger Internet generation for whom all forms of information access are equal as long as they are immediately at hand.
Of this I am certain: there is an exciting and creative future ahead.
Posted by: Eugene Schwartz, Editor-at-Large