Digital Document Quarterly

Perspectives on Trustworthy Information

Volume 4, Number 2, 2Q2004

 

 

DDQ Home

Citations

Glossary

 

HMG Consulting

Saratoga, CA 95070

©  2005, H.M. Gladney

 

ISSN: 1547-8610

 

Digital Preservation

Library Institutions vs. Preservation Technology?

“[T]echnology is rather easy.  Or more exactly, technology is the branch of human experience that people can learn with predictable results.  … a good many Englishmen have been skilled in mechanical crafts for half-a-dozen generations.  Some­how we've made ourselves believe that the whole of technology was a more or less incommunicable art.  It's true enough, we start with a certain advantage.  Not so much because of tradition, I think, as because all our children play with mechanical toys.  They are picking up pieces of applied science before they can read.”                                                                 C.P. Snow, 1959 [1]

The final sentence of a Library of Congress 2004 press release—“Librarian protects digital artifacts”—startled me: “As [Laura] Campbell often reminds herself and others, it is not technology that preserves important cultural and historical works. ‘Institutions preserve,’ she said.”

Institutions without technology?  And what about people?  Why not write, “Technology preserves” or “Money preserves” (referring to the $100M expenditure that Campbell is managing for the Library of Congress, with little evidence that her staff has examined what preservation technology offers)?

The rhetoric of “Institutions preserve” invites consideration of whether institutions like the big research libraries are, in fact, essential for digital preservation.[2] 

Consider the issues.  Who should select?  Who can package for preservation?  How will people find information?  What people want to preserve will include many more objects than traditional libraries can handle, and include works outside traditional library scopes.  Self-achiving  has already begun on a significant scale.[3]  Tools to make packaging and durable descriptions[4] easy and convenient are likely to be much improved from what is already available.[5]  As with other information services, skills previously found only among trained librarians will be acquired by many computer users.[6]  An illustration of what is happening is provided by:

“A common set of file formats has the potential to be the most meaningful advancement for free software on the desktop.  

Desktop integration begins with documents, not with any toolkit or bundle of applications.  If files can be read and written by every application, users can communicate, work together and become integrated.    The significance of [the OASIS effort] to promote a file format … cannot be [over]estimated.  Free as in free formats is even more important than free software.  Only with them and the internal structuring that comes from XML can data be exchanged, with new or different programs without any need for converters, or be directly edited, indexed, analyzed and exchanged between heterogeneous groups or servers—like Web services without the hype.  Data will start belonging exclusively to end users.”          Fioretti, 2004 [7]

How can bitstrings be made safe against loss?  A modest extension of existing data grid tools and protocols would suffice.[8]  For instance, the rules portion of LOCKSS could be extended to seek accepting storage sites for replicas of documents for which preservation is wanted and to ensure that enough copies exist.  Such mechanism can be devised to be insensitive to the demise of the originating computing environment by mimicking the propagation of micro-organisms.[9]  The copies would survive as long as the computing and network infrastructure survived.[10]  Their durability would be at least as good as can be provided by any institution of the kind that Campbell has in mind—probably better.

The first imperative of any institution is its own continued existence, and this is closely followed by its urge to increase its influence.  The press release can be seen as part of a rearguard action to protect the jobs of research librarians.[11]  If this is, in fact, part of what the Library of Congress and its NDIIPP (National Digital Information Infrastructure Preservation Program) partners have in mind, their ends would be better served by thinking through what unique social values they can offer.[12]  Only if they do this effectively will their roles in the digital world be as important as it already is in traditional libraries.[13]  The rhetoric “Institutions preserve” lulls and leads in a futile direction.

Preserving Dynamic Digital Objects

The Digital Curation Center (DCC) is preparing a “best practices” manual for which it is inviting experts to contribute individual fascicles.  The editors asked me to write Preserving Dynamic Digital Objects, whose abstract reads:

“Most articles about digital preservation come from the cultural heritage community.  The needs they express will expand to those of businesses wanting safeguards against diverse frauds, attorneys arguing cases based on the probative value of digital documents, and our personal medical records.  The U.S. NDIIPP  expresses urgency for preserving authentic digital works.  We know how to accomplish this reliably for every kind of information, with packaging that will seem convenient to all kinds of user.

“Among the digital record types that we might want to preserve, “dynamic records” are thought to … pose unique challenges.  The nature of [expressed] concerns suggests review of what it means for a record to be dynamic …  Because confusion about ‘dynamic’ occurs even among preservation experts, an exacting analysis is merited.  We provide this by using methodology [based on] early 20th century philosophy.

“For a work to be eligible for copyright protection, it must be fixed—written in a stable representation.  Whenever a computing system saves a record copy, or shares one with a remote user, this is a stable version.  Since dynamic digital objects introduce no technical problem beyond those already solved for stable documents, it is unnecessary to prescribe any new best practices for preserving them.”
                                                                                                                                                                  Gladney, 2005

Choosing Digital Repository Software

As part of volunteer work for a local museum, I have been looking into how that institution might choose software for managing and sharing history of computing materials—including obsolete software that it would like to offer in executable versions.  Early thinking suggests that making the best choice might not be easy, because there are about 80 open-source offerings[14] and about 20 commercial offerings worthy of consideration.  Perhaps this embarras de richesse is best handled by an orderly approach, starting with a pedantically careful evaluation of requirements. 

To that end, I inspected several Web-accessible analyses: a Canadian Heritage Collections Management Software … Criteria list,[15] a British archive requirements analysis that has been refined by many participants,[16] a ten-year old IBM product development analysis,[17] and a German prescription for criteria development. 14  A starting assumption was that each of these would contain many line item requirements worthy of consideration by any cyber-museum, but still not be completely satisfactory.  In retrospect, the assumption seems valid.  The Canadian document seems to be derived from considerations for traditional museums[18] that are not extended sufficiently to cyber-museums.  The British document is more current, but makes extensive use of statements whose satisfaction cannot be objectively tested.  The IBM document is out of date and was not published.  The German document is short on criteria, but does prescribe a useful scheme for criteria development (Figure 1). 

Figure 1: Development of the evaluation scheme and roles in the decision process (translation of Borghoff Abbildung 1)

I also inspected the British Standard Proce­dures for Collections Recording Used in Museums.[19]  Although this is oriented toward traditional museums, it is a good reference for metadata that might accompany any kind of holding.  In contrast, the Model Requirements for the Management of Electronic Records[20] proves to be not applicable, as it deals primarily with operational data management of business records rather than the much less dynamic management of archival information.

Drawing on all these sources, I have drafted a new requirements analysis for museums working to extend their offerings to digital holdings.  This focuses on repository functionality, leaving to other efforts requirements for packaging individual digital holdings for preservation, integration of digital holdings to be interesting and informative extensions of traditional museum artifacts, organization of exhibits for different classes of museum visitor, and presentation on the World Wide Web. 

This draft is available to a limited number of qualified reviewers.  If you are interested, please send me an e-mail indicating what use you would make of it assuming that it proves to be useful for your institution.

Making Repository Software Replaceable

The recently approved Content Repository API for Java (JSR 170) is a standard interface whereby applications can retrieve from and manage a digital library.[21]  Its authors are from commercial enterprises and consortia: the Apache Software Foundation, Day Software, Fujitsu, IBM, SAP, Microsoft, BEA, Documentum, SUN, Novell, Macromedia, etc.

"As the number of ... proprietary content repositories has increased, the need for a common programmatic interface … has become apparent.  The aim of the Java Content Repository (JCR) API specification is to provide such an interface, [thereby laying] the foundations for a true industry-wide content infrastructure.

"Application developers and custom solution integrators will be able to avoid the costs associated with learning the particular API of each repository vendor.  Instead, programmers will be able to develop content-based application logic independently of the underlying repository architecture or physical storage.

"Customers will also benefit by being able to exchange their underlying repositories without touching any of the applications built on top of them."                                                                                 Content Mgmt. API for Java Spec., §2.1

The specification lists its goals as:

Ø     Not bound to any particular underlying architecture, data source, or protocol.[22]

Ø     Easy to use from the programmer’s point of view, representing the core functionality of a content repository without venturing into “content applications”.

Ø     Easy implementation on top of as wide a variety of existing content repositories as possible.[23]

Ø     Also standardizing some complex functionality needed by advanced content-related applications.

To mitigate tension between the last two goals, JSR 170 specifies two compliance levels.  Level 1 defines read-only functionality: reading repository content, inspectng of content-type definitions, supporting namespaces, content export to XML, and searching.  Level 2 adds methods for writing content, content-type assignment, and content import from XML.  Finally, the specificatiion defines as optional interfaces for atomic transactions and locking, versioning, access control, and some search extensions.

Careful reading of the JSR 170 specification leaves me impressed and confident that implementations will achieve the stated goals, perhaps with modest extensions that implementers discover to be desirable.  Since it is new, not many supporting offerings are available yet.  Perhaps JSR 170 support be regarded as a sine qua non requirement for any institution’s future acquisiton of repository software, even if this somewhat delays digital library implementation.[24]

Epistemology

“The correct method in philosophy would really be … to say nothing except what can be said, i.e. propositions of natural science … and then, whenever someone else wanted to say something metaphysical, to demonstrate to him that he had failed to give a meaning to certain signs in his propositions.”                                                                                          Wittgenstein, Tractatus 6.53

A friend recently asked what was meant by ‘epistemology’.  He was pleased to have both the definition from the Concise Oxford English Dictionary and my own construction.  Both follow.[25]

epistemology

theory of knowledge, especially with regard to its methods, validity, and scope. (Concise OED).

branch of philosophy that deals with the origin, nature, methods and limits of human knowledge; the branch of philosophy dealing with theory of knowledge (in contrast to belief or opinion).  It analyzes the possibilities and limitations of answers to, “What do people know?  What can be known?” and “What can people communicate?  How can they minimize the occurrence of misunderstandings when they communicate?”

Most philosophy belongs to one of four branches: epistemology (about knowledge), metaphysics and religion (about beliefs), aesthetics (about beauty and taste), and ethics (about correct behavior).

The Word ‘Dialectic’

I have long been puzzled about what ‘dialectic’ means, and so am happy to recommend analysis beginning:

“The term ‘dialectic’ is almost as old as the practice of philosophy.  Like many other labels of great antiquity, it has been used as a tag for concepts, activities, and situations of the most heterogeneous variety.  Few philos­ophers have ever employed the term in the same sense as any of their pred­ecessors. Indeed, rarely is it the case that any philosopher has consistently adhered to any one meaning in his writings.  What the dialectic is, there­fore, can no more be adequately treated short of a history of its definitions in use from Plato to the present than we could straightway say what the empirical, the reasonable, the sensible, the romantic, and similar terms mean in the history of philosophy.  

“There are two generic conceptions of dialectic under which the vari­ous meanings … may be subsumed. The first is the conception of dialectic as a pattern of existential change either in nature or society or man where the ‘or’ is not exclusive.  The second is the view that dialectic is a special method of analyzing such change.  Usually, but not always, it is held that the method of dialectical analysis in some sense ‘reflects’ or ‘corresponds to’ the dialectical pattern of change.  In any case, there is always a distinction drawn, though with no great regard for consistency, between the dialectical type of change and other kinds.”
                                                                                                                                                      
Sidney Hook, 1951 [26]

The Word ‘Scientific’

“If it’s got ‘Science’ in its name, it ain’t one.”               O ften heard in the halls of a computer science laboratory

Working as a computer scientist, I am still happy to concede that ‘Computer Science’ is less a science than an engineering discipline.  However, many physicists, chemists, and biologists are annoyed with the extension of ‘science’ to other disciplines, especially those that are not careful with what they think of as ‘scientific method’ or with empirical observation.  That such irritation is far from new is illustrated by:

“The great Poincaré once remarked that while physicists had a subject ­matter, sociologists were engaged almost entirely in considering their meth­ods.  Allowing for the inevitable divergence between the sober facts and heightened Gallic wit, there is still in this remark a just rebuke (from one who had a right to deliver it) to those romantic souls who cherish the per­sistent illusion that by some new trick of method the social sciences can readily be put on a par with the physical sciences with regard to definiteness and universal demonstrability.  The maximum logical accuracy can be at­tained only by recognizing the exact degree of probability that our subject­ matter will allow.”                                                               Cohen, 1931 [27]

“To a very great extent the term ’science' is reserved for fields that do progress in obvious ways.  Nowhere does this show more clearly than in the recurrent debates about whether one or another of the contemporary social sciences is really a science.  These debates have parallels in the pre-paradigm periods of fields that are today unhesitatingly labeled science.  Their osten­sible issue throughout is a definition of that vexing term.  Men argue that psychology, for example, is a science because it possesses such and such characteristics.  Others counter that those characteristics are either unnecessary or not sufficient to make a field â science.  Often great energy is invested, great pas­sion aroused, and the outsider is at a loss to know why.”                                                                                                                                               Kuhn, 1962 [28]

Purists are likely to point out that, until the mid 19th century, physics and chemistry were not called science, but rather “natural philosophy”, and that ‘science’ originates from the Latin verb scire—to know.

Russell's Paradox Revisted

Since DDQ 2(1) commented on Russell's Paradox, further reading has much increased my estimate of its significance as a stimulus for philosophers’ care with language.  The paradox arises from “Is the set of all sets that do not contain themselves a member of itself?”  The difficulty—that “the set of all sets that do not contain themselves” is nonsensical, i.e., does not denote a mathematical or empirical entity—came as a shock to logicians a century ago.[29]

“Russell's paradox created an air of despair in the logicists' camp, from which Frege never recovered.  The traditional notion of a set, of the one-one correlation between the intension and extension of a coherent predicate, is central …, and this paradox yanked out the rug from under it.  ‘Without a single object to represent an extension,’ says Russell, ‘mathematics crum­bles.’  Frege admitted that … math­ematics can be reduced to logic only if ‘set' is a logical notion; Russell has inad­vertently proven that it is not.  Frege sank into despair, and gave up on his lifelong project.”                          Sullivan, 2003 [30]

Ordinary language allows phrases that are meaningless.  Frege was aware of this hazard

“I see the greatest difficulty for philosophy: the instrument … for its work, namely ordinary language, is little suited to the purpose, for its formation was governed by requirements wholly differ­ent from those of philosophy.  So also logic is first of all obliged to fashion a usable instrument from those already to hand.  And for this purpose it initially finds but little in the way of usable instruments available.”                          Frege, 1918 [31]

Twelve years earlier, he had already written:

“… only true thoughts are admissible premises of inferences.  It isn't strictly sentences, it is thoughts which have contradictory counterparts.  If you always bear this in mind, that we cannot legitimately infer from sentences, but only from true thoughts, and that proper names and concept-words must be meaningful, then … contradiction … cannot obtain.  before we go on to proofs at all, we must have assured ourselves that the proper names and concept-words we employ are admissible. …  Only if someone makes a proper name from the corresponding concept-word by means of the definite article or demonstrative, does he fall into error.   ‘the set of all sets which do not contain themselves as elements' is not a concept-word, it is a proper name, and it can only be a question of whether this proper name is meaningful.    We may distinguish two different ways of using the word ‘set', going with two different conceptions, which are probably most plainly identified by the words ‘aggregate' and ‘extension of a concept'.  But frequently these conceptions do not occur in their pure form, but mixed together and this makes for unclarity.  The aggregative conception is the first to offer itself, but the requirements of mathematics pull towards the opposite side, and so confusions easily arise.” Frege, 1906 [32]

That a phrase or sentence might be nonsensical should not surprise us.  ‘The flight of an ostrich’ is linguistically nonsensical because no bird we call ‘an ostrich’ can fly.  ‘The flight of a 2-ton bird’ is empirically nonsensical because no known material is both strong and light enough for the bones of such a bird.  Although Russell’s paradox might not be easily understood (perhaps because it touches on unbounded sets), that other mathematical phrases might be nonsensical should be obvious to anyone who thinks about “the integer that is greater than 2 and less than 3”.

The drawings of M.C. Escher and his imitators[33] illustrate that graphic pictures can be as misleading as word pictures.  See, for instance, Escher's Waterfall.

News Reports

Copyright Class Action Suits: Use Beyond Specific Permissions

On June 14, the Toronto Globe and Mail discussed this topic and published a full page advertisement seeking class members for a lawsuit (one among several) in which a settlement is in the offing.  The case it described has to do with onward use of authors’ material; having been given permission to use material in print, the defendants then offered the same material electronically without specific permission. 

Sniffing Out Source Code

Stimulated partly by the lawsuit filed by SCO Group against IBM, developers’ tools for detecting likely violations of open source software licences are emerging.  These work by comparing source code with libraries of open source offerings.  Information Week provides a summary.

Right-click here to download pictures. To help protect your privacy, Outlook prevented automatic download of this picture from the Internet.Capacity of and Demand for Persistent Storage

Hitachi Global Storage Technologies[34] demonstrated 230 gigabits per square inch using perpendicular magnetic recording.  Hitachi expects to ship products with this density in 2007—enabling as much as 20 gigabytes on a one-inch hard drive and one terabyte on a 3.5-inch hard drive.[35]

Coughlin’s 2005 Entertainment Content Creation and Digital Storage Report analyzes requirements and trends in worldwide data storage for entertainment content acquisition; editing; archiving and digital preservation; as well as digital cinema, broadcast, satellite, cable, and voice-over-digital distribution.  The report predicts over 300% annual growth in digital storage demand driven by higher resolution content as well as archiving and digital preservation.  The forecasts include that:

·       1,600 Terabytes may be required for a digital movie at 4K resolution.

·       Optical media, flash memory, and hard disk drives will displace video tape in digital cameras.

·       90% of the total capacity is now used for content archiving.  That this will decline to 49% in 2010.

·       Currently 60% of the media shipped for entertainment content segments is tape with 40% optical, 0.4% hard disk drives, and a small amount of flash memory.

On a related topic, see a review of second-generation SATA HDDs.

IEEE Documents Searchable Through Yahoo! Search Beta

Last week, Yahoo! Inc. rolled out the beta launch of Yahoo! Search Subscriptions, a service that enables users to search multiple online deep web subscription content sources and the Web at the same time. This deep web subscription content includes for-pay news and reference Web sites.  IEEE is one of seven initial publishers whose content may be located through this service, which is currently available in the U.S. and the UK.  For more information, visit: http://www.ieee.org/portal/pages/newsinfo/yahoo.html.

Doubtful Patents a Political Issue

Software and business process patents have overwhelmed the U.S. Patent and Trademark Office (USPTO).  Many are of dubious quality, failing (in my opinion) to satisfy the legal criterion “not obvious to someone versed in the state of the art.”

The patent review process relies on the examiner finding prior art that would invalidate a claim, but the USPTO seems to be stretched too thinly for careful searching.  Its workload has doubled over the past decade to 355,400 patents last year, but resources have not kept pace.  Examiners often rely on previous patents in their search, which is a limiting factor, given that the field is relatively new.  According to USPTO's Brigid Quinn, patent examiners spend about 30 hours researching each application.  With about 25 claims per application, they need to make 7.5 million decisions annually![36]

Among the more egregious patents issued are one for deleting "undesired data" from a computer, one for matching professionals to email addresses and Web subdomains, and one allowing a Website company to charge others for browser cookie storing data structures.

This problem is hardly new.  A central claim of what might be the earliest software patent, Prater-Wei[37], is in fact matrix inversion.  However, the description in the patent application used linear algebra notation similar to that of textbooks published late in the 19th century, rather than more obvious matrix notation that was invented much later.  The patent examiner might be excused for being duped.

Reading Recommendations

This DDQ number recommends one epistemological work and three books about American physics and engineering during World War II.  The first will not appeal to all DDQ readers; however, it is among the most important works of the early 20th century and will amply reward anyone brave enough to take it up.  The latter three can be understood by any educated person—even scientific laymen.  They present human sides of military technology and will inform their readers for participation in public policy issues.

Rudolf Carnap’s The Logical Structure of the World[38]

For a recommendation of this classic, DDQ can rely on one of Carnap’s Vienna Circle colleagues:

“A magnificent attempt at such construction of con­cepts was made by Carnap in his book Der logische Aufbau der Welt, 1928.  His ‘constitution-system’, how­ever, claims to be no more than a sketch, not a final sys­tem; it is intended to serve only as an illustration of the task of such a systematic construction.  Above all it is intended to show the method to be employed for such a construction and ‘to prove that it is in principle pos­sible to construct a unitary system of all scientific objects (concepts).’  That which the axiomatic method achieved in such brilliant style for the propositions of individual disciplines, viz. their logical deduction and thus reduction to their logical foundations, Carnap has attempted to do for the concepts, specifically the principal classes of concepts, of the whole of science.”                                                                  Kraft, 1950 [39]

Carnap starts pragmatically:

The word ‘object’ is here always used in its widest sense, namely, for anything about which a statement can be made.  Thus, among objects we count not only things, but also properties and classes, relations in extension and intension, states and events, what is actual as well as what is not.” Carnap, §1 38

It continues by grounding a few objective definitions in ostensive use of relations and teaches how to construct articulations of more complex objects.

Jennet Conant’s Tuxedo Park

"Alfred Loomis, who lived in [the exclusive] gated Tuxedo Park, hated FDR, rarely communicated with his wife and three sons, stole his best friend's wife, and with icy disdain helped drive an aide to take his own life.  Yet the Allies may not have won World War II without this man whom history forgot.  As Jennet Conant's heart­thumping book recounts, Loomis was a public-spirited citizen with the brilliance and ability to galvanize the scientific community to [develop] radar to [help protect] London from bombs and to [find] U-boats, and later contributed to … the atom bomb." Ken Auletta, 2005 [40]

Jennet Conant’s 109 East Palace

Dorothy McKibben, Los Alamos’ gatekeeper for twenty years from its 1943 beginning, kept notes in anticipation of writing memoires.  Conant used these to write the book that McKibben never completed.

“Stan Ulam, who had watched Oppenheimer [organize the Los Alamos] lab from top to bottom, marveled at … the ‘American talent for coop­eration’ and how it contrasted with what he had experienced in Europe:

People here were willing to assume minor roles for the sake of contributing to a common enterprise.  This spirit of teamwork must have been characteristic of life in the nineteenth century and was what made great industrial empires possible.  One of its humor­ous side effects in Los Alamos was a fascination with organizational charts.  At meetings, theoretical talks were interesting enough, but whenever an organizational chart was displayed, I could feel the whole audience come to life with pleasure at seeing something concrete and definite ("Who is responsible to whom," etc.).

“[E]qually striking … was the conviviality, not only among the physicists, … who dif­fered greatly in temperament, but also among mathematicians, chemists, and engineers. ‘People visited each other constantly at all hours after work,’ he wrote.  ‘They considered not only the main problem—the con­struction of the atomic bomb and related physical questions about phe­nomena that would attend the explosion, … but also general questions about the nature of physics, … the impact of nuclear experiments on technology … and its influence on … theory.’  Beyond this, there were wide-ranging discussions of the philosophy of science, and of course the world situation, from daily progress on the war fronts to the prospects of victory in the months to come.  ‘The intellectual quality of so many interesting persons and their being constantly together was unique [in] the entire history of science.’”                           Conant, 2005 [41]

Richard Garwin and Georges Charpak, Megatons and Megawatts

"Two of the world's leading experts on nuclear power, weapons, and policy choices have pooled their formidable talents in an authoritative and readable analysis of these important issues. A timely book that will be of great value to anyone seeking a better understanding of what is at stake in the current nuclear policy debates on arms control, reactors for civilian power, and ballistic missile defense."
                                                                                                                                                                   Sidney
Drell [42]

Amusements

Cryptic Code

What was the shortest coded message announcing a major historical event.  Arguably, it was Charles Napier’s 1843 telegram to the British Government on conquering Sind (part of what is now Pakistan).  The entire text was, “Peccavi”—Latin for “I have sinned!”

Words At Play

Letters come to life at this playful and subtle advertisement, which celebrates the power of words and the highest in typographical ideals.  With single Open Type fonts as its only descriptive medium, Words at Play has created 21 portraits of literary legends using only the letters in each author's name. Each portrait is accompanied by one of the author's quotations that speaks of the power of words, inside of which is another hidden typographical treat. 

Practical Matters

Mozilla Firefox (my favorite browser) doesn't come with RSS[43] integrated, but it is easy to add with the Sage extension.  You can search for threads that interest you or click a magnifying glass whenever you're on a site to see if it offers RSS.  The program avoids much of the fat of competitors.

Sandboxie isolates and quarantines history, cookies, cached contents, and parastiic software into a "sandbox" during Web surfing.  Deletion of the sandbox removes the undesirable residue.  Compatible with Windows XP and 2000, it is a free download.

SpyWareGuide and Kephyr list known spyware.  You might want to use one of these before downloading a program that seems appealing, but also suspicious.

CiteULike is a free service to help academics to share, store, and organize on-line readings.  Someone who encounters interesting work on the web can click one button and have it added to a personal library.[44]  CiteULike automatically extracts the citation details, so there's no need to type them.

KNOPPIX® is a Linux version and utility software collection packaged on a bootable CD, automatic hardware detection, and support for many graphics cards, sound cards, SCSI and USB devices and other peripherals.  It can be used as a Linux demo, educational CD, rescue system, or adapted for use as a platform for software product demos.  It is not necessary to install anything on a hard disk.  With on-the-fly decompression provided, the CD can have up to 2 GB of executable software installed on it.

LogMeIn enables using a remote networked PC from any networked PC by way of a browser interface.  It traverses firewalls to allow login to the remote PC (which must, of course, be running) and includes utilities for synchronizing and/or copying files between the two machines, transmitting alerts from the remote to the local PC, and permits all management tasks on the remote machine.

BusinessWeek® Courses: Setting up a Home Office

BusinessWeek Online Courses offers subscribers free instructor-led courses for computing to support working from home.  For instance, in May it offered "Designing and Equipping Your Home Office" in six lessons with techniques, tips, and strategies for:

-   Determining basic home office requirements and designing the office;

-   Making choices of equipment and supplies;

-   Options for world-wide connectivity via phone, e-mail, fax, mail, and the web;

-   Home office financial and tax implications; and

-   Strategies for storing and managing files, records, and office supplies.

Other BW free instructor-led courses include:

-   “The Wireless Mobile Office

-   “Setting Up a Wireless Home Network”

-   “Writing a High-Impact Business Plan”

Memory evolution (especially DDR2)

Extreme Tech provides an excellent synopsis of the current round of improvements in PC memory.  This article ends with good advice about system upgrades.

Building a Performance PC

The June 2005 number of Extreme Tech recommends components for a build-your-own leading edge PC that is not also a “bleeding edge” configuration.  This is for a general-purpose machine that can handle almost any task—typical office applications, downloading music and video, feeding a music player, burning DVDs, video editing, playing games, watching movies, editing photos, and so on.  This is a starting point—a set of recommended components for an all-around PC.

PC Component Price Watch[45]

Notebook PC

Presario m2007us, Celeron® M 1.5 Ghz, 256 Mb, 15” display

$482.

each

USB 2.0 Drive

512 Mbyte

$22.

each

CD/DVD ROM

16x DVD

$22.

each

DVD writer

16x Dual ±R/±RW Double Layer

$43.

each

Hard disk drive

160 Gb ATA (parallel)

$40.
$0.25

each
per Gbyte

Hard disk drive

Western Digital 250 Gb SATA (serial)

$120.
$0.48

each
per Gbyte

PC3200 DDR memory

512 Mbyte

$48.

per Gbyte

CRT display

Envision 17”

$60.

each

TFT flat panel display

Proview K515S 15” 1024x768

$120.

each

TFT flat panel display

ViewSonic 17”

$174.

each

TFT flat panel display

ViewSonic 19” 1280x1024 with built-in speakers

$290.

each

TFT flat panel display

Samsung 19” w/contrast 1000:1

$417.

each

TFT flat panel display

Samsung 21.3” 1600x1200 w/contrast 500:1

$635.

each

Sound & game adapter

Creative Sound Blaster Audigy 2 ZS

$44.

each

Wireless keyboard & mouse

Logitech LX500  (I strongly favor the IBM Trackpoint over a mouse, but have not found it on a wireless keyboard.)

$44.

each set

Laser printer (B/W)

Samsung ML-1740

$82.

each

Laser printer (color)

Magicolor 2300DL

$292.

each



[1]     C.P. Snow, The Two Cultures, Cambridge UP, 1964, page 44.

[2]     Presumably Campbell has in mind institutions typified by the members of the Research Libraries Group.  Notice that ‘essential’ can be answered in the negative by identifying plausible methods for handling each realistic threat to preservation.  That these methods are better than alternatives need not be argued to justify that ‘essential’ would be an unwarranted claim.

[3]     Swan, Alma. Brown, Sheridan. Open access self-archiving: An author study, Technical Report, Joint Information Systems Committee (JISC), UK FE and HE funding councils, 2005.

[4]     Barton, Jane. Currier, Sarah. Hey, Jessie, M.N. Building Quality Assurance into Metadata Creation: an Analysis based on the Learning Objects and e-Prints Communities of Practice, 2003.  In Sutton, S. and Greenberg, J. and Tennis, J., Eds. Proceedings 2003 Dublin Core Conference: Supporting Communities of Discourse and Practice - Metadata Research and Applications, Seattle, Washington (USA).

[5]     For instance, see Liu, Xiaoming, Lyudmila Balakireva, Patrick Hochstenbach, Herbert Van de Sompel, File-based storage of Digital Objects and constituent datastreams: XMLtapes and Internet Archive ARC files, Digital Libraries, June 2005. 

[6]     Some observers will surely argue the superiority of professionally-generated metadata—an argument with some merit.  However, that those who self-archive will be strongly motivated to make their works easily found largely offsets such arguments, particularly with tools likely to emerge in the next decade.  For instance, the Electronic Literature Organization has published guidelines for creating enduring e-literature and digital media works.  See Montfort, Nick. Wardrip-Fruin, Noah. Acid Free Bits: Recommendations for Long-Lasting Electronic Literature, July 2004.

“The document is a plea for writers to work proactively in archiving their own creations, and to bear these issues in mind even in the act of composition.”                                                                                                                 Montfort and Wardrup Fruin, 2004

[7]     Fioretti, Marco. The OASIS Standard for Office Documents: How All Users and Developers Can Benefit”, Linux Journal 119, 64-7, March 2004.   See also Fioretti, Marco. Converting e-Books to Open Formats, Linux Journal 134, 89-91, June 2005.

[8]     Venugopal, Srikumar. Buyya, Rajkumar. Ramamohanarao, Kotagiri. A Taxonomy of Data Grids for Distributed Data Sharing, Management and Processing, 2005. 

[9]     Measures to prevent excessive propagation would be needed.  Providing them would not be difficult.

[10]    In addition, a different bitstring preservation environment is already coming into existence; see Ourmedia.  Campbell might argue that this is an institution.  However it is a quite different kind of institution than she seems to have had in mind.

[11]    The reader should not infer any suggestion that research libraries are in jeapordy.  Collections on paper and other physical media will continue to increase in value as more and more people have the time, education, and inclination to read.  Paper is a highly refined technology with features that digital media are unlikely to replace.  However, the fraction of the world’s information that is on paper is likely to decrease, and the use of paper collections will be a smaller fraction of total information use simply because digital media are readily at hand.

[12]    A Toronto Globe and Mail report on the recent SLA (Special Libraries Association) conference suggests one measure by its title: “Librarians as Sleuths”.  The SLA is an organization for librarians working in other sectors than education.  It says of itself, “Special librarians are information resource experts who collect, analyze, evaluate, package, and disseminate information to facilitate accurate decision-making in corporate, academic, and government settings.”

[13]    The issues are identified in Roush, Wade. The Infinite Library, Technology Review 108(5), 54, May 2005.

[14]    This is from a listing in Borghoff, Uwe M. et al.  Vergleich bestehender Archivierungssysteme (Comparison of Existing Archiving Systems), Nestor Kompetenznetzwerk Langzeitarchivierung, 2005. 

      Further content management offerings are listed on the Resources for Java Server-side Developers Web page.

[15]    Canadian Heritage Information Network, Collections Management Software Review: criteria checklist, v. 3, 2000.

[17]    Gladney, H.M.  Mantey, P.E.  Requirements Analysis for a Document Storage Subsystem, IBM Research Report RJ 7085, 1989.  Gladney, H.M.  Digital Library Requirements Analysis, unpublished IBM Research Report, February 1995.

[18]    By ‘traditional museum’, we mean a museum holding primarily physical artifacts and printed documents, without a large digital collection or Internet accessibility.

[19]    McKenna, Gordon. Patsatzi, Efthymia. SPECTRUM: The UK Museum Documentation Standard, 2005.

[20]    Cornwell Management Consultants, Model Requirements for the Management of Electronic Records, 2001.

[22]    The main challenge is to allow enough API flexibility for both hierarchical and non-hierarchical repository models. This is done by providing for both hierarchical, path-based addressing of content items and direct, UUID-based addressing.

[23]    The JSR 170 designers represented the development teams of about 15 content management software vendors.

[24]    The Jackrabbit Project is developing an open source implementation of the Content Repository for Java Technology API (JCR).

[25]    A more careful (and lengthy) analysis can be found in a chapter written by Rudolf Carnap in Blackmore, J. Itagaki, R. Tanaka, S. (ed.) Ernst Mach's Vienna, 1895-1930, or, Phenomenalism as philosophy of science, Boston studies in the philosophy of science; v. 218, Dordrecht, Boston: Kluwer Academic Publishers, 2001, pp.159-186.

[26]    Hook, Sidney. Reason, Social Myths and Democracy, Humanities Press, New York, 1951, Ch. II.  Reproduced in Feigl, Herbert. Broadbeck, May.  Readings In the Philosophy Of Science, Appleton-Century- Crofts, 1953.

[27]    Cohen, M.R. Reason and Nature, Harcourt, Brace & Company, 1931

[28]    Kuhn, Thomas S. The structure of scientific revolutions, 1962, page 160.

[29]    To the reader who would like more detailed accounts of Russell’s Paradox, DDQ recommends:

·   Audi, Robert (ed.). The Cambridge Dictionary of Philosophy, Cambridge U.P., 1999, entry for “semantic paradoxes”.

·   Coffa, A. The Semantic Tradition from Kant to Carnap: To the Vienna Station. Cambridge U.P., 1991, Chapter 7bb.

·   Everdell, William R. The first moderns: profiles in the origins of twentieth-century thought, Chicago; London University of Chicago Press, 1997, Chapter 12.

·   Monk, Ray. Ludwig Wittgenstein: The Duty of Genius, Penguin,1990, pp.31-3 and 306-7.

·   Monk, Ray. Bertrand Russell: the spirit of solitude, Simon and Schuster, 1996, pp.142-183.

[30]    Sullivan, Arthur, Logicism and the philosophy of language: selections from Frege and Russell, Broadview Press, 2003.

[31]    Frege, Gottlob, Logical investigations, translated and edited by P.T. Geach and R.H. Stoothoff. Yale U.P., 1977. 

[32]    Frege, Gottlob.  Posthumous writings. Translation of selections, originally published as Nachgelassene Schriften und wissenschaftlicher Briefwechsel, vol. 1, by Felix Meiner, Hamburg, 1969.  See On Schoenflies: the Logical Paradoxes of Set Theory, originally On Schönflies: Die logischen Paradoxien der Mengenlehre, 1906.

[33]    Many Web pages display such drawings.  See, for instance, Escher for Real, Beyond Escher for Real, and Geometry Junkyard.

[34]    Until two years ago, this was IBM’s joint Research and Development group.  It is still housed in the IBM Almaden Research Center.

[35]    See also Anderson, Dave. You Don't Know Jack About Disks, ACM Queue 1(4), 20-31, 2003.

[36]    For a fuller description, read the Wired Magazine article that prompted these paragraphs.

[37]    See Galbi, Elmer W.  Software and Patents: A Status Report, Comm. ACM 14(4), 274-280, 1971. 

[38]    Carnap, Rudolf.  The logical structure of the world; pseudoproblems in philosophy, translated by Rolf A. George, Univ. California Press, 1967.  ISBN 0-812-69523-2   Originally published in 1928 as Der Logische Aufbau der Welt.

[39]    Kraft, Viktor. The Vienna Circle: The Origin of Neo-Positivism: a chapter in the history of recent philosophy, Philosophical Library, New York, 1950, page 84.

[40]    From the dust cover of Conant, Jennet, Tuxedo Park: A Wall Street Tycoon and the Secret Palace of Science That Changed the Course of World War II, Simon & Schuster, 2002.

[41]    Conant, Jennet. 109 East Palace, Simon & Schuster, 2005, page 255.

[42]    From the dust cover of Garwin, Richard L. Charpak, Georges. Megawatts and Megatons, Knopf, 2001.

[43]    ‘RSS’ is an acronym for ‘Really Simple Syndication’.

[44]    I do not myself use CiteULike because I have had well-oiled procedures for constructing bibliographies and notes.

[45]    Prices include California sales tax @ 8.25%. 

      Some prices seem to be loss leaders.  For those products whose advertisement identified a manufacturer, this manufacturer is identified.  However, some advertisements do not identify manufacturers.  All the advertisements used come from the San Jose Mercury News or trade magazines.