UPF User Survey


Complete results as of 2/98


Challenges for the long-term storage and preservation of media assets:
Challenges of storage media obsolescence and reliability got the most votes, but additional comments mentioned the challenge to formulate a strategy for migrating materials to a digital storage, dealing with budgetary constraints and, in some cases, the apathy of administrators. Establishing new strategies must factor in costs, and a recommended practice might look at the costs of migration to digital formats, perhaps looking at how a UPF might lower certain costs. This material must be presented in plain English. It is one thing to convince archivists of the need for a UPF; it may take more to give archivists the resources for convincing administrators and board of directors of this need.


What do you think are the greatest challenges for the long-term storage and preservation of media assets?
a. Finding a climate-controlled, stable storage environment to physically house a collection. 14
b. Chosing a storage technology that will not become obsolete in a few years. 39
c. Physical reliability of storage medium. 26
d. Meeting costs of state-of-the-art storage. 18
e. Accessing analog materials once they are in a digital format. 13

Designing an open system that you can exchange parts [...] to take advantage of new/better technologies as they become available. - Baker
[Physical reliability of storage medium] may be the "greatest" challenge, but [Chosing a storage technology that will not become obsolete in a few years] and [Meeting costs of state-of-the-art storage] are, I think, the ones that will make the most difference over the next century to the greatest amount of material. The solutions to [Finding a climate-controlled, stable storage environment to physically house a collection] and [Accessing analog materials once they are in a digital format] require education/action of individual institutions. - Carli
...developing strategies for migrating records to new technologies as old technologies become obsolete [...] that ensure the critical components of a records content, context, and structure are captured and moved forward through time. - Conrad
...data migration - moving it from today's format to tomorrow's. - Eiteljorg
...hardware to play the formats is the crucial topic. - Gaustad
The challenge is in preparing for technology obsolescence. The library and archive community are beginning to talk about encapsulation of behaviors with "documents" (a document can be a memo or a film) such that the software that created it can be emulated. - Graham
...the existing image data bank in analog form may be "traditional" but I do not have to worry about changing softwares every few months that make it impossible for me to retrieve what records/ images I had made last year.
... What happens if the caretaker is removed? - Hehman
...format obsolescence, and the difficulty of managing the migration of data to new environments (especially if establishing preservation standards for the conversion of material from one format to another is required). - Hirtle
Format migration is one solution, but it requires careful planning and is expensive, requires also a flexible storage area design. - Lee
The act of migrating from an old medium or format to a new one for which it was not originally designed. It is precisely that moment of transition when information is in danger of being lost. - Lowe
...convincing administration of the need for long-term storage and preservation of media assets. - McLemore
It is a fact that the rate of physical and chemical deterioration is linked to environmental conditions. If the "native" format can be stabilized through proper storage environment, then (at least in theory) it can be accessed well into the future for the purposes of duplication and format migration. Also, since doing the actual piece by piece reformatting is a costly, time consuming proposition, a good environment is economical since its has a stabilizing effect on all materials in the collection. - Messier
...software systems that manage and access the hardware storage systems [...] are unpredictable, inflexible, buggy, and they change every few years, causing us [...] to be unable to read data even though the hardware storage was stable. -
Ogle
...a bigger problem is ensuring continual funding in keeping / maintaining the environment in the storage facility stable. - Phan
Migrating the data as new technologies develop. - Sinclair
The big word here is money. Trained staff and adequate space and time to do the work .... - Skarstad
...getting people to care. - Vanderlinden
...cost of data migration as technology changes. If the technology wave-period can be extended, as UPF seems to offer, the stability of storage media becomes significant, along with the conditions required to make it last the distance. - Webb
If paradigm shift occurs (switch to digital), backward compatability.- Weiss
...convinc[ing] the companies to produce a product for us to purchase and the equipment to do the transfers. - Wilson
The greatest challenge is continuing to provide material to users IN THE FORMAT THEY FIND CONVENIENT AND USEFUL. This implies format conversion. The logistical challenge is performing such conversions in a cost-effective manner. - Wright


Concerning digital and non-digital storage types or formats:
The vast majority of those who answered our survey said they had at least some knowledge of data storage formats. Due to the technical nature of some of our questions, the survey may have scared off many who do not have this level of knowledge.

a. I am familiar with a fairly wide range of digital and analog storage types or formats. 24
b. I have knowledge about some types or storage types. 20
c. I have a minimum knowledge of types or storage types. 5
d. I have absolutely no knowledge of types or storage types. 2


Concerning metadata
In contrast to the storage format question, metadata as a concept seemed to be understood at least on some level by almost all of those who responded. The term has been somewhat popularized by various search engines or services on the World Wide Web, but many archivists know the concept through finding aids initiatives.

Metadata is a term that can bewitch, bother and bewilder us, but it's really quite easy to explain: it's simply data about data. An example of metadata are a library's catalogue records: information that refers or directs one to particular objects. Users of the World Wide Web may be familiar with the HTML tag, META. Among its many uses, the META tag enables Web designers to insert indexing materials or "keywords" that can be processed by certain search engines such as Hot Bot and Alta Vista. In addition, Web designers can use the META tag to insert a their own descriptions of their sites, which is displayed when site information is retrieved by some of these search engines. Based on functions, metadata may be categorized into four areas:
Format: information to decode the Essence
Descriptive: cataloguing information, information used for search & retrieval
Association: information that synchronizes or interleaves the Content Components
Composition: instructions for the various components to work together in unison

a. I have a good understanding of metadata and have followed one or more metadata initiatives or projects. 20
b. I have some understanding of metadata and metadata initiatives or have followed one or more metadata initiatives or projects. 17
c. I have minimal understanding of metadata, but have not followed any metadata initiatives or projects. 10
d. Until this survey, I had absolutely no knowledge of metadata. 2


Concerning knowledge of SMPTE:
Considering that we were appealing to archivists in this survey, it should not be too surprising that so few people knew about the Society of Motion Picture Engineers. Until now, SMPTE has not sought input from the archival or library science communities. One of our missions is to make these groups aware of this technical standards organization.

The Society of Motion Picture and Television Engineers (SMPTE), a standards-making organization, has recently (Sept. 23, 1997) established a Study Group to "document the requirements of data formats for the preservation of electronically generated media and related information. Extensive input from the archival community will be gathered through the use of surveys and meetings."

a. I am an active member of SMPTE. 3
b. I am aware of the aims and functions of SMPTE. 8
c. I have heard or read something about SMPTE, but don't know much about the organization or its relevance to a universal preservation format or framework. 21
d. Until reading this survey, I have never heard of SMPTE and so cannot determine how it might relate to a universal preservation format or framework. 17


Concerning wrappers or "bento containers":
Interestingly, only two people thought they had a good understanding of wrappers. This may be attributed to the elusive nature of the concept. More likely, archivists and librarians understand the concept without quite connecting it to the new terminology.

Bento is a Japanese term that refers to a lunchbox which contains many individual compartments. The Bento storage specification is like this lunchbox in that it holds several pieces and formats of a compound document within a single container. Though the phrase "Bento container" refers to a technical specification developed and last revised by Apple Computer, Inc in 1993, we may use the term in a generic or metaphorical sense to mean a container for packing up and moving both the essence of a media format (a movie, a musical composition, etc.) and the information about this essence (the metadata) as a single unit.
Concerning your knowledge of wrappers or "bento containers":

a. I have a good understanding and have followed one or more related initiatives or projects. 2
b. I have some understanding of the concept. 13
c. I have a bare minimum knowledge of wrappers or containers. 14
d. I have absolutely no knowledge of wrappers or containers. 21


Ultimate goals concerning digital preservation
This question evoked a commentary similar to the earlier "challenges" question. Some of the same issues were raised, particularly budgets. The majority stated that they intended to let public demand determine which analog materials would be converted into digital. Also mentioned throughout the commentary was the determination to continue all analog preservation efforts, even while dipping toes into the digital waters. Implicit throughout many of these commentaries was a "wait and see" approach to digital archiving. Establishing a standard for a universal preservation format through SMPTE and other engineering groups would be like placing a life guard before these troubling waters.

Which of the following statements best reflects your thinking about the ultimate goals of institutions concerned with the digital preservation of media collections?
a. We will save the non-digital originals as long as possible, converting them to digital when the originals are in measurable danger. 1
b. We will convert non-digital materials into digital formats for access purposes (such as Internet and user requests). 28
c. Only materials in measurable danger will be converted to a digital format. 0
d. Non-digital collections will continue to be maintained through traditional storage methods, and only selected materials will be digitized for preservation purposes. 13
e. There are no plans to digitally preserve our collection at present. 5

Funding [...] is of great concern to my institution. - Allen
We will convert archives to a digital format when we can raise the money to move those collections. Others will migrate to digital if we think public access is important for them. - Baker
Our current access/preservation methods are based on microfilm usage, and we now plan to film any items (practically all our collections of interest are on paper) that we digitize for access purposes. We use RLG (Research Libraries Group) standards for microfilm metadata. - Carli
Given the present problems with migration and technological obsolescence, digital "preservation" is probably not the best idea. - Conrad
We will save the non-digital originals as long as possible, with plans to reformat them onto other analog formats as funds permit. For those items where digital copies are needed (for wide access/preservation/internet availability), the item will be reformatted onto analog and digital formats *simultaneously*." - Dale
...sav[e] the non-digital originals as long as possible, but [...] digitizing for preservation and access should be done *before* there's any clear sign of danger/deterioration. While ideally one would like to save everything forever, the cost of conversion and ongoing maintenance of digital versions probably means that appraisal strategies will have to determine what gets digitized and saved... Access and user demand as well as preservation needs should be factors in deciding what to digitize. - Hadley
...our thrust is to identify images which are used often [...] and/or images which best represent a critical component and get it in a medium for quick and easy retrieval and use company-wide. - Hamal
We will use digital preservation for some things when they are in danger... Other things we may convert to digital form to improve access, but when we do so, we should convert them at a preservation level... (see the remarks by Anne Kenney and Steve Chapman on "Full Informational Capture" in DLIB magazine). Hybrid solutions (digital conversion, but an analog preservation copy on traditional media) may still be necessary. - Hirtle
We convert non-digital 35 mm neg. stock to digital so we can use it. We keep the original always. - Lee
Materials should be stored in multiple formats, both analog and digital to minimize risks of catastrophic unrecoverable loss. - Lindner
...I advocate the hybrid approach to the library collection, with preservation reformatting options that include paper, microfilm, electro-magnetic analog, and digital options. - Lowe
...I'd digitize the lot and preserve the originals in the best possible environment with as little disturbance as possible. - Lucas
...office files are likely to be handled differently from library collections, and within the latter, rare books and manuscripts are likely to be handled differently from newspapers. - Maher
For this question, I think it is useful to talk about two categories of artifacts: 1. Machine readable, electronic artifacts (video, computer files, art etc) and 2. "Material" artifacts (e.g. books, photographs, paintings, etc.). For category 1 objects, digitization will emerge as the only viable way to preserve this class of artifacts. For category 2, digitization should almost never be used as a preservation tool. For artifacts in category 2, digitization would result in facsimiles that might be useful for the purposes of access but would be nearly useless for the purposes of preservation. - Messier
...digitize as needed for projects and to offer selections to colleagues, but to wait until standards are in place to do a major digitizing project. - Pollock
We are developing guidelines for digitisation... It is important to encourage people to digitise their collections before they deteriorate [and] present many other problems. - Rymer
I [...] would like to explore available [formats] and convert deteriorating materials first, then move on to "all" materials for access purposes... - Schwan
...everything should be digitised as a preservation and a distribution strategy, but [...] original materials should be kept in cold storage. I feel we should get rid of film only when (a) very high resolution, non compressed images can be stored in computers or (b) material was originally shot in digital format (film prints might be made for distribution). - Turner
...provide [...] functional access to all [our] materials, chosing from [...] storage and transmission techniques as the needs and resources of those we serve allows. - Vanderlinden
[Because] staff and money is a great concern, [...] we will have to be selective as to which non-digital materials are transferred... - Vetter
1. For our audio collections we will digitise everything that is worth keeping because it is in danger of obsolescence as market support for analogue technology shrinks...
2. For multi-media, [...] we will [...] preserve [...] only selected materials for ongoing access. That preservation decision will be based on collection development values...
3. We have a program of digitising to improve access to our rare and valuable Australian non digital collections. While access is the prime goal of digitisation in these programs, there is a preservation connection, partly to protect the originals from wear and tear caused by use, and partly to discover more about the technology in order to preserve the digital copy.
4. Finally, we are involved in programs to digitise material with preservation as the main goal. At this stage they are experimental in that the technology is still too immature to be able to regard digitisation as an appropriate preservation format... Distinctions between preservation and access are less definite in the digital environment. - Webb
This decision will be most affected by costs of conversion. - Wilson
Our plans vary slightly with media type. For photographs, we will mainly ... [save the non-digital originals as long as possible, converting them to digital when the originals are in measurable danger].. For video and audio, we need to provide material in a useful format, hence [We will convert non-digital materials into digital formats for access purposes (such as Internet and user requests)]. For documents and for commercial, recorded music we [have no plans to digitally preserve our collection at present]. - Wright


Migration
The majority of respondents saw the need to maintain a dual storage strategy, though we must add that the "no opinion" choose received the second highest number of votes, which -- considering the thoughtfulness of the commentaries -- may be seen more as further evidence of a "wait and see" strategy than as indifference.

Which statement best characterizes your thinking about institutions converting from analog to digital formats for long-term storage?
a. Institutions should actively investigate digital formats with a plan to eventually convert all collections to a digital storage format. 6
b. Institutions should plan to maintain a dual set of storage for materials. 29
c. Institutions should wait for a digital storage format that addresses the issues of tape fragility and rapid and irretrievable deterioration before making any new investments. 7
d. No opinion at this time. 13

Eventually the conversion will need to be made because of a lack of playback devices for the old analog medium... - Baker
...we currently plan to fold preservation microfilming into any digitization program, until we are satisfied with longevity standards (in terms of technology and expense) for digital copies. - Carli
Collections should not be "preserved" solely in digital form until the problems of technological obsolescence and migration strategies are addressed. The fragility of the media and the obsolescence of storage technology can be addressed today. Finding a format for storing images, audio, video that can be easily and completely migrated is the key issue. - Conrad
...in the best of all worlds, institutions prefer to aim for an analog copy for long-term preservation and a digital copy for easy and readily available access. - Dale
Digital storage for our materials makes sense when either the paper media are inadequate (CAD models and complex databases) or when the physical media are deteriorating. Digital storage may also be appropriate when access is needed for more than a small number of scholars. - Eiteljorg
It will never be possible to convert everything, which means a triage process must be in place. - Graham
I think it very much depends upon what type of collection is involved and the usage it's to receive. - Hamal
...administrators [mistakenly] only see value in the newest technology and do not see any merit in keeping [...] analog materials for future access when the digital environment fails....- Hehman
Holders of copyright must be much more flexible if any progress is to be made in this area. - Hughes
I think it is premature to consider conversion of non-digital information to digital format for the purpose of long term preservation, due to the unresolved problems of managing systems of migration and refreshment, and the cost of doing so. I'm not sure the physical fragility of digital media is a particular obstacle, if we assume the physical media will be discarded with migration. - Jones
I'd like to see digitization happen as rapidly as possible for access and preservation both, but of course preserve originals in the best possible environment. - Lucas
...I'm inclined not to make any NEW investments in techology to handle things we want to preserve "forever", but I WOULD consider investing in a technology which provided long enough shelf-life and media availability to justify the costs, for use with things like office files. - Maher
With technology in such rapid flux, I do not think enough information is available about the shelf life [...] or future retrieval capabilities of current digital storage formats to commit to any particular plan at this time... Much of the media I work with has intrinsic value in its original format. - McLemore
institutions should make it their priority to both 1. Preserve original artifacts through a stable environment and 2. Begin the process of coming to terms with what it means to preserve electronic records. - Messier
There is no way a digital storage format can ever replace a physical collection. It is important to maintain both side by side to allow scholars [and others] access... - Rymer
Standardization needs to be complete before long-term investments are made by anyone. - Schwan
The original image is the best image - a maxim passed down to me in archives school. - Skarstad
...everything should be converted eventually, but that it's probably wise to wait for a permanent support rather than lock into a cycle of copying every few years. - Turner
I think institutions need to be concerned about preserving original formats as well as creating preservation formats. - Vetter
"Institutions should actively investigate digital formats with a plan to eventually convert all collections to a digital storage format" [...] so long as "all collections" means all the material worth keeping accessible within the technology-dependant collections. We have no plans to digitise all of our collections. Need to remember that investigation is still very active, and will influence whatever conversion plans we develop. "Institutions should plan to maintain a dual set of storage for materials" is also true for some collections for some time. For example, we are maintaining both analogue and digital versions of our sound recordings while the analogue set can be accessed, because we want that level of redundancy in case one medium fails - that was our approach with analogue where we used at least two different kinds of tape to spread the risk. With our multi-media collections our preservation path is migration but we are not confident that we can migrate publications successfully, so we will probably need to maintain dual storage arrangements for the "originals" which may be in good condition but not accessible with current technology, and for reformatted versions which have lost some of the functionality and look and feel of the original but at least provide some access using current technology. This approach would be based on an assumption that technology won't always drive us away from accessibility - that there will be recurrent cycles of innovation and incompatability, followed by tools to bring the inaccessible back into view. "Institutions should wait for a digital storage format that addresses the issues of tape fragility and rapid and irretrievable deterioration before making any new investments" is true to some extent, but in many cases we just can't wait: the problem reaches crisis point before the ideal solution is available. - Webb
Broadcasting is already digital, and much new material coming into the archive is digital. Transmission is also rapidly moving to digital. So the 'conversion' is a process being thrust upon us. - Wright


Compression for long-term storage:
Most people answered this question with a "Well, it depends..." Commentaries linked this issue with an institution's budget and with the types of media to be archived.

SMPTE defines compression as "the process of reducing the number of bits required to represent information by removing redundancy." There are basically two kinds of data compression: lossy and lossless. Lossy data compression results in some loss of data and is best suited to graphics and sound, where some loss of data is acceptable. Lossy compression algorithms can often be modified to allow a relative degree of data compression. The more you compress, the less accurate your decompressed data. Considering that compression of data files is potentially a trade-off between file size and a loss of digital information, what degree of compression do you feel is acceptable for long-term physical storage?
a. No compression. 6
b. Only lossless compression is acceptable. 21
c. Moderate lossy compression is acceptable, dependent upon function or purpose of long-term storage and ratio of file space saved. 8
d. Standard should allow for both non-compression and an infinite variety of compression formats. 26
e. No opinion at this point. 1

There is so much variety of function in the data to be stored that it seems best to begin with as high a degree of flexibility as possible in terms of standard-setting. It's pointless, I think, for any one organization (or generation, for that matter) to spend a great deal of time determining precise standards where judgment depends so much on so many variables. - Carli
...lossless compression is the most agreeable, but our member institutions also realize that the function and purpose of creating the copy should drive the creation. As with digital image creation, if the function is to totally replace the original, no compression or lossless compression is the *only* option. But, if the copy is only meant as an access/temporary surrogate copy or to transmit files over the internet, a derivative digital copy with lossy compression could be acceptable. - Dale
...a moderate lossless compression with plenty of error correction capability. - Gaustad
...Just as with artifacts, we will make both general and ad hoc decisions; we need tools to do both. - Graham
For critical records, all original data must be preserved, but the standard should recognize various categories of preservation importance. - Hadley
The issue of compression [...] is highly dependent upon the type of collection and [its] use... We typically do not compress files for storage, but there are some types of information where compression is logical and sensible. - Hamal
Lossless compression is only acceptable if the algorithms for recovering the full information are so established or simple that there is no danger that the compressed information will be loss. Also, the media must be such that an uncorrectable error will not corrupt the remainder of the file (as can happen with a JPEG-compressed image). Lossless compression is the only acceptable solution for preservation [...] because we do not know what kinds of analysis or manipulation we may care to use on the files in the future. An automatic indexing program, for example, might work on a lossless file, but not on a file which has been compressed so that humans cannot detect the loss. These concerns must be balanced against the medium of the original. It is probably better to compress a VHS videotape using MPEG than it is to allow the original to continue to deteriorate. It is just not ideal. - Hirtle
Given the probable improvement of future technologies, and the possible loss of original information before more accurate reproduction is possible, I'd rather use lossless. But economic or other practical considerations tend to intervene, and archivists often have to settle for less than the best. Perhaps a hierarchy of desirables could express the standard. - Lucas
If we are talking about compression then I guess we are talking about migrating the original content from its "native" file format to a compressed format. For many records, this transition away from the native format will be required for the purposes of preservation. If this is the case, then I think compression is acceptable since the link between the preserved information and its original, "native" format has already been broken. Therefore, since the preservation copy is already a "facsimile" there is little lost in leveraging compression techniques. Methods of compression and guidelines for use should be developed. - Messier
Compression depends on the object. Lossy compression is very acceptable for thumbnail images used for browsing, but perhaps not for the larger resolutions that will be sent to the printer. - Ogle
I favor lossless because I cannot predict whether there will be some future cause of loss. - Skarstad
Standards should take into consideration the type of media, for example some loss may be acceptable for audio but not for video. - Vetter
...by allowing choices of compression, the curators and archivists will presumably have storage cost choices. This will allow some items to be preserved with various levels of compression depending upon the "value" of the item. For example, some home videos, collected to document family life, could be converted with quite a lot of compression if we were collecting them for the cultural information about home video production. - Wilson
...Generally speaking we favour lossless compression or no compression for our digital conversion files: we have invested in high resolution capture and want to be able to take advantage of technological advances that will make it feasible to transmit and view high resolution images or recordings. But we have accepted that lossy compression has a place for some material that needs to be kept but where the quality is not a critical preservation factor. - Webb
Some material arrives in the archive already compressed (eg feeds via satellite into newsgathering). The digital transmission will be compressed. The broadcast archive needs to MANAGE compressed materials (cf EC-sponsored project Atlantic), and ensure they are labelled and handled so that the compression does not lead to avoidable deleterious effects. - Wright


Metadata Initiatives
The Dublin Core Workshop and Berkeley EAD (Encoded Archival Description) Projects were the two metadata initiatives that received the most recognition votes, though it is quite possible that the Text Encoding Initiative (TEI) and Recordkeeping Functional Requirements Project (U of Pittsburgh), would have received more votes if we had included them from the very beginning.

Which of the following Metadata projects or initiatives are you familiar with?
a. Warwick Framework 12
b. Dublin Core Workshop 25
c. Berkeley Distributed Video-on-Demand System 3
d. Berkeley EAD (Encoded Archival Description) Project 24
e. Digital Object Identifier System 10
f. MIT Headers and Descriptors Research 1
g. Computer Interchange of Museum Information (CIMI) 10
h. Text Encoding Initiative (TEI) 14+
i. *Recordkeeping Functional Requirements Project (U of
Pittsburgh) 1+
*We did not add this option until late in the survey.
j. None of the above. 8

Also the work of Duranti at the University of British Columbia. - Andolsen
Pittsburgh and Philadelphia Projects
Text Encoding Initiative
Draft Functional Recordkeeping projects at the National Archives. - Conrad
See also commentary.
Your discussion of metadata [...] ignores the work identifying the need for structural, administrative, and descriptive metadata. In addition, we have to remember that one person's metadata may become another person's data. - Hirtle


Cataloging media collections
Overwhelmingly, the institutions represented in our survey use some form of commercial computer application to catalog their media collections. Popular programs like FileMaker and Access were championed, as were specialized packages, such as Endeavor, Cuadra, Cairs, and Tinlib. Of course, some form of MARC cataloging software was mentioned throughout the commentary. It was interesting that, except for MAVIS, no one mentioned software known as "asset management products" that were designed to "to effectively catalogue, index, access, search and retrieve video clips, digital libraries." (See: http://www.cmis.csiro.au/DMIS/VideoTalk/software.html for more information about this type of application.)

How is the cataloging and/or bibliographic material for your media collections now being generated or maintained?
a. in-house via popular commercial software packages, such as Filemaker or Access 23
b. through specialized database management systems, such as Willoughby or Cinebase 17
c. through printed catalogs maintained by staff and distributed to the public 6
d. through out-sourcing or periodic hiring of a temporary staff 0
e. This question is not relevant to my position or institution. 6

Other systems mentioned in the survey:
Nesbit's Media Library System

We converted our paper card catalogues to electronic access over the last 20 years, and have just engaged a new library database management firm, Endeavor... - Carli
WWW access to underlying data in Cuadra Star and Oracle databases. Metadata is stored in the system we use for validating electronic records and is available for answering public queries. - Conrad
Our members catalog into both RLIN and OCLC, the other bibliographic utility. Metadata is also placed in the files (e.g., TIFF headers, SGML, or HTML) themselves. - Dale
We roll our own with postgresql. (http://www.postgresql.org/) - Drelinger
We use MAVIS, Merged AudiVisual Information System, developed by Wizard in co-operation wit National Film & Sound Archive in Australia. - Gaustad
Inhouse through online catalogs (in our case, use of OCLC and RLIN to develop cataloging for our SIRSI Unicorn system). - Graham
Probably through a MARC cataloging software, with finding guides generated in paper form for researcher use (until full retroactive conversion of all Special Collections finding aids is complete). - Hadley
Access is my Bible, but a group I'm associated with also utilizes strange little programs, a kind of high-level mix and match potpourri (because we are a strange little group!) - Hamal
All Library acquired media formats are cataloged inhouse via OCLC in Ohio using MARC standards according to AACR2 and Library of Congress policies. - Hebert
Bibliographic data about all media is included in our online public access catalog according to MARC descriptive standards. - Hughes
In general, I'd say we try to make MARC records work in our NOTIS-based online catalog. Beyond that, our Digital Library collections, such as Making of America, are searchable on a more enhanced level due in great part to the wonders of SGML. - Lowe
My particular project is due to generate both a Filemaker database and an EAD-standard finding aid. - Lucas
Also: through custom-designed databases and application-specific metadata (for example: 1. botanical datasets and 2. image feature information - color,texture) - Ogle
Through electronic records in local and national bibliographic utilities (OCLC). - Olley
We also still have a card catalog with the majority of our collection.- Pollock
For online publications we also use ABN for cataloguing, but we are also developing a metadata archive as part of our PANDORA (Preserving and Accessing Networked Documentary Resources in Australia) Project. This has linkages with ABN but incorporates much more detailed information required for management rather than access. - Webb
These are treated in accordance with standard bibliographic treatment, with full MARC records entered into the Library's online catalog. - Wilding
MARC cataloging on OCLC and distributed via our statewide on line catalog called PALS. - Wilson
All physical-based items (ie sound recordings and multi-media discs, etc) are catalogued via our Australian Bibliographic Network (ABN) which is the national cataloguing system. The records are also included on the Library's own local catalogue. There are other specialised databases we use to record other metadata including preservation management data. Our largest catalogue (2 GB of text) is a bespoke system based on Informix 4GL. We also use commercially-available library packages (Cuadra, Cairs, Tinlib). For documents we have our own registry system written using DataEase. - Wright


Using descriptive information or metadata
This was one of the survey's "dah" questions. Asked how metadata might be used, respondents said in effect that the use of metadata was only as limited as the imagination, though granting "easy accessibility to the material" by way of search and retrieval got the most votes. Copyright figured in a couple of the commentaries, including the need for identifying sources and "terms for usage," and for preserving data integrity.

How would you like to see institutional media collections use descriptive information or metadata?
a. to handle content in a more efficient manner (for example, information about a media object might be entered just once and then distributed to and used by a variety of applications) 32
b. to grant easy accessibility to the material (for example, through a sophisticated search and retrieval application) 40
c. to tag material with original source or copyright information 27
d. to help in the automated creation of Finding Aids 29
e. to assist in linking references from items in your collection to related items in other collections 29
f. do not know yet 7

To allow easy browsing to cut down on the the need to review entire documents and/or productions. - Baker
I do in fact think that source/copyright tagging is of crucial importance in terms of data integrity. - Carli
...bit of a blue sky question here. - Graham
We don't need copyright information as much as identifying the source and knowing if we have approval for public release, or if it's from a stock house, what terms for usage have been set. - Hamal
I would hope to link the visual resources collection to our larger state wide library database system. The faculty are being pushed to teach as many classes as possible by using the Web. - Hehman
There are many layers of metadata that are desirable and necessary for managing all aspects of the digital file, and we need to work out every single one of them. - Turner
For once, "all of the above" seems the right answer! Although we are exploring all of the suggested options, we also expect to find other ways of making use of the descriptive metadata as we move to increasing automation of archiving and retreival, and increasing integration of internal and external systems. - Webb


Media collections on the World Wide Web
Those who knew how they would use the World Wide Web were evenly divided among the limited choices for this question. Commentaries suggested additional possibilities. And though the question used the word "envisioned," it is clear that several institutions are already implementing Web options. A recommended practice might impact on how archivists use the World Wide Web. For example, a common browser might access materials within an archive stored in UPF, either through an intermediary HTML page or through a browser plug-in.

One of the issues involved with container initiatives is how might a specific technical specification impact access to media collections through the World Wide Web. Already one may observe a trend in the software industry in which middleware or intranet products are being adapted for more general use. The latest versions of popular database software are also providing built-in HTML tools and translators. Archival media files and/or information about those files might be available through both database applications and web browsers.
How do you envision using the World Wide Web in respect to your media collections?
a. Finding Aids or bibliographic information only. 28
b. Downloadable "thumbprint" Video. 23
c. Ordering Information. 25
d. Multimedia on demand. 26
e. Do not know. 7

We plan to use an intranet and/or the internet via a browser as The method for searching the index and retrieving audio (both linear audio and reduced bandwidth versions for browsing) from our digital archive. - Baker
As a research library, we are responsible for using whatever method is most appropriate for giving our patrons access to the material they need in a timely fashion. - Carli
Downloadable data in multiple formats -- not just video. - Conrad
The Web is a wonderful way to help researchers find out about collection materials, and can feature materials as exhibits, but it is not a substitute for the reading room. - Hadley
Our image library is on the intranet and to get it there we must go from Access through Cold Fusion to make it Web Happy. We are looking into products (like Cumulus) to find a one-stepper which will still meet all of our database needs and successfully support our web site. - Hamal
...making a robust, viable UPF would be a major contribution and that making UPF "web smart" might not be worth the trouble. In other words: once you get a UPF up and running, let www software engineers figure out how to best leverage the format. - Messier
The Web for our organization will probably be used as a catalog for our picture collection, providing access to low resolution versions of our images. - Pollock
It's a question of time. As the infrastructure becomes able to handle it (the "information highway" as opposed to the Web) it will be possible to have more and more layers available on line, and I feel it is desirable to have them all. - Turner
We are currently constrained by legal, ethical and technology issues that make it hard to predict how we will use WWW for both our sound and multi-media collections. For example, the oral history collection contains a majority of material for which networked transmission has not been approved - this is a significant ethical issue even when we overcome the technological difficulties of passing huge sound files through the network. Likewise, for multi-media publications we face copyright and licencing constraints which may or may not be addressed with technology-based security devices. Of course, for our archive of online publications, the Web is the means of providing access, both to the bibliographic information and to the publication itself. - Webb
I would like to "deliver" some video images so that text alone does not describe the item. But I do intend to put the finding aids on the Web as well. - Wilson
We are part of the EC-sponsored Euromedia project, developing automated indexing and retrieval tools to support online, realtime video delivery in 'preview' quality. We expect to have LIMITED amounts of video online; we already have approx 70k online stills, delivered via intranet to registered users of the system. - Wright


Distributing and Storing Media
This first of two "blue sky" questions was perhaps too narrowly focused to generate feedback from most of our respondents, especially those concerned primarily with electronic records. Nonetheless, a few of the commentaries saw the application of metadata streaming for still image collections. Many other commentaries mentioned the problem of cost versus value in establishing detailed metadata streaming.

How should material that describes an archives collection be distributed and/or stored?
a. Continuously with content : for example, at any given point, one might freeze a video frame and access information about:
+ the video as a complete entity (example: copyright material, original format and running time...)
+ the specific video clip or sequence that includes this frame
(example: number of frames in this version versus number of frames in original, which numbered frame within the sequence)
+ the individual frame's descriptive information (what objects are contained in the image) 19
b. As header information only: information describing "essence" stored as a Table of Contents at the beginning of the video. 13
c. As a separate document: information about the "essence" would load and be stored as a document, accessible via retrieval or "parsing" software. 18
d. Do not know. 13

I am concerned mostly with audio and so I have no opinion on some of these moving image issues. I believe most metadata about audio can be captured in descriptive data entry fields such as: key words, participants... - Baker
[Describing continuously with content] ... might be a realistic undertaking within the budget of a major motion picture or for a uber-publisher, but I doubt individual artists (who can publish their own work on the net), small businesses, and other under-funded or under-staffed groups would be able to afford the time or money to catalog their work with this level of granularity. - Dreilinger
Frame-by-frame description seems extreme for most institutions. But, like all archival description, it should be possible to describe videos similarly at all levels--from a collection of 500 tapes down to a single frame.It's nice to have description attached to the material, but it should also be able to stand alone. Are these mutually exclusive options? -Hadley
Each of these are viable alternatives. Implementation would be dependent on institutions technical resources and abilitiies. - Hedges
"a" through "c" seem nearly impossible for most materials since someone has to oversee the input and value of the massive amount of descriptive information these methods would call for. - Messier
I really like [Continuously with content]. This would also be great for still images. FlashPix suggests this but we are not there yet. - Ogle
I would have video and descriptive information in the same file, as well as stills. - Schwan
We are exploring the use of both header and separated metadata, and think we want both, with a preference for linked separate metadata files. Again, we are very involved in investigating this... [Continuously with content] looks good, and we are after a similar level of retrievability of content and metadata for our sound collection, but it looks like a luxury at this stage. - Webb
For Euromedia, which also uses browser technology and is fully object-oriented, we have a hierarchical storyboard as a metadata descriptor, with keyframes, and with text associated at four levels: frame, shot, sequence, programme. For security, this information could be stored with the digitised data as well as separately. At the object level this distinction is invisible. - Wright


Metadata Streaming
How deep should descriptive cataloging go? If one picture is worth a thousands words, how many of those words do you include in your metadata? The UPF is not designed to answer these questions specifically, but it would establish a foundation that would help enable archivists and other information professionals to resolve metadata questions for their own institutions. One area with which the UPF might deal explicitly is the placement of copyright information, and the identification of "original works" received the highest number of votes. The UPF might also grapple with "versioning": assigning unique identifiers to the first generation object or compound document, then referencing them with unique identifiers with subsequent generations.

Metadata streaming, a kind of sequentially flowing descriptive data, relates to the EBU/SMPTE concept of a "new class of enablers of systems using bit streams for program material exchange." There are many practical applications for this metadata streaming, including a "standardized descriptive data to help in locating material through various database entries." Would you favor a standard by which:
a. each frame of a digitized film or video identifies a relationship to its shot, sequence, and entity. 28
b. actions within a film or video sequence are described. 27
c. objects and placement of objects within an image or frame are catalogued. 29
d. "original works" are identified through standard fields of information. This standard may stem from an established practice, but it must encompass the full gamut of media sources. Just as published works are assigned an ISBN, standard identifiers could be assigned to materials that are digitized for preservation. Distinct identifiers could also be assigned for distinct versions of this material. 33

...integrity and intellectual associations are more important than image-related associations. ...I feel strongly that ["orginal works" are identified through standard fields of information] is paramount for any standard. - Carli
...will we describe the boats in the sea around the Statue of Liberty, considering that one day someone might be interested in this type of sequences? or do we decide that describing the Statue of liberty wil 'implicitly' indicate to a future user that there might be boats around it ? - Despres-Lonnet
The metadata should be able to be applied at this level automatically from a top level record. Enter the title in a base record and when the data is streamed out the title is added to each individual frame. Tools to make this information easily replicated will be crucial. - Hedges
Artists and art students are very aware of originals and "restored" / colorized versions since that is what they sometimes learn to do in their craft. But the industry should not make it so difficult that artists and students cannot be allowed their creativity - They are your future. Think of Steven Spielberg. - Hehman
I think these sound good, but not all will take off at the same time. Some efforts will be the realm of libraries, while others will be left to more risk-taking ventures for entrepreneurial publishers. - Lowe
Yes! Yes! (Oh, excuse me . . . ) It's just that it seems so futuristic compared to not being able to afford to make a reference copy . . . - Lucas
The Standards of Practice and the Code of ethics for the American Institute for the Preservation of Artistic and Historic Works might be interesting in this context. - Messier
All of these features would be so desirable it isn't funny! - Turner
Metadata will be tricky and big. - Weiss
Version control [...] is a very important part of the projects we are running here -- without it we don't believe we have effective preservation because we would lose the provenance of the copies! This is an issue for users but also for the management of the archive. ...we [...] see [this level of retrievability] as a relatively low priority for our kind of collection. However, we are involved in a project looking at adapting that kind of moving image-based software to audio. [Describing actions within a film or video sequence] ...would be very useful, but it is much harder to characterise significant reference points such as movements, colours or shapes in oral history sound recordings. ...to what extent are these preservation tools, worthy though they may be? Is there a danger of access tools distracting us from the preservation challenges? - Webb
We train cataloguers to have a principled, consistent approach, but also they have to vary this approach depending upon the nature and value of the programme (entity). But [Each frame identifies a relationship] and ["orginal works" are identified through standard fields] are both essential to the integrity of the labelling. - Wright


Closing Thoughts
Closing commentaries reassured us that a standard is imperative to any digital migration initiative. Areas discussed in this section that were not singled out in the survey questions include the need of a preservation format to include a common lexicon, a recommended practice or guideline, and even a call for standardizing digital equipment. Education was also stressed throughout the commentaries; archivist organizations should consider establishing online tutorials for its members. Whatever one feels about the specifics of the Universal Preservation Format, it cannot be denied that many archivists are reluctant to invest their time and money into migration projects that are perceived to be short-term or intermediary solutions. Because our project focuses on such crucial issues as platform independence and embedded metadata, we feel we have the start of a workable solution that will last well into the next century.

We realize that concepts touched upon in this survey may be new to you. Here is an opportunity to convey your thoughts or concerns on the development of a digital preservation standard. Based on your understanding from reading this survey and supplementary materials found on our web site, how might this project for a universal preservation format orframework benefit you or your organization? And what activities would you like to see this project undertake that perhaps have not been addressed?

I am in the position of gathering information to support the conversion of materials in an obsolete format to a useable and, at the same time, archival format. We have very limited funds so I am always looking for information that will lead me to the safest choice for our archival show tapes and field tapes. I am glad there is a project that is looking at the idea of a universal format since format wars between manufacturers have proven a nightmare for archivists in past years. This project makes me feel that there is help for those of us with no idea of where to turn as the digital age takes over. Thanks! - Allen
A cohesive standard would allow us to work intelligently with our clients to develop a range of preservation options and approaches that would provide a large measure of assurance that the digital material would be available and could be transferred to newer media with less effort than the current situation requires. - Andolsen
The benefits I see are standards that describe how different parts of an archive solution should interact.The media [that is] used and how they get there will change over time and so by defining the interaction rather than file formats you open things up to rapid development. - Baker
Most important would be to settle on the standard for picture and sound and then a medium, such as DVD as the recommended media. - Bailey
We are in the process of developing metadata requirements for digital items in our repository. A system identifing the archival or "best" version along with related and derivative digital items would be useful. - Bramel
...I preceive a time in the not too distant future that our young will no longer go to school as they do now. Digital data will be employed via virtual reality technique and the use of drugs to educate our children much better and faster than we ever imagined possible. This same technology will also make juvenile crime, prisons and jails obsolete , or almost forgotten. What and how we store digital info now will play an important part in what happens then. - Burris
We are developing a digital library, including both static media such as images and dynamic media such as streamed audio and video, using the IBM Digital Library version 2. This system will be used for three primary purposes: a) provide infrastructure for on-demand and long-term centralized multimedia storage; b) provide system for library special collections to be converted and stored in digital format; c) static and multimedia courseware. - Calarco
...my professional interest in digitization relates mainly to paper-based materials. The idea of the potential complexity and cost of preservation of other media is staggering to me. My husband works in the film department of the George Eastman House, so I have some idea of the technologies and costs involved in analogue preservation; my sense is that budget limitations will be a key limiting factor in digital preservation as well, and that the development of standards will have to take that into account, as well as the fact that technology is now unfolding so fast that whole new paradigms can be expected to come into play in even the near future, much as Web access has done in the recent past. I actually expect, in fact, that the SMPTE initiative's most important contribution may well be to create such a new paradigm, simply by concentrating on the particular needs of archival storage and access. Best of luck! - Carli
I think your project may be too narrowly focused. The survey shows a lack of understanding of preservation practices/ initiatives for archival electronic records that are already underway. Developing a format for long term preservation of graphics, video, and audio is a critical issue. But the work should build upon work that has already been done in the archival world with text and numeric data. - Conrad
I think my comments to the questions above express my concerns and the concerns of our members. While RLG does not own any collections, the collections of our approximate 160 members represent some of the greatest collections that exist. I would be very interested to work with SMPTE on this (or have the opportunity to attend meetings where possible) because the Research Libraries Group is also working with and on behalf of our members to help establish best practices for preserving and providing access to materials through digital means. The website mentioned above gives specific information about the working groups already formed, their charges and progress reports. As well, please see the Digital Initiatives portion of our web site at for further work and involvement in the evolving metadata standards. - Dale
I want to learn more about UPF itself, rather than the issues that led to this survey (yes, I'm backing out and exploring your web site next) if results or a summary of this survey are available, please share: sed@netcom.com or sean@interactivate.com good luck! - Dreilinger
I'm following your work because of my activities in the library community. What you're doing looks good; keep up the work. - Graham
I have a lot more to learn and consider! Glad you're working on this; I'm a firm believer in the usefulness of standards. - Hadley
If you find a great way to catalogue videos...and get clips onto the web...I have not yet investigated any of those avenues. It would also be nice if the TIF were standardized (or is that too much wishful thinking?) - Hamal
Will follow the progress of the project and comment at a later time. Thanks for the opportunity. - Hedges
I need to know what are the possibilities for future preservation of the images which I now provide for the teaching of art at my institution and how I can work to preserving those images in a digital world so that others can learn from the visual arts. - Hehman
Being a preservation administrator, my main concern is how long this information will last in a digital format. Recognizing that discs now have a finite life of only 30 years, digitization is not a viable preservation option at this time. If, by "universal format" you mean a standardization of digitization equipment and process, I think it would be useful. - Herrick
Question 3 under the wrapper format makes it clear that you are looking at video preservation (as one would expect from SMPTE). Don't you think that calling this a "universal preservation format" and a "digital preservation standard" is a little pushy? Do you really envision your standard working for audio, bitmapped and vector images, graphics, text, etc., etc., etc.? - Hirtle
This needs to be an international effort. - Hughes
I would like to see more opportunity for input about the capture and playback equipment itself. I am also curious if the format would still have NTSC / PAL standards or if it could/ would be truly "universal." I am also interested in the length of the tapes and what institutions would need or prefer. - Lee
In my opinion one of the major aspects of the quest for a Universal Preservation Format is the perceived (and in my opinion misdirected) need for a single format to replace the wide array of video/audio/data and multimedia formats in the marketplace. I do not feel that this is realistic or viable now or in the forseeable future. Indeed - in UNIX parlance - I see this effort as YAFF (Yet Another Format Format). Since this new format would have to be migrated to/from existing formats, would have to be used along side of existing formats for production purposes, and also be necessary to output (and input) to different media types, I see it as being yet another format that will necessarily live with formats in the video, audio, and data world... and as such I consider it largely redundant. Perhaps when we have network and local bandwidth that is so fast and storage that is so inexpensive that the network itself becomes the "virtual archive" because it is just as fast as and as cheap as local storage ... this idea would be more appealing to me. Indeed, with RAID mirroring type methodology to ensure reliable data reconstruction from different data storage node sites on very fast data networks - one could have a "virtual archive" and a distributed mediaLESS way of storing information.... that would be the time to seriously consider these issues - but I am afraid that these advances are more than a few years away - and as such I fear that this effort will spawn YAFF that will have to be supported - along with all the others thereby just making matters worse. One aspect that I do find appealing, however, is the storage of other data associated with the a/v records in the same way as the a/v records. For instance shot lists, EDL, and storyboards could all be stored along with the different tagged versions of the a/v material. Good luck on the project. - Lindner
Just as traditional libraries have been doing for centuries, digital libraries will have to contain the world (even worlds). To do so digitally means to model the world's structure with conceptual maps and other frameworks. That task, awesome though it may be, is not as terrible as handling exceptions to that overarching framework. I believe it was Pope who said: "The exception tests the rule." We see that over and over in libraries, and new media are not new in that regard. - Lowe
Education. Especially to library schools like mine: San Jose State University School of Library and Information Science. Workshops. Information about costs, groups, opportunities, training. How do we get from vinegar syndrome in a metal can to metadata streaming? - Lucas
First of all, we would like to preserve our material. Secondly, if we had our material in a digital form, we could provide easy acces to our users and other people in Greece to our Collection. Additionally, we could make links with other collections and provide a more wholesome aid to the researchers. - Mamalinga
I think a standard needs to be developed that will easily accomodate changes in technology, and I think managers and administrators need to be sold on it, not just archivists and records managers. - McLemore
As an art conservator, I am concerned that any preservation effort for electronic media attempts to document and preserve an artifact's integrity and original artistic intent. While I understand that for many electronic artifacts, preservation of the "native" format will be impractical for the purposes of preservation, I am also convinced that there is value in preserving the intrinsic characteristic of the original to whatever extent is possible. For instance there are certain attributes of 2" video and that are part of the material record of a piece and will remain significant even if the piece (again for preservation purposes) is long disembodied from the original 2" tape. Therefore, the material changes that occur as visual information is preserved should be documented. This documentation (including any visual enhancements, artifacts or glitches that occur as part of a preservation program) should become part of a permanent record always associated with the piece. Therefore, a video scholar looking at a 1970's video in a 100 years will not have to ask the question "Am I looking at visual content actually dating from the 70's or am I seeing the cumulative result of manipulations made over time to preserve the piece?" - Messier
A concern that I don't see here is the need to automate these cataloguing activities. It just is not feasible in many cases to manually enter all the info needed for each object in a large collection. Unless a new standard can be reasonably implemented, it is hard to see how it can be entirely useful. (I don't have the answer, but it is a concern of mine.) Automatic identification of image feature information is an active research area in my own project - content analysis using computer vision techniques. I would suggest that any standards for images - still or otherwise- should be flexible enough to allow this kind of info to be stored as it becomes available. - Ogle
Our organization is committed to preserving the history of chemistry. To do that we must preserve not only books and photos, but personal papers, video, film, sound recordings and objects. As a museum and a library, it would probably be enough to preserve and provide access to materials where possible and digitize only the most fragile items. But in our time it is possible to do more. The oldest book we have may be 200 years old. Are we going to be able to do better than that with the pictures and documents and in your case, film and audio, we are producing today? How much longer can we preserve items that are hundreds of years old? We Archivists, historians and librarians are in it for the long haul and we need standards that are going to be sustainable for centuries. - Pollock
Any "universilizing" and standardizing of methods for conversion and storage of data and media is welcome if for only one reason: It allows for more time and energy to be devoted to the creative process, which includes the research process necessary to stimulate new ideas. - Reilly
I am currently researching digital preservation in order to draw up a list of guidelines for our users. This would include a section on best practice. Consequently ANY information on preservation is useful to the Visual Arts Data Service, especially since we will be archiving datasets. We will need to establish our own set of rules and regulations for digital preservation and ensure that these will not become obsolete over time. - Rymer
What type of computer should i buy to be ready for any computer-based archiving technology that is about to come out? Should I keep digitizing clips using jpg and other apple applications? Thinking about a mac 9300 when we get some money. - Schwan
First, we need to talk about definitions. "Archival" means capable of lasting 200 years. I am a preservation librarian. - Sinclair
I feel like I jumped in on the middle of a discussion. My main concerns for the collection are accessibility and preservation. I see a real need to transfer or copy the collection onto a usable format so the films can be used in a classroom setting. I also want to make sure that the original, master film survives. I would like to have special curatorial training so I can make more informed decisions about the collection. - Skarstad
It's clear that a universal preservation format would solve most of the important problems in the area of digital preservation that archivists are facing today. There's a shift in preservation theory because of the technologies involved in digital-format documents, and much of it involves copying and migrating data because the physical supports are not stable enough for long-term storage. A universal physical support that is stable would relieve the first problem, and a complex and thorough enough metadata format would relieve the migration problem--in other words, documents would become independent of hardware and software. This can only benefit everyone who works with archival materials. It's as if we're presently in the midst of a huge set of problems that seemingly have no solution, and some kind of universal preservation format would finally signal the arrival at some kind of plateau at which we could stop worrying so much and get on with our work. I'll be watching this work with much interest. - Turner
I see this project for a "Universal Preservation Format or Framework" as being an opportunity to create a focus-point, a standard to which the caretakers of all media may resort. Thus, I believe it should be capable of handling all media. I will bookmark this site and try to become more aware of your efforts. - Vanderlinden
It would be useful to create a common language (if possible) that included terminology familiar to archivists/preservationists and the terminology familiar to computer/web/digital professionals. It is still difficult for various professionals to talk to each other. - Vetter
Not clear on the benefits for us. That reflects two things: - not sure how well the approach extends beyond moving image collections. I'm sure that if it works it will be applicable to some of material we are dealing with, but we haven't thought the idea through sufficiently to see immediately how it will work for us. - some concerns about its workability. Seems to be a good way of reconciling a diversity of standards at acquisition so that a limited number of formats have to be dealt with for preservation and access, but it requires an input of standard data formats that can be easily converted and it only expands the migration period, rather than eliminating the technological obsolescence problem. Still, the preservation of digital collections is almost certain to be made up of a patchwork of interim solutions that buy us a bit (or byte) more time. Perhaps I would be more comfortable if it was described as a "Universal Archiving Framework". We need to learn a lot more about UPF before making any sort of definitive judgments for this institution. Thank you for the opportunity to comment. - Webb
The problem that I don't understand how it will be addressed is one of copyright and ownership of material once digitized. in this regard, would authentication stamps be available to determine if digitized data had been doctored? - Weiss
Since we exist in a world where interdependence is critical, perhaps the achievement of a standard is more important than all of the specifics within the standard. Whatever happens we will have an increasing need for standards so that we can ensure preservation and access for the future. - Wilding
I can tell from some questions in this survey that my knowledge of digital standards and new developments is lacking. I would like AMIA to develop coursework, for conferences or on line, to help others like me learn more about the concepts and ideas you introduced in your survey and supplementary documents. - Wilson
Until we can move from a 'hold-it-in-your-hand' (physical) format onto a fully 'down-the-wire'method of working, we will be faced with format conversions of the physical items -- meaning there won't be a preservation format, only a succession of physical formats incorporating (hopefully) more and more of the metadata and structure required by the ultimate 'down-the-wire' format. - Wright



Followup Questions

Should the UPF require that the source code used to read data be encapsulated in the storage format? Or should a Recommended Practice allow the option of storing the Rosetta stone as a separate file that might be executable code but would be required to operate on a set of specified platforms? Should the Rosetta stone simply be an ascii file of pre-compiled code and the compiler requirements stored separately or contained within a larger application? Should the UPF recommend a list of specific file formats that would be defined by the Rosetta stone?

Related to the Rosetta stone is the issue of platform independence. In respect to a digital archives, how important is platform independence to the concept of a digital preservation format? Is this a "wait and see" issue, or should the UPF make a definitive statement on the requirement of platform-independence?

How should unique identifers be used for archives? Should there be a central registration body for these identifications? At what level should material be tagged? Should the unique identifiers be allocated to one storage area?

What is the solution for organizations with limited human resources to maintain a wide range of skills? What are the implications of technical expertise upon a universal preservation format, or upon any techincal standard adopted for digital materials? What would be the process for "updating" the Rosetta stone? Another way of asking this question: Should a Recommended Practice advocate technological "Ease-of-Use"?

As we presented our initiative to groups concerned with preserving analog media, we grew aware of a crippling gap between the language used by archivists and the language adopted by digital communities. Words casually chosen by digital initiatives often have deep analog roots. As a result, we wish to preface any Recommended Practice with a glossary of terms that might help bridge the analog-digital domains. Where do we find acceptable source definitions for such a cross-domain dictionary?

One of the points we try to emphasize to archivist groups is that the UPF would help carry on the traditional practices of archivists and librarians by providing a robust framework for digital materials. For example, by incorporating such concepts as unique identifiers and by "gluing" certains kinds of cataloging information to the stored media, we hoped to perpetuate the two principles that are the foundation of standard archival practice: Provenance and Original Order. We also realize that many variations of these practices have evolved, and that no two archives will follow the same guidelines. Are archiving practices standardized enough to be used as a model or metaphor for designing digital archiving practices? Or is it permissible to explore entirely new methods?

Is the available technology adequate to address the problem of digital storage?
Storage technology evolves at a dizzying pace. Recently, we looked at PaperDisk from Cobblestone Software, technology that prints digital information on plain paper through a common laser or inkjet printer, then reads it back into a computer through a standard flatbed or hand-held scanner. We have also heard about using DVD as an archival storage medium. More exciting for storage capacity and permanance is the Pancake Disc developed by Los Alamos National Laboratory. Norsam Technologies, which sells this product, claims that Pancake Disks will last thousands of years without any special storage requirements. Norsam is issuing two products on the Pancake this, one for digital storage and one for analog storage. The analog product, called HD-Rosetta, is an intriguing variation on microfiche technology. You access the stored records as a series of images through a high-powered microscope. The Company's HD-ROSETTA writer can record approximately 90,000 8 * x 11 analog images (in an "xy" coordinate format which it proposes to market primarily as a historical preservation and archival data storage product) on a two-inch Pancake Disc. The other product, called the HD-ROM, developed in a joint venture with IBM, can store 650 gigabytes, which is 1,000 times the capacity of today's CD-ROMs, and may eventually hold up to 12 Terabytes or 12 thousand gigabytes. Norsam's technology uses charged particle beams, rather than laser beams, to write data onto disks.

When the problem of physical storage is resolved, does compression cease to be an issue for archival storage? If a physical storage media can be virtually limitless, then why not store everything in uncompressed format? Or are there other issues to consider, such as speed of retrieval?

Can the Universal Preservation Format in practical terms be envisioned as a "file interchange format" as well as a "storage format"? A "blue sky" use of the UPF would be that "select sets" within a software application, such as an asset management program or even a video editing application, might be saved or exported in UPF in the same way text files are saved or exported in RTF or ASCII. Records in this format could be saved to, say, a zip drive or tape or burned into CD-ROM, and then imported into another program that understands UPF.