emulation

Future tape archaeology: speculations on the emulation of analogue environments

At the recent Keeping Tracks symposium held at the British Library, AV scoping analyst Adam Tovell stated that

‘there is consensus internationally that we as archivists have a 10-20 year window of opportunity in which to migrate the content of our physical sound collections to stable digital files. After the end of this 10-20 year window, general consensus is that the risks faced by physical media mean that migration will either become impossible or partial or just too expensive.’

This point of view certainly corresponds to our experience at Greatbear. As collectors of a range of domestic and professional video and audio tape playback machines, we are aware of the particular problems posed by machine obsolescence. Replacement parts can be hard to come by, and the engineering expertise needed to fix machines is becoming esoteric wisdom. Tape degradation is of course a problem too. These combined factors influence the shortened horizon of magnetic tape-based media.

All may not be lost, however, if we are take heart from a recent article which reported the development of an exciting technology that will enable memory institutions to recover recordings made over 125 years ago on mouldy wax cylinders or acid-leaching lacquer discs.

IRENE (Image, Reconstruct, Erase Noise, Etc.), developed by physicist Carl Haber at the Lawrence Berkeley National Laboratory, is a software programme that ‘photographs the grooves in fragile or decayed recordings, stitches the “sounds” together with software into an unblemished image file, and reconstructs the “untouchable” recording by converting the images into an audio file.’

The programme was developed by Haber after he heard a radio show discuss the Library of Congress’ audio collections that were so fragile they risked destruction if played back. Haber speculated that the insights gained from a project he was working on could be used to recover these audio recordings. ‘“We were measuring silicon, why couldn’t we measure the surface of a record? The grooves at every point and amplitude on a cylinder or disc could be mapped with our digital imaging suite, then converted to sound.”’

For those involved in the development of IRENE, there was a strong emphasis on the benefits of patience and placing trust in the inevitable restorative power of technology. ‘It’s ironic that as we put more time between us and the history we are exploring, technology allows us to learn more than if we had acted earlier.’

Can such a hands-off approach be applied to magnetic tape based media? Is the 10-20 year window of opportunity described by Tovell above unnecessarily short? After all, it is still possible to playback wax cylinder recordings from the early 20th century which seem to survive well over long periods of time, and magnetic tape is far more durable than is commonly perceived.

In a fascinating audio recording made for the Pitt Rivers Museum in Oxford, Nigel Bewley from the British Library describes how he migrated wax cylinder recordings that were made by Evans Pritchard in 1928-1930 and Diamond Jenness in 1911-1912. Although Bewley reveals his frustration in the preparation process, he reveals that once he had established the size of stylus and rotational speed of the cylinder player, the transfer was relatively straightforward.

You will note that in contrast with the recovery work made possible by IRENE, the cylinder transfer was made using an appropriate playback mechanism, examples of which can accessed on this amazing section of the British Library’s website (here you can also browse through images and information about disc cutters, magnetic recorders, radios, record players, CD players and accessories such as needle tins and headphones – a bit of a treasure trove for those inclined toward media archaeology).

Perhaps the development of the IRENE technology will mean that it will no longer be necessary to use such ‘authentic’ playback mechanisms to recover information stored on obsolete media. This brings us neatly to the question of emulation.

Emulation

Insides of a beta-hi-fi machine

If we assume that all the machines that play back magnetic tape become irrevocably obsolete in 10-20 years, what other potential extraction methods may be available? Is it possible that emulation techniques, commonly used in the preservation of born-digital environments, can be applied to recover the recorded information stored on magnetic tape?

In a recent interview Dirk Von Suchodoletz explains that:

‘Emulation is a concept in digital preservation to keep things, especially hardware architectures, as they were. As the hardware itself might not be preservable as a physical entity it could be very well preserved in its software reproduction. […] For memory institutions old digital artifacts become more easy to handle. They can be viewed, rendered and interacted-with in their original environments and do not need to be adapted to our modern ones, saving the risk of modifying some of the artifact’s significant properties in an unwanted way. Instead of trying to mass-migrate every object in the institution’s holdings, objects are to be handled on access request only, significantly shifting the preservation efforts.’

For the sake of speculation, let us imagine we are future archaeologists and consider some of the issues that may arise when seeking to emulate the operating environments of analogue-based tape media.

To begin with, without a working transport mechanism which facilitates the transmission of information, the emulation of analogue environments will need to establish a circuitry that can process the Radio Frequency (RF) signals recorded on magnetic tape. As Jonathan Sterne reflects, ‘if […] we say we have to preserve all aspects of the platform in order to get at the historicity of the media practice, that means archival practice will have to have a whole new engineering dimension to it.’

Yet with the emulation of analogue environments, engineering may have to be a practical consideration rather than an archival one. For example, some kind of transport mechanism would presumably have to be emulated through which the tape could be passed through. It would be tricky to lay the tape out flat and take samples of information from its surface, as IRENE’s software does to grooved media, because of the sheer length of tape when it unwound. Without an emulated transport mechanism, recovery would be time consuming and therefore costly, a point that Tovell intimates at the beginning of the article. Furthermore, added time and costs would necessitate even more complex selection and appraisal decisions on behalf of archivists managing in-operative magnetic tape-based collections. Questions about value will become fraught and most probably politically loaded. With an emulated transport mechanism, issues such as tape vulnerability and head clogs, which of course impact on current migration practices, would come into play.

Audio and video differences

On a technical level emulation may be vastly more achievable for audio where the signal is recorded using a longitudinal method and plays back via a relatively simple process. Audio tape is also far less propriety than video tape. On the SONY APR-5003V machine we use in the Greatbear Studio for example, it is possible to play back tapes of different sizes, speeds, brands, and track formations via adjustments of the playback heads. Such versatility would of course need to be replicated in any emulation environment.

helical scanThe technical circuitry for playing back video tape, however, poses significantly more problems. Alongside the helical scan methods, which records images diagonally across the video tape in order to prevent the appearance of visible joints between the signal segments, there are several heads used to read the components of the video signal: the image (video), audio and control (synch) track.

Unlike audio, video tape circuitry is more propriety and therefore far less inter-operable. You can’t play a VHS tape on a U-Matic machine, for example. Numerous mechanical infrastructures would therefore need to be devised which correspond with the relevant operating environments – one size fits all would (presumably) not be possible.

A generic emulated analogue video tape circuit may be created, but this would only capture part of the recorded signal (which, as we have explored elsewhere on the blog, may be all we can hope for in the transmission process). If such systems are to be developed it is surely imperative that action is taken now while hardware is operative and living knowledge can be drawn upon in order to construct emulated environments in the most accurate form possible.

While hope may rest in technology’s infinite capacity to take care of itself in the end, excavating information stored on magnetic tape presents far more significant challenges when compared with recordings on grooved media. There is far more to tape’s analogue (and digital) circuit than a needle oscillating against a grooved inscription on wax, lacquer or vinyl.

The latter part of this article has of course been purely speculative. It would be fascinating to learn about projects attempting to emulate the analogue environment in software – please let us know if you are involved in anything in the comments below.

Posted by debra in audio tape, audio technology, machines, equipment, video tape, video technology, machines, equipment, 0 comments

Digitisation strategies – back up, bit rot, decay and long term preservation

In a blog post a few weeks ago we reflected on several practical and ethical questions emerging from our digitisation work. To explore these issues further we decided to take an in-depth look at the British Library’s Digital Preservation Strategy 2013-2016 that was launched in March 2013. The British Library is an interesting case study because they were an ‘early adopter’ of digital technology (2002), and are also committed to ensuring their digital archives are accessible in the long term.

Making sure the UK’s digital archives are available for subsequent generations seems like an obvious aim for an institution like the British Library. That’s what they should be doing, right? Yet it is clear from reading the strategy report that digital preservation is an unsettled and complex field, one that is certainly ‘not straightforward. It requires action and intervention throughout the lifecycle, far earlier and more frequently than does our physical collection (3).’

The British Library’s collection is huge and therefore requires coherent systems capable of managing its vast quantities of information.

‘In all, we estimate we already have over 280 terabytes of collection content – or over 11,500,000 million items – stored in our long term digital library system, with more awaiting ingest. The onset of non-print legal deposit legislation will significantly increase our annual digital acquisitions: 4.8 million websites, 120,000 e-journal articles and 12,000 e-books will be collected in the first year alone (FY 13/14). We expect that the total size of our collection will increase massively in future years to around 5 petabytes [that’s 5000 terabytes] by 2020.’

All that data needs to be backed up as well. In some cases valuable digital collections are backed up in different locations/ servers seven times (amounting to 35 petabytes/ 3500 terabytes). So imagine it is 2020, and you walk into a large room crammed full of rack upon rack of hard drives bursting with digital information. The data files – which include everything from a BWAV audio file of a speech by Natalie Bennett, leader of the Green Party after her election victory in 2015, to 3-D data files of cunieform scripts from Mesopotamia, are constantly being monitored by algorithms designed to maintain the integrity of data objects. The algorithms measure bit rot and data decay and produce further volumes of metadata as each wave of file validation is initiated. The back up systems consume large amounts of energy and are costly, but in beholding them you stand in the same room as the memory of the world, automatically checked, corrected and repaired in monthly cycles.

Such a scenario is gestured toward in the British Library’s long term preservation strategy, but it is clear that it remains a work in progress, largely because the field of digital preservation is always changing. While the British Library has well-established procedures in place to manage their physical collections, they have not yet achieved this with their digital ones. Not surprisingly ‘technological obsolescence is often regarded as the greatest technical threat to preserving digital material: as technology changes, it becomes increasingly difficult to reliably access content created on and intended to be accessed on older computing platforms.’ An article from The Economist in 2012 reflected on this problem too: ‘The stakes are high. Mistakes 30 years ago mean that much of the early digital age is already a closed book (or no book at all) to historians.’

Destroyed Hard Drive

There are also shorter term digital preservation challenges, which encompass ‘everything from media integrity and bit rot to digital rights management and metadata.’ Bit rot is one of those terms capable of inducing widespread panic. It refers to how storage media, in particular optical media like CDs and DVDs, decay over time often because they have not been stored correctly. When bit rot occurs, a small electric charge of a ‘bit’ in memory disperses, possibly altering program code or stored data, making the media difficult to read and at worst, unreadable. Higher level software systems used by large institutional archives mitigate the risk of such underlying failures by implementing integrity checking and self-repairing algorithms (as imagined in the 2020 digital archive fantasy above). These technological processes help maintain ‘integrity and fixity checking, content stabilisation, format validation and file characterisation.’

Archival Gold Disc

300 years, are you sure?

Preservation differences between analogue and digital media

The British Library isolate three main areas where digital technologies differ from their analogue counterparts. Firstly there is the issue of ‘proactive lifestyle management‘. This refers to how preservation interventions for digital data need to happen earlier, and be reviewed more frequently, than analogue data. Secondly there is the issue of file ‘integrity and validation.’ This refers to how it is far easier to make changes to a digital file without noticing, while with a physical object it is usually clear if it has decayed or a bit has fallen off. This means there are greater risks to the authenticity and integrity of digital objects, and any changes need to be carefully managed and recorded properly in metadata.

Finally, and perhaps most worrying, is the ‘fragility of storage media‘. Here the British Library explain:

‘The media upon which digital materials are stored is often unstable and its reliability diminishes over time. This can be exacerbated by unsuitable storage conditions and handling. The resulting bit rot can prevent files from rendering correctly if at all; this can happen with no notice and within just a few years, sometimes less, of the media being produced’.

A holistic approach to digital preservation involves taking and assessing significant risks, as well as adapting to vast technological change. ‘The strategies we implement must be regularly re-assessed: technologies and technical infrastructures will continue to evolve, so preservation solutions may themselves become obsolete if not regularly re-validated in each new technological environment.’

Establishing best practice for digital preservation remains a bit of an experiment, and different strategies such as migration, emulation and normalisation are tested to find out what model best helps counter the real threats of inaccessibility and obsolescence we may face in 5-10 years from now. What is encouraging about the British Library’s strategic vision is they are committed to ensuring digital archives are accessible for years to come despite the very clear challenges they face.

Posted by debra in audio tape, video tape, 0 comments