{"id":507,"date":"2016-06-14T16:33:32","date_gmt":"2016-06-14T15:33:32","guid":{"rendered":"http:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/?p=507"},"modified":"2016-06-20T08:28:21","modified_gmt":"2016-06-20T07:28:21","slug":"adventures-in-audiovisual-digitisation-part-3","status":"publish","type":"post","link":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/adventures-in-audiovisual-digitisation-part-3\/","title":{"rendered":"Adventures in audiovisual digitisation* (part 3)"},"content":{"rendered":"<p>*Not really digitisation, more digital transfer<\/p>\n<p>Because many of our depositors (comedians, promoters and producers) have worked on television and radio, we have been given copies of their contributions to these programmes as part of their <a href=\"https:\/\/www.kent.ac.uk\/library\/specialcollections\/standupcomedy\/index.html\" target=\"_blank\">collections<\/a>. Material has been deposited on CD (both audio CD and CD-R) and on DVD. This material usually consists of contributor copies given to them by the broadcaster or production company, although we do have a few &#8216;off-air&#8217; recordings. We&#8217;ve also received published material, such as recordings of specific shows, tours, or compilations the depositor has appeared on, on cassette, audio CD and DVD.<\/p>\n<p>In this post I will focus on how we are capturing audio and video material deposited on CD and DVD, which Richard Wright nicely describes as \u2018digital content not in files\u2019 (<a href=\"http:\/\/dx.doi.org\/10.7207\/twr12-01\" target=\"_blank\">page 9<\/a>). 
&#8216;Digital content not in files&#8217;\u00a0refers to\u00a0digital recordings which require specific technology and workflows to move the sound\/images from their\u00a0dedicated physical carriers (such as DAT, minidisc, and DV formats, as well as material held on optical media, such as audio CDs, CD-R and DVDs) into digital files (<a href=\"http:\/\/dx.doi.org\/10.7207\/twr12-01\" target=\"_blank\">page 3<\/a>).<\/p>\n<div id=\"attachment_94\" style=\"width: 635px\" class=\"wp-caption alignnone\"><a href=\"http:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2015\/02\/MarkThomas_Sheffield2009.jpg\" rel=\"attachment wp-att-94\"><img aria-describedby=\"caption-attachment-94\" loading=\"lazy\" class=\"wp-image-94 size-large\" src=\"http:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2015\/02\/MarkThomas_Sheffield2009-1012x1024.jpg\" alt=\"CD from the Mark Thomas Collection of a recording from the Sheffield leg of his 2009 'It's the Stupid Economy' tour\" width=\"625\" height=\"632\" srcset=\"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2015\/02\/MarkThomas_Sheffield2009-1012x1024.jpg 1012w, https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2015\/02\/MarkThomas_Sheffield2009-297x300.jpg 297w, https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2015\/02\/MarkThomas_Sheffield2009-624x631.jpg 624w, https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2015\/02\/MarkThomas_Sheffield2009.jpg 1458w\" sizes=\"(max-width: 625px) 100vw, 625px\" \/><\/a><p id=\"caption-attachment-94\" class=\"wp-caption-text\">CD from the Mark Thomas Collection of a recording from the Sheffield leg of his 2009 &#8216;It&#8217;s the Stupid Economy&#8217; tour<\/p><\/div>\n<p>Whilst previously optical media was seen as a preservation medium and storage solution, it is now recognized that optical discs are an &#8216;at-risk&#8217; format (see\u00a0<a href=\"http:\/\/www.avpreserve.com\/wp-content\/uploads\/2014\/04\/OpticalMediaPreservation.pdf\" 
target=\"_blank\">&#8216;An Introduction to Optical Media Preservation&#8217;<\/a> by Alex Duryee), and so we have been transferring any high-priority material deposited on optical media (mostly unpublished material on re-writable discs) to digital files.<\/p>\n<p>Our approach to this material has varied depending on the format. For material on audio CDs and CD-Rs we have treated the physical format as a file carrier, a way to share and transport audio files, and we view the content on the disc as the important thing to capture (rather than the structure of the disc). However, for DVDs, which are more structured (often with a menu), we have created a disc image, which we can then open in tools such as VLC.<\/p>\n<p><strong>Audio CDs (Compact Disc Digital Audio \/ CD-DA)<\/strong><\/p>\n<p>Audio CDs hold data in the Compact Disc Digital Audio (CD-DA) format. The audio is stored as a pulse-code modulation (PCM) stream: two channels, 16-bit, sampled at 44.1 kHz. When an audio CD is placed in a disc drive, Windows presents each track as a file with the extension .cda.<\/p>\n<p>After consideration we decided not to capture audio CDs using a disc imaging workflow, but to extract the audio data and save it as WAVE files. We made this decision based on a number of factors.<\/p>\n<ol>\n<li>Firstly, it was the audio data itself which was important to us, rather than the structure of the disc.<\/li>\n<li>Secondly, the discs we had were uncomplicated; many of the audio CDs contained only two .cda files (one of which was often a radio tone\/test track) or were collections of edited tracks from live shows put onto a CD (but not published). 
Note that we prioritised material deposited on &#8216;unpublished&#8217; (often re-writable) CDs and DVDs; we have not transferred any deposited material which has been published and is on mass-replicated discs.<\/li>\n<li>Thirdly, and I think it is only honest to say this, disc imaging audio CDs seemed rather complicated and unnecessary for the relatively small number of discs within our collection. I&#8217;m slightly ashamed to say that this goes against the guidance provided by <a href=\"http:\/\/www.avpreserve.com\/wp-content\/uploads\/2014\/04\/OpticalMediaPreservation.pdf\" target=\"_blank\">AVPreserve<\/a>, the <a href=\"http:\/\/openpreservation.org\/blog\/2013\/11\/19\/establishing-workflow-model-audio-cd-preservation\/\" target=\"_blank\">Open Preservation Foundation<\/a>, and <a href=\"http:\/\/arxiv.org\/ftp\/arxiv\/papers\/1309\/1309.4932.pdf\" target=\"_blank\">the DPC\/British Library<\/a>, and I would gladly be corrected if the digital preservation and archiving community thinks we should change our workflow! I would also be interested to hear from other small archives who are undertaking this sort of work, and whether they have disc imaged their CDs or taken a similar route to us.<\/li>\n<\/ol>\n<p>Instead of disc imaging, we extracted the audio data using Adobe Audition (a tool we were using for digitising our sound cassettes and MiniDiscs), with the read speed set low in order to give results as accurate as possible. The data was originally written to the disc as 16-bit\/44.1 kHz PCM, so we extracted it at the same specification and used the WAVE (.wav) wrapper. The structure of the audio CD was maintained using filenames (numbered sequentially by track on the disc) and through metadata which we embedded in BWF format (using the <a href=\"http:\/\/bwfmetaedit.sourceforge.net\/\" target=\"_blank\">BWF MetaEdit tool<\/a>).<\/p>\n<p>We have also received CD data discs containing MP3 files. 
Although MP3 is not an archival format, the sound files are already compressed, and saving them as WAV files would only increase the file size, not the quality. MP3 is so widely used that it is unlikely to become obsolete in the immediate future, and so poses little immediate preservation risk. We have been capturing the MP3s through audio editing software, either Adobe Audition or Audacity. We export through software, rather than copying straight from the disc, because the software has an error correction element which helps to prevent errors during the export\/copy.<\/p>\n<p><strong>DVDs<\/strong><\/p>\n<p>With rewritable media accessioned into the British Stand-Up Comedy Archive collections (such as hard drives and floppy disks), or media which has inbuilt menu functionality (i.e. DVDs), we thought it was important to create a disc image, a sector-by-sector copy, as part of the process of digitally preserving the original accession. Our aim was to:<\/p>\n<ul>\n<li>Ensure that the discs\/drives are free from viruses<\/li>\n<li>Capture an \u2018image\u2019 of the disc\/drive, showing the structure of the files (including folder structure) on the original disc as it was when deposited with BSUCA<\/li>\n<li>Secure the contents of the disc\/drive (i.e. the documents\/files on the disc itself)<\/li>\n<\/ul>\n<p>We have used the free version of <a href=\"http:\/\/www.isobuster.com\/\" target=\"_blank\">IsoBuster<\/a> to image DVDs, and with this tool created an .iso file and a .cue file. 
A complete disc image (.iso file) serves as the preservation master, and from the .iso file we have then created an access copy as an MP4 (H.264) file, using VLC, for use in our reading room.<\/p>\n<div id=\"attachment_519\" style=\"width: 635px\" class=\"wp-caption alignnone\"><a href=\"http:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2016\/06\/disc-imaging-isobuster.jpg\" rel=\"attachment wp-att-519\"><img aria-describedby=\"caption-attachment-519\" loading=\"lazy\" class=\"wp-image-519 size-large\" src=\"http:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2016\/06\/disc-imaging-isobuster-1024x719.jpg\" alt=\"Creating disc images of DVDs using IsoBuster\" width=\"625\" height=\"439\" srcset=\"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2016\/06\/disc-imaging-isobuster-1024x719.jpg 1024w, https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2016\/06\/disc-imaging-isobuster-300x211.jpg 300w, https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2016\/06\/disc-imaging-isobuster-768x539.jpg 768w, https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2016\/06\/disc-imaging-isobuster-624x438.jpg 624w, https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/files\/2016\/06\/disc-imaging-isobuster.jpg 1531w\" sizes=\"(max-width: 625px) 100vw, 625px\" \/><\/a><p id=\"caption-attachment-519\" class=\"wp-caption-text\">Creating disc images of DVDs using IsoBuster<\/p><\/div>\n<p>Next time&#8230; how we have been digitising VHS and transferring material on DVCAM and MiniDV.<\/p>\n<p><strong>Further reading and helpful links<\/strong><\/p>\n<p>\u2018Preserving Moving Pictures and Sound\u2019, Richard Wright, DPC [Digital Preservation Coalition] Technology Watch Report 12-01 March 2012, <a href=\"http:\/\/dx.doi.org\/10.7207\/twr12-01\" target=\"_blank\"><span style=\"color: #0066cc\">http:\/\/dx.doi.org\/10.7207\/twr12-01<\/span><\/a><\/p>\n<p>&#8216;An Introduction to Optical Media Preservation&#8217;, Alex Duryee, AVPreserve, <a 
href=\"http:\/\/www.avpreserve.com\/wp-content\/uploads\/2014\/04\/OpticalMediaPreservation.pdf\" target=\"_blank\"><span style=\"color: #0066cc\">http:\/\/www.avpreserve.com\/wp-content\/uploads\/2014\/04\/OpticalMediaPreservation.pdf<\/span><\/a><\/p>\n<p align=\"LEFT\">&#8216;Developing a Robust Migration Workflow for Preserving and Curating Hand-held Media&#8217;, Angela Dappert, Andrew Jackson, Akiko Kimura <a href=\"http:\/\/arxiv.org\/ftp\/arxiv\/papers\/1309\/1309.4932.pdf\" target=\"_blank\">http:\/\/arxiv.org\/ftp\/arxiv\/papers\/1309\/1309.4932.pdf<\/a><\/p>\n<p align=\"LEFT\">&#8216;Establishing a Workflow Model for Audio CD Preservation&#8217;, Tonisant, Open Preservation Foundation blog, <a href=\"http:\/\/openpreservation.org\/blog\/2013\/11\/19\/establishing-workflow-model-audio-cd-preservation\/\">http:\/\/openpreservation.org\/blog\/2013\/11\/19\/establishing-workflow-model-audio-cd-preservation\/<\/a><\/p>\n<p align=\"LEFT\">\n","protected":false},"excerpt":{"rendered":"<p>*Not really digitisation, more digital transfer Because many of our depositors (comedians, promotors and producers) have worked on television and radio we have been given copies of their contributions to these programmes as part of their collections.\u00a0 Material has been deposited on\u00a0CD (both audio cd and CD-R), and on DVDs.\u00a0This material is usually\u00a0contributor copies that 
[&hellip;]<\/p>\n","protected":false},"author":40164,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[140922,140836,140830],"tags":[608,140888,140913,140914,95415,140830,140917,140919,140915,140918,140921,140920],"_links":{"self":[{"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/posts\/507"}],"collection":[{"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/users\/40164"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/comments?post=507"}],"version-history":[{"count":21,"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/posts\/507\/revisions"}],"predecessor-version":[{"id":533,"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/posts\/507\/revisions\/533"}],"wp:attachment":[{"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/media?parent=507"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/categories?post=507"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.kent.ac.uk\/standupcomedyarchive\/wp-json\/wp\/v2\/tags?post=507"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}