Categories
Collection databases Digitisation Web 2.0 Young people & museums

Demspey on ‘getting with the flow’, Morville on ‘findability’

OCLC’s Lorcan Dempsey’s idea of libraries “getting with the flow” (from 2005) is something that has resonated well beyond the library world.

The importance of flow underlines recurrent themes:

– the library needs to be in the user environment and not expect the user to find their way to the library environment

– integration of library resources should not be seen as an end in itself but as a means to better integration with the user environment, with workflow.

Increasingly, the user environment will be organized around various workflows. In fact, in a growing number of cases, a workflow application may be the consumer of library services.

For libraries, as evidenced also in the discussions by Holly Witchey at Musematic who has been covering the Webwise IMLS conference with regular session reports, and Guenter Waibel from RLG’s follow-up commentary, libraries are at a far more pointy end of changes in customer/user behaviour than most museums. Waibel raises the very hefty 290 page OCLC report titled Perceptions in which the survey suggests 84% of general users begin an information search with a search engine, and only 1% with a library website (PDF page 35/1-17). If conducted again now I would expect Wikipedia to rate highly.

Libraries are seen as more trustworthy/credible and as providing more accurate information than search engines. Search engines are seen as more reliable, cost-effective, easy to use, convenient and fast. (PDF page 70/2-18)

Where are museums in this? Is your content in the “flow”? Do users need to come to your site to your onsite search to be able to find it? If so, they are probably going to look elsewhere first, if they haven’t already.

Over at the University of Minnesota they have just held the CLC Library Conference titled “Getting In The Flow” with Dempsey as one of the speakers. There are some great summaries of the presentations including slides over in their conference blog.

Other than Dempsey one of their speakers was Peter Morville who some readers may remember from his first O’Reilly book Information Architecture for the World Wide Web, or the less technically oriented
Ambient Findability (which has been doing the rounds of the office for the past 9 months).

Morville’s presentation slides are an excellent introduction to his work and given their tweaking for the library/information-seeking context are very useful for those in museums too. Ellysa Cahoy has some notes taken during the presentation at the CLC blog as well for the slides that aren’t immediately self-explanatory.

Categories
Collection databases Web 2.0

OPAC2.0 – popular collection categories

In preparation for my presentation at Museums & the Web we have been busy generating a new set of user statistics from our collection database. (Which is also why the frequency of new posts has dropped!)

Objects in the collection, when ‘fully catalogued’ are assigned an object category and an object name from the Powerhouse Museum thesaurus which was first published in 1995 (ISBN 186317060X). This thesaurus is used by other museums as well.

Here are updated top 20 ‘popularity’ tables, the first by object, and the second by category. The top 20 categories indicates the broad collecting areas which receive most interest online.

Top 20 most viewed objects since launch (June 2006)

1 – (17087 views) 2005/1/1 Evening dress, beaded pink chiffon trimmed with charms, designed by Lisa Ho and made in the …
2 – (8875) 94/129/1 Evening dress, womens, `Chocolate box’, plastic / fabric, designed by Jenny Bannister for C …
3 – (7029) 95/23/1 Dress, evening, silk / polyester, designed by Jenny Bannister, Melbourne, Victoria, Australi …
4 – (6636) B1495 Aircraft, flying boat, Catalina, PB2B-2, “Frigate Bird II”, VH-ASA, metal / fabric, made by Bo …
5 – (6133) 88/4 Steam locomotive, No. 3830, iron/steel/brass, New South Wales Government Railways, Eveleigh Rai …
6 – (5504) 97/208/1 Shoes, pair, womens, ‘Super elevated gillies’, leather/ cork/ silk, Autumn/ Winter collecti …
7 – (4672) 88/5 Locomotive, full size, steam, No.1243, metal / glass, made by Davy and Company, Atlas Engineeri …
8 – (4534) 90/816 Aircraft, full-size, helicopter, Bell 206B Jetranger III, “Dick Smith Australian Explorer”, V …
9 – (4384) 2005/127/1 Clothing (9), boys, cotton / wool / metal / mother-of-pearl / plastic / paper / cardboard …
10 – (4297) 98/54/1 Bicycle, Olympic ‘Superbike’, carbon fibre / metal, Australian Institute of Sport / Royal Me …
11 – (3936) 92/405 Mantel clock, Sessions Clock Co, USA, 1905-1915 …
12 – (3557) 86/1015 Room Divider, “Carlton”, wood / plastic laminate, designed by Ettore Sottsass, made by Memph …
13 – (3481) 2006/68/1 Three piece suit, men’s, corduroy cotton, made by David Jones Ltd, Sydney, New South Wales …
14 – (3336) 2003/83/1 Chair, ‘Wiggle’, cardboard, designed by Frank Gehry, United States, 1972, made by Vitra, G …
15 – (3227) 85/1975 Armchair, `Globe’, fibreglass / aluminium / fabric / synthetic materials, designed by Eero A …
16 – (3204) 99/4/46 Model steam engine and box, donkey engine, metal / cardboard, Scorpion Superior Model / Mode …
17 – (3001) 92/305 Food safe (bush pantry), wood/ metal, unknown maker, [Queensland], Australia, c. 1925 …
18 – (2879) 7949 Locomotive, steam, No. 1, metal, hauled the first passenger train in New South Wales in 1855, m …
19 – (2798) 96/386/2 Evening dress, womens, silk, Madeleine Vionnet, Paris, France c. 1930 …
20 – (2772) L611 Aircraft, full size, Bleriot XI monoplane, wood / canvas / wire, designed by Louis Bleriot, mad …

Top 20 most popular categories* since launch (June 2006)

1 – clothing and dress (1419335 viewed objects)
2 – ceramics (1104058)
3 – numismatics (584429)
4 – pictorials (466852)
5 – textiles (394591)
6 – domestic equipment-home (320764)
7 – decorative metalwork (313593)
8 – toys (277752)
9 – arms and armour (245407)
10 – documents (235438)
11 – health and medical equipment (224492)
12 – glass (223899)
13 – jewellery (222371)
14 – models (220519)
15 – transport-land (217331)
16 – personal effects (202413)
17 – photographs (177066)
18 – musical instruments (156136)
19 – furniture (154648)
20 – juvenilia (143011)

*note – some objects belong to multiple categories

Categories
Interactive Media Web 2.0

SxSW podcasts

The SxSW podcasts have started going up and for those who couldn’t be there because they are on the other side of the world then the podcasts offer a great way of listening to, but not participating in, the major panels and sessions from the festival.

I started with the Bruce Sterling keynote, mainly because Will Wright wasn’t up yet (only very short clips are online now and focus on the game demo rather than the storytelling intro section). True to form, Sterling is provocative and probing. Sterling looks at the ideas of Henry Jenkins, Lev Manovich and Yochai Benkler, all three of whom should be reasonably familiar names to Fresh + New readers. Sterling picks up on Benkler’s ‘common space peer production’ and breaks it down into a series of key points and guidelines.

Slightly less interesting, at least until the audience questions begin was Emerging Social and Technology Trends. The questions around emerging technology trends in the developing world/global South are particularly fascinating.

These are well recorded and eminently listenable podcasts for public transport. More should be appearing on the SxSW podcast archive soon.

Categories
Collection databases

Powerhouse Museum’s Castle Hill stores open and online

We have just launched the website (and the physical site too) for our new Powerhouse Discovery Centre located in Castle Hill.

The PDC at Castle Hill dramatically increases the proportion of objects available for the public to look at (up from the museum world standard 5% ot around 40%!). In the warehouse spaces visitors can browse drawers and storage racks, and then go to a series of internet-connected kiosks to look up more information using a slightly enhanced version of our collection database.

In building the Castle Hill site we have added several thousand new high quality images to the collection database. Have a look either using the Castle Hill OPAC or the Powerhouse Museum’s main OPAC. Both use the same data source, the main difference being that the Castle Hill version has a early iteration of a a forthcoming visual browser we are building (which is highly dependent on colour images!).

Categories
Digital storytelling Web 2.0 Young people & museums

Sub groups of consumer co-created content

From the marketing world comes this quite useful subcategorisation of ‘consumer generated content’. Indeed, seeing co-created content through the lens of marketing can itself be quite revealing.

(summarised)

Consumer-generated media (CGM): At its core, CGM represents first-person commentary posted or shared across a host of expression venues, including message boards, forums, rating and review sites, groups, social networking sites, blogs, and, of course, video-sharing sites. It’s commonly influenced or informed by relevant experience with brands (e.g., “I’m so angry with Jet Blue,” “I love Target”).

Consumer-generated multimedia (CGM2): This subset of CGM is more anchored to “site, sound, and motion” components, each with the potential to dial up the effect and persuasiveness of the consumer storytelling. Visualization elevates drama, emotional resonance, and the ability to prove one’s case through documentation (one big reason TV commercials have been so hard for advertisers to shake).

Consumer-fortified media (CFM): Unilever’s Dove Evolution is a classic example of CFM. The advertisers created the spot, but its meaning was shaped, or fortified, by the conversation, commentary, and debate that wrapped around the content.

Consumer-solicited media (CSM): The term that most commonly captures this form is “co-creation.” Others loosely call it “participatory advertising.”

Compensated consumer-generated media (CCGM): This is when marketers outright pay consumers to do certain things, or when publishers compensate artists or content creators for submissions.

Paid media: This is exactly as it sounds. Marketers buy media, usually in the form of impressions, to affect sales. Some call this “marketer-generated media” (MGM), but the old description works just fine.

Categories
Web 2.0

Michele Martin on “Organizational Barriers to Using Web 2.0 Tools”

Echoing a conversation that was in the office this morning in which Jerry Watkins, Angelina Russo, Lynda Kelly and I were having – Michele Martin writes on Organizational Barriers to Using Web 2.0 Tools (via Beth Kanter)

Are these democratising tools of social media “evolutionary” or “disruptive”? It depends on who you are talking to.

This reminded me of an e-mail conversation I’ve been having with a nonprofit user in Australia. She pointed out to me that while she sees that social media tools make it easier for non-technical types to integrate technology into their workflow, at the same time there’s an ongoing organizational message that says “Leave the technology stuff to the IT department.”

I’m seeing a real tension developing between where various new tools are taking us and how organizations are responding. Most organizational cultures haven’t caught up to technology and institutional barriers are getting in the way of even experimenting with new technologies.

Categories
Digital storytelling Interactive Media Web 2.0 Young people & museums

Jenkins on ‘crud’ in participatory culture

There is an excellent recent post by Henry Jenkins titled ‘In Defense of Crud‘ in which he examines some of the recent debates around fan fiction, YouTube etc. Jenkins’ response to some of the criticisms of ‘participatory culture’ is wonderfully distilled into seven precepts which can be broadly applied.

1. We should not reduce the value of participatory culture to its products rather than its process.

2. All forms of art require a place where beginning artists can be bad, learn from their mistakes, and get better.

3. A world where there is a lot of bad art in circulation lowers the risks of experimentation and innovation.

4. Bad art inspires responses which push the culture to improve upon it over time.

5. Good and Bad, as artistic standards, are context specific.

6. Standards of good and bad are hard to define when the forms of expression being discussed are new and still evolving.

7. This is not a zero-sum game. It is not clear that the growth of participatory culture does, in fact, damage to professional media making.

What is the opportunity cost for museums of not engaging with participatory culture? I’d wager that the issues we face when we do engage are significantly less problematic than if we do not engage. Our audience are already engaging in a participatory culture – its very hard not to do so in a mainstream life – even our television shows are forcing us to vote or their outcomes.

Categories
Museum blogging Web metrics

Towards an ROI measure of museum blogging

Museum blogging is taking off.

Jim S and I have been talking a lot about how blogging is an efficient way of generating a buzz around your museum’s content. At the Powerhouse Museum our flagship blog is really the Sydney Observatory’s blog. It has been charting ludicrous traffic – it now represents over 60% of the Sydney Observatory’s traffic and has been responsible for a 300% rise in site visitation. Most excitingly though is the level of audience participation. So far for 111 posts there have been a mammoth 490 user comments after filtering and spam removal. One post on the Mars hoax email received 135 comments.

I’ve been reading Charlene Li at Forresters’ work on corporate blogging. Their reports propose a framework for measuring ROI on organisational blogging. She summarises the methodology as a chart –


(source: Forresters)

Within the non-profit sector brand visibility is the key benefit from blogging – brand awareness leads to potential future (real world visitation), and in terms of collecting museums and research centres, a general awareness of the nature of “what exactly it is you do other than exhibitions”.

The Sydney Observatory has always had a lower public profile than the Powerhouse Museum. Those Sydneysiders who are aware of its existence (and don’t get it confused with the Observatory Hotel) often don’t associate it with a place that they and their family could visit – let alone look through a telescope – each night.

Prior to the launch of the Sydney Observatory blog there was no way for the astronomers at the Observatory to publish sky-related news, let alone the discoveries of amateur astronomy groups affiliated with the Observatory, nor respond to sightings of fireballs in the sky. The previous website architecture didn’t allow for such ‘loose’ content, nor did workflows allow for such material to quickly edited and posted.

Now, though, Sydney Observatory features prominently in Google searches for related topic areas as a result of the content on the blog. It is also critical to understand that everyone who does a search for ‘Comet McNaught Sydney’ for example, and visits the blog (which ranks #2 for such a search), is now made aware of the existence of the Sydney Observatory, and its activities.

Here’s another excerpt from Li –

(from Charlene Li) Q: Is there a standard ROI for blogs? A: Nope – sorry, it isn’t that easy! Just as there isn’t a standard ROI for a Web site, there’s no standard for a blog. It depends on what the goal of the blog is and also how much investment the company (and the blogger) puts into it.

Q: What’s the best way to measure the effectiveness of a blog? A: Again, it starts with the goal of the blog. I strongly suggest that companies start with the goal, develop metrics that measure the attainment of that goal, and find ways to assign value to those metrics.

Q: But aren’t blogs risky? How do you take that into account? A: We definitely take risk into account by generating scenarios that show the impact of low-likelihood but high impact events — such as a lawsuit.

Q: Our CMO/CEO/CFO won’t let us have a blog until we can show him/her the definitive ROI of a blog. Help!! A: It’s not an unreasonable request — they don’t really understand the value of a blog and see just the potential cost and risk. By going through the exercise of defining and quantifying the benefits, costs, and risks of a blog, you’ll be educating your C-level executives while also demonstrating the discipline that they expect.

So, how does your organisation measure the success of its blogs?

Jim and I will present some answers shortly.

Categories
AV Related Digitisation Web 2.0

Testing podcast transcription – Casting Words

Audio transcription is an essential part of digitisation. Our curatorial researchers are recording thousands of hours of interviews with subjects onto a mix of analogue (tapes) and digital (MP3/WAV) media. These oral histories are filed away for preservation purposes but will remain almost unusable in any serious way until they are digitised – that is, transcribed into a searchable machine readable format.

Likewise, we record many events at the museum and in the last few years have begun offering them as podcasts on our websites.

Last week we tried out a service called Casting Words. Casting Words is a transcription service that offers to send back a transcription of any podcast or audio file, quickly and cheaply.

Generally transcribing podcasts, especially those of live talks and events, has been an arduous task, one that even with the best of intentions often doesn’t happen. Transcription has tended to be expensive and time consuming. It has also been typically inaccurate.

Yet without a transcription the contents of the podcast are rendered invisible to all but the most dedicated internet user – who already knows of the podcast’s existence. This is because a transcript not only serves the interests of vision-impaired users and those wanting to skim read before downloading, it also exposes the content of the podcast to search engines thus aiding discoverability.

Here are the results from our test of Casting Words.

TEST 1 – The Sydney Observatory February 2007 night sky guide – this recording runs for about 14 minutes and has one speaker talking throughout. Whilst not explicitly technical and aimed at a general audience it is about constellations and uses common astronomical terms. It is recorded in a quiet room with no background noise and is edited in post-production to a script.

Original MP3 – listen at the Sydney Observatory blog
Transcript – view online
Time taken to transcribe – 24 hours (from submission to receipt of finished product)
Cost of transcription – US$10.50

TEST 2 – The live recording of a D-Factory public talk titled “Pop-ups, fold-outs and other design adventures” – this recording runs for 61 minutes and consists of four individually microphoned speakers recorded into a single stationery video camera via a live mixing desk feed. The same audio feed is used as the signal to the PA system in the room. As a result of the room acoustics, mixdown and speaker behaviour each speaker’s voice is inconsistently recorded and the recording itself fluctuates in volume. The talk is set up in the manner of a studio interview – a little like a TV chat show – it is totally unscripted and the recordings have no post production editing. There is background noise present throughout.

Original MP3 – listen or view video online (3rd item, right column)
Transcript – read online at Design Hub
Time taken to transcribe – 48 hours
Cost of transcription – US$45.50

How accurate are the transcriptions?

Other than American spellings, the transcripts are very accurate. In the Sydney Observatory transcript there was one numerical error that has been corrected. The DFactory transcript is a little more difficult to check but there does not seem to be significant errors – which, given the original recording quality, is surprising. There is one instance where the transcriber has noted that all the speakers were speaking at once and thus no transcript was available for those few seconds.

How does Casting Words work and why is it so cheap?

Casting Words uses Amazon’s Mechanical Turk to divide complex work into a small tasks which are advertised for freelancers (turkers, as they are known) to perform – anywhere, anytime in the world. There are some tasks that humans perform better than machines and Amazon’s Mechanical Turk uses its machines to allocate these tasks more efficiently. The name ‘mechanical turk’ comes from a (in)famous hoax by Hungarian baron Wolfgang von Kempelen in the late 18th century.

Turkers who undertake Casting Words transcription tasks are not unqualified. Each has to undertake a small qualification task, and their rate of payment depends upon their qualification level. Also, each transcription is edited and checked as separate tasks. There seem to be about 10,000 qualified trascribers and 3500 qualified editors.

Salon.com did a report on Mechanical Turk mid last year which interviewed Casting Words who explain how it works.

With a little code, plus the turkers, it has succeeded in basically automating the process. The company charges its customers from 42 cents a minute for podcast transcription to 75 cents a minute for other audio. CastingWords pays Mechanical Turk workers as little as 19 cents a minute for transcription. If a transcription job is posted on Mechanical Turk for a couple of hours at the rate of 19 cents a minute, and no worker has taken on the project, the software simply assumes the price is too low and starts raising it.

After a transcription assignment is accepted by a worker, and completed, it goes back out on Mturk.com for quality assurance, where another worker is paid a few cents to verify that it’s a faithful transcript of the audio. Then, the transcript goes back on Mturk.com a third time for editing, and even a fourth time for a quality assurance check. “It’s been terribly useful for us,” says Nathan McFarland of Seattle, one of the co-founders of CastingWords. Transcription is the type of relatively steady task that keeps turkers with good ears who are fast typists coming back. “There are people who have been with us for months, and they’re not leaving,” says McFarland.

The article is essential reading as it also explores the criticisms of Mechanical Turk – the nature of labour allocated under this system, the pay rates and worker agreements, and the question raised by many people who do the work, “do they actually consider it as ‘work’?”. Much of the other tasks done by turkers are micro-tasks – very short, quick tasks such as image tagging, or trivia quiz answering.

The demand for transcription is only going to increase. Each month we are recording more and more in digital form, and the demand for it to be made searchable (which is one of the reasons we digitise in the first place) gets stronger and stronger. What other services have other museums tried to deal with this media overload?

Categories
Interactive Media Web 2.0

Brief report on Yahoo Pipes and RSS

Yahoo launched their Pipes application a few days ago. Traffic seems to have almost overwhelmed the site, but if you can get on to it you will find a very nice, and well featured visual tool for combining, manipulating and presenting different data sources.

This effectively allows you to build your own data mashup in a matter of minutes. Yahoo provides a range of data sources already but you can add your own RSS feeds, for example, or scrape data from web pages and then combine them with image searches, maps, and internet searches. As Stutzman has pointed out, Yahoo has realised the importance and potential of RSS and Pipes should reinforce this in the minds of other developers.

I built a quick Flickr results display based on an Opensearch feed from our collection search in 30 minutes by pulling apart and looking at the way in which others had built Pipes to display Flickr results from other RSS feed data such as the New York Times headlines. The Opensearch feed is not clean enough to get a ‘good’ Flickr result, but with a bit more time Pipes could clean up and improve the results.