Crowdsourcing the pain of transcribing audio

The trouble with recording interviews is that you have to transcribe them. So after one of my forays to New Haven last week, where I interviewed people in connection with a book I’m working on about community news sites, I had a ton of audio and the unpleasant task of translating it all to text.

I decided to crowdsource the task through an Amazon.com service called Mechanical Turk. More about that in a moment. But first I want to explain my reluctance to try it.

I think the results are better when I do it myself. I have to listen carefully, which helps me seal the best stuff inside my leaky brain. I know what we were taking about, which means that I’m not flummoxed by names and unusual phrases, as any transcriber would be. And because I have an idea of how I’ll use the material, I can decide on the spot what to transcribe verbatim, what to paraphrase and what to leave out altogether. So I knew I could potentially be giving up a lot by turning the task over to others.

Some years ago I used a transcription service near Harvard Square when time was of the essence and when, most important, someone else was paying the bill. This time, faced with many hours of work, I decided to take advice given me last fall by Zach Seward and try MTurk. Seward, then with the Nieman Journalism Lab, told me that lab director Joshua Benton had used it to transcribe this talk by New York University’s Clay Shirky. I was impressed.

I posted a query on Twitter, and several people responded by sending me a link to an online guide by Andy Baio. I decided to try it with two interviews — a 65-minute recording with New Haven Independent founder and editor Paul Bass, made on his reasonably quiet back deck, and a 35-minute conversation with New Haven alderman Michael Jones, at an outdoor café on a busy street.

My first step was to go through the cumbersome process of converting my Olympus recorder’s WMA files to MP3s, and then dividing those MP3s into five-minute chunks so that a number of different people could apply themselves to the task. By the time I got around to doing the second interview, I had stumbled upon EasyWMA, a $10 utility that took the pain out of conversion, and had finally taught myself enough about Audacity, a free audio editor, so that I could painlessly produce five-minute bits.

I was surprised by how quickly the crowd swarmed over my files — in less than a day, I had everything I needed. Unfortunately, the quality was extremely uneven. Some of the mistakes were bizarre or unintentionally hilarious. How “state of Connecticut” became “state of Kentuckian” is one I’ll never figure out. And here’s a choice excerpt from my conversation with Bass. First, the MTurk version:

They had a Sunocompass call with WBR few weeks ago to get the advice, how the membership strives. The taste and ever didn’t undership strives because I felt that if the widely suceessful they might get thirty to fourty thousand dollars.

Now, what he really said:

They had us on a conference call with WBUR few weeks ago to get advice on how to do membership drives. In the past I hadn’t done membership drives, because I felt that if they’re wildly suceessful they might get you to $30,000 or $40,000.

Following Baio’s advice, I’d set a price of $2 per five-minute excerpt. You have the option of rejecting unusually bad work, refusing to pay and letting someone else take a crack at it. I decided to accept everyone’s work, including the person who produced what you see above. But I blocked two people (including the one I just cited), so that if I use the service again, they won’t have a chance to work on my stuff.

Overall, I paid $41.80*, $3.80 of which went to Amazon, the remainder to the folks who actually did the work.

Between file conversion and preparation, downloading transcribed interviews, listening to everything again and cleaning up the transcripts, I don’t know how much time I saved. Not much, probably. Yesterday I transcribed two interviews myself, and I thought the results were much better.

On the other hand, I purposely chose my Bass interview for MTurk because it was long and he talks very quickly. It was also an unusually substantive conversation, and I knew there wasn’t much I wanted to leave out. Most of the transcribers did an OK job.

My bottom line is that, in the future, I would probably reserve MTurk for situations in which I have good audio quality and need a full verbatim transcript. Even knowing that I’ll have to do a fair amount of retyping, it’s still better than starting from scratch.

But if I’m producing normal interview notes, I’ll handle it myself.

*Addendum: Jack Shafer of Slate told me the price I cited doesn’t mean much without comparing it to the price of a professional transcription service. So I contacted a good one and was told it would cost about $140 an hour — or about $230, nearly six times as much as what I paid. That’s a huge mark-up. On the other hands, the results would have been more usable.

Illustration via Wikimedia Commons.

Talking about Google and privacy on “Greater Boston”

I’ll be on “Greater Boston” today at 7 p.m., talking with host Emily Rooney about Google’s mounting privacy problems. On Monday, Connecticut attorney general Richard Blumenthal announced that he would lead an effort comprising about 30 states to investigate how Google came to intercept e-mail, passwords and other confidential information when collecting data for its Street View feature.

Come on and Safari with me

Click on image for larger view

Because I had a lot of writing to do yesterday, I indulged myself with some quality screwing-off time and installed Safari 5, the latest version of Apple’s Web browser. I can’t say I expected much. Safari has always been feature-laden but sluggish. The new version, though, is speedy enough that I may make it my primary browser.

For several years I had been a dedicated Firefox user. But after Google released Chrome for Mac earlier this year, Firefox seemed downright slow by comparison. Chrome blazes, but it doesn’t have much else to recommend it. I especially don’t like the way it displays type — it seems like everything is either a smidgen too small or too large.

The new Safari, by contrast, is slick and attractive, and has a lot of nice touches. I’m a big fan of the Top Sites window, a graphical representation of my most-visited stops on the Web. Chrome has something similar, but the customization features are minimal. Safari also handles bookmarks nicely. Most important, it seems as fast as Chrome, and, unlike Firefox and even Chrome, it doesn’t gag on the Boston.com ad server.

The most interesting feature of Safari is something called Safari Reader. Open a page with an article on it, and a clickable label appears in the address bar. Select it and a new window opens with a nicely formatted text page. Unfortunately, Reader makes it easier to avoid advertising. But since photos within the text are displayed, I see no reason why ads couldn’t be embedded as well.

Reader is especially nice for complex sites with tiny type, such as the example I’ve included above from the New Haven Independent.

One problem is that Web designers have to write to Reader’s specifications or it won’t work properly. NYTimes.com, for instance, handles jumps with aplomb, whereas Boston.com, upon encountering a jump, incorrectly displays the first page again. Reader is going to have to prove very popular in order to force Web designers to change. But it could happen. Safari, after all, isn’t just for Macs (and PCs), but for iPads, iPhones and iPods as well.

No sooner did I tweet my enthusiasm about Reader than Alex Johnson responded by telling me that the same feature had been available in other browsers for some time. Sure enough, I found an extension for Chrome called Readability that did exactly the same thing. But it was glitchy compared to Safari Reader, which Johnson concedes is “the better option for Mac-only users.”

Safari also has a built-in RSS reader, but on first glance I see no reason to switch from Google Reader, which I love. (A lot of programs named Reader, eh?) There doesn’t seem to be any way of pulling my Google Reader feeds into Safari, which would be a minimum requirement for me even to test it.

Between Safari and Chrome, I doubt I’ll be using Firefox any time soon. I’ll try Version 4 when it is released later this year. For now, though, Firefox has definitely fallen behind.

Bringing together citizens, government and media

[youtube http://www.youtube.com/watch?v=qIsFcydDbkw&hl=en_US&fs=1&rel=0]
SeeClickFix is an interactive website that lets users report problems in their communities and plot them on a Google map. Because it’s an open forum, local officials can check in to see where trouble spots are, and news organizations can track them as well. The New Haven Independent is one of many news sites that posts the RSS feed for its community. The interactive pothole map at Boston.com is powered by SeeClickFix as well.

On May 18 I had a chance to sit down with SeeClickFix co-founder and chief executive Ben Berkowitz in his second-floor office in downtown New Haven. Berkowitz, a hyperkinetic 31-year-old, had forgotten we were supposed to meet, but he graciously agreed to a video interview despite having a full agenda.

Berkowitz describes SeeClickFix as “citizens working collectively,” and explains that he started it three years ago when he was trying to get graffiti cleaned up in his neighborhood. The site has been growing rapidly since the New York Times published a feature story on it in January.

Today, the company has some 400 media partners and employs five people thanks to a $25,000 We Media prize and several hundred thousand dollars’ worth of venture capital. Although the basic service is free, SeeClickFix charges media sites for certain premium services, and posts advertising as well.

One aspect of Berkowitz’s philosophy that I found particularly interesting was his insistence that SeeClickFix is not just for holding government accountable — citizens, too, should take responsibility. As an example, he pointed to a similar project, the British website FixMyStreet — a great name that he nevertheless doesn’t like, he says, because it removes accountability from citizens and places it entirely on the government.

Does Berkowitz, who previously worked as a Web designer, consider himself a journalist? He pauses before answering. “I think SeeClickFix is a tool for journalists,” he replies. “I don’t think that I am a journalist. I don’t think of us as a news organization.”

For a good example of how journalists can use SeeClickFix as a reporting tool, see this story on “the ugliest storefront on Chapel Street” in the New Haven Independent.

A multitasking, multimedia journalist

Thomas MacMillan covers a finance committee meeting in New Haven City Hall.

Back when I was covering city council, school committee and board of selectmen meetings in the 1970s and ’80s, the only tool I brought with me was a notebook and a pen.

How times have changed. On Tuesday evening I connected with Thomas MacMillan, a reporter for the New Haven Independent, so I could watch him cover a finance committee meeting. (Click here for a video feature on the Independent, a non-profit community news site.) We met outside the aldermanic chamber in New Haven City Hall just before 6 p.m., and I followed him to the front row.

MacMillan accepted congratulations from a few city officials for a national reporting award he won last week, then settled in to live-blog the debate. He was a bit harried — he’d just come over from covering another event, and he hadn’t had time to write the introduction. A few minutes later, though, he was good to go.

For the next two hours I watched as MacMillan posted a series of updates on what was going on, pored through budget documents, moderated and posted reader comments, periodically jotted a few things down in a notebook (how old-fashioned), and took photos.

Alderman Darnell Goldson, who was sitting in our row, whispered, “Hey, Thomas!”, and pointed behind us, where an otherwise-dignified looking man was wearing a lighted-up Christmas tree on his head. His aim was to protest Mayor John DeStefano’s proposal to save money by not erecting a tree on New Haven Green this year. MacMillan turned and shot.

And when two aldermen got into a semi-heated discussion about cuts to the education budget, MacMillan pulled out another camera and shot some video, although he ended up not using it.

Despite my front-row seat, I would have had little idea of what was going on if it weren’t for MacMillan’s updates, which I read on my BlackBerry.

I left at 8; the hearing ended at 9:30. Later, MacMillan took his blog items and notes and turned them into the story that you can see today, and posted a few photos as well.

What MacMillan did last night was impressive but not unusual. The technical skills he brought to bear on his assignment were nothing that couldn’t be mastered in a few weeks. It’s the mindset that matters. Journalists today must be prepared to juggle a variety of tasks and to perform them with minimal supervision.

And to think that there was a time when the biggest challenge in covering a meeting was to stay awake.

From talking about it to just doing it

[googlemaps https://maps.google.com/maps/ms?ie=UTF8&hl=en&msa=0&msid=110849334117410151532.00048518b4ffdc95dd0ae&ll=41.656497,-72.388916&spn=2.872863,5.493164&z=7&output=embed&w=500&h=350]
When I first started teaching a course called Reinventing the News a few years ago, I envisioned it mainly as a seminar. The idea was that we would look at some case studies of where the news business might be headed and blog about it.

I quickly realized that wasn’t good enough. The spark for me was a student who had just come back from her co-op job at the Patriot Ledger of Quincy. She had assumed the most complicated tool she’d have to use would be a notebook. Instead, she was tossed a point-and-shoot digital camera and told to teach herself how to capture and edit video. She liked it so much she ended up changing her career goals from print to video.

It was with some trepidation that I began adding three weeks of Web video to Reinventing a year and a half ago. First, I had to teach myself how to do it. And it required exposing some vulnerabilities. I knew some students would be starting from zero, but I also knew that others were already better at video journalism than I’d ever be. Nevertheless, it proved to be well worth it.

Last week we finished the most complex version of Reinventing I’ve offered, and my students had to pull together a variety of skills for their final project. The assignment was to use free online tools to create a multimedia story. The elements:

  • An 800- to 1,000-word story about a digital media project that had caught their eye, written up as a blog post with relevant links.
  • A slide show of six to 10 still photos, posted to Flickr and embedded in their blog.
  • A two- to five-minute video they shot and edited, posted to YouTube and also embedded in their blog.
  • An explanation of how they used social networks such as Twitter and Facebook to find sources and report their story.

At the end of it all, they were asked to note the location of their story on a Google map and link to their blog post. The result is the map I’ve embedded above. I invite you to explore. These young journalists did a terrific job, and I am very proud of them.

If you click on “View Reinventing the News: Final Projects in a larger map,” directly under the embedded map, you’ll find the list of students on the left-hand side. Click on a name to find his or her spot on the map, each one of which is linked directly to their project. Hmmm … Google could make this a little bit simpler, eh?

I’ll be teaching Reinventing again this fall, and I will continue to refine. My first thought is that I ought to dump the brief wiki exercise I offer and instead delve more deeply into how to handle comments. Any thoughts you have would be welcome.

Open systems, open society

Apple’s attempt to ban a Pulitzer-winning cartoonist from its iTunes Store is an extension of the same mindset that led it to keep Adobe’s Flash software off its new generation of closed devices — the iPhone, the iPod touch and the iPad. And it shows that Steve Jobs and company are poorly cast in their role as a savior of the struggling news business. Or so I write in the Guardian.

Apple’s heavy-handed approach to speech

I’m trolling for Boston-area stories about Apple’s heavy-handed approach to allowing and banning apps for the iPhone, the iPod Touch and now, of course, the iPad. If you know of any, please pass them along. I would love nothing more than to give Steve Jobs a Muzzle Award, but I need a local angle.

What prompts my request is this outrageous example involving newly minuted Pulitzer-winning cartoonist Mark Fiore, who was unable to get his app approved because his work “ridicules public figures.”

I’ll be in the market for a new phone in the summer of 2011. It’s looking less and less likely that I’ll be going with Apple, much as I love its technology.