Cars, gadgets that 'hear' your voice
Xzibit  |  by money.cnn.com. All rights reserved. 2.04 | 6:28

--========================================================-->

Now You're Talking

Voice-recognition technology is no longer stuttering - and that means huge opportunities for established players and newcomers alike.

(Business 2.0 Magazine) -- As man-vs.-machine classics go, it had the crucial elements: The brash young champion.

The new-and-improved computing powerhouse. That the champ was 17-year-old Ben Cook, anointed by the Guinness Book of World Records as the world's fastest text messager, and the machine was not a supercomputer but a cell phone, didn't detract from the drama - at least not to the crowd gathered at an Orlando voice-recognition software conference last fall.
They did it with Nuance's recently launched Mobile Dictation software, which will be available through carriers as early as the first half of this year.

There's also a broader explanation: Voice recognition, long ridiculed as one of those perpetually just-around-the-corner technologies like the personal jet pack or the Dick Tracy wristwatch, has finally arrived.
Advances in processing power, new software algorithms, and even better microphones have enabled established players like Nuance and a raft of startups to design systems that work - often at near 100 percent accuracy rates. And they're creating explosive potential for growth in markets for everything from handheld dictation devices to mobile phones to auto parts to battlefield translators.


The overall market for voice-recognition technology topped $1 billion for the first time in 2006, a 100 percent increase in just two years. Within that broad market, there are numerous subsectors that are likewise surging: The market for server-based voice-recognition technology to power call centers and the like reached nearly $600 million in 2006 and is expected to double by 2009, according to Opus Research.


The market for speech technology embedded in devices such as phones and auto dashboards - worth about $125 million in 2006, according to research firm Datamonitor - is expected to quadruple to $500 million by 2010, powered by the rapid spread of voice-command features on phones and cars with increasing levels of talking electronics, from music players to navigational systems. Ultimately, some experts say, voice-recognition systems are likely to be built into almost every gadget, appliance and machine that people use.
The surge in demand is already triggering investment from established voice players and newcomers alike.

In 2006, ( ) bought Dictaphone to enhance its presence in the health-care industry, even as Nuance's sales grew 20 percent to more than $300 million.
( ) new Vista operating system comes with voice technology that, after suffering embarrassing glitches, is now winning kudos from reviewers. ( ) has said that it's studying technology to enable search-by-voice.

Venture capitalists, meanwhile, are lining up to fund entrepreneurs with voice-recognition ideas all over Silicon Valley and beyond. Speech technology, says Datamonitor analyst Daniel Hong, is finally transitioning from a cool technology to a business solution.

The next generation
Voice-recognition technology dates to 1952, when Bell Labs researchers cobbled together a primitive system that could recognize numbers spoken over a telephone.

Progress since has been halting, but with the advent of far more powerful computing components and years of plain old trial and error, systems today have finally reached the point where they can cope with innumerable accents, dialects and quirks of speech.
VoiceBox Technologies, a startup in Bellevue, Wash., in 2004 unveiled a prototype whose components had to be carried in a steamer trunk.

Today roughly the same system fits on a device the size of a credit card and could be the brains of Toyota's voice-command dashboard systems (see correction below).
VoiceBox systems are now so sophisticated that they can analyze context to, say, figure out if the command traffic refers to road congestion, tunes from Steve Winwood's old band or a dope-smuggling film starring Michael Douglas.
Today's systems also have powerful capacity to essentially teach themselves.

Tellme Networks, a startup in Mountain View, Calif., makes voice-recognition software used for corporate call centers and telecoms' 411 information systems. Tellme's platform captures some 10 billion utterances annually and constantly analyzes them, improving the system's precision literally every day.

Voice recognition is all about pattern recognition, says Tellme executive Jeff Kunins. The more data you have, the better the recognition gets.

--startclickprintexclude-->

And the more valuable voice recognition becomes as a customer tool. Call centers and customer service departments are notorious for the infuriating Press or say 1 purgatories that older speech-recognition technologies created, but customer outrage isn't the only penalty: The average call-center call costs $5 if handled by an employee but 50 cents with a self-service, speech-enabled system, according to Data-monitor.
Online brokerage E-Trade Financial uses Tellme to field about 50,000 calls a day; half never go to an E-Trade employee.

The company says Tellme's system is saving it at least $30 million annually.
Startup TuVox is also racking up customers in the call-center and corporate markets. Its VP for marketing, Azita Martin, has her team dial a call center and record the typically torturous, multistep efforts to, say, reach the billing department.

Then they create an audio file that reveals what the interaction could sound like if Martin's target used TuVox's software for routing calls with advanced voice-recognition technology. She e-mails the two interchanges to the CEO of the company using the call center. The contrast has helped Martin sign up numerous clients during the past few months - one reason TuVox's annual revenue is growing at double-digit rates and its customer base has quadrupled in 12 months.

Telecom New Zealand, one of its new customers, reports a tripling of call-center customer satisfaction since it installed a TuVox system.
While call centers and autos are expected to continue to be growth markets for voice recognition, the real bonanza will likely come in improved systems for all manner of mobile devices. Start with cell phones: Telecom companies think consumers will pay for a host of additional services such as dictating e-mail or searching for a restaurant if there's an easy-to-use voice interface on mobile phones.

Indeed, Opus Research says telecoms expect to earn an additional $5 to $15 per month from every customer who opts for a voice-enabled phone.
Numerous startups are scrambling to provide that technology, including Promptu. Founded in 2000 by speech-technology veterans, the Menlo Park, Calif.

, startup has developed a package of voice-recognition features that will be offered through several carriers later this year. The telecoms are calling us now, says Brady Bruce, a Promptu senior vice president. I love that.

Real

Creative uses
Other startups are developing voice features for everything from MP3 players to handheld GPS devices to laptops.

Read more on by money.cnn.com. All rights reserved.
Keywords: Opus Research, e Trade
Related news
  • From the Blotter
    Mariah Carey

    The following incidents were reported to the Fort Belvoir Provost Marshal Station during the week of Oct. 15-21. bull; An individual was charged with traffic accident resulting in damage to property. The individual was parked backed into a parking space...

  • Ah, the bicycle
    Vivica Fox

    Bike or car? Think twice -- Shanghai Daily | 上海日报 -- English Window to China News...

  • Sales My Writings
    Travis Barker

    The best and the biggest international brands are here in India –but the irony if it all: where is the after-sales-service? So integral to a brand, so critical for it s success and so taken for granted in developed markets! In India, after sales se...

  • reconditioning
    Usher

    It’s official – I’m depressed. Lately, I feel about as palpable as a downed power line on a lonely country road...

  • more...
    Beyonce

    Dried cellulose pulp, minted pieces of metal -- an icy wasteland only slightly warmer than interstellar space, unspoiled by locals -- oldest populated continent, with the most large land species -- former Denebian penal colony, whose descendants include...

Post comments
Name
Place
8 + 1 =
Comments