MY NEW, LOWER, REVISED, EXCELLENT RATES FOR 2012: Bronze, Gold, or Platinum--Choose Your Level!


PLATINUM: Tier 1 (1 to 2 business days' turnaround) = $95 per audio hour, or $1.58 per audio minute. [This Tier is also for conference calls, medical transcription, or very difficult to hear audio.]


GOLD: Tier 2 (3-5 business days' turnaround) =$80 per audio hour, or $1.33 per audio minute. [This Tier is also for interviews with 2 interviewees or English as a second language audio files.]

BRONZE: Tier 3 (6-10 business day's turnaround) = $70 per audio hour, or $0.86 per audio minute [This Tier is also for well-recorded audio with one person talking or interview with one other person.] For proofing of voice recognition documents, please also use Bronze Level charges.
[A $10 bad audio fee will be charged for extremely difficult or inaudible mp3 files.]

13.5.10

Mispredicted Words, Mispredicted Futures

The accuracy of computer speech recognition flat-lined in 2001, before reaching human levels. The funding plug was pulled, but no funeral, no text-to-speech eulogy followed. Words never meant very much to computers—which made them ten times more error-prone than humans. Humans expected that computer understanding of language would lead to artificially intelligent machines, inevitably and quickly. But the mispredicted words of speech recognition have rewritten that narrative. We just haven’t recognized it yet.

After a long gestation period in academia, speech recognition bore twins in 1982: the suggestively-named Kurzweil Applied Intelligence and sibling rival Dragon Systems. Kurzweil’s software, by age three, could understand all of a thousand words—but only when spoken one painstakingly-articulated word at a time. Two years later, in 1987, the computer’s lexicon reached 20,000 words, entering the realm of human vocabularies which range from 10,000 to 150,000 words. But recognition accuracy was horrific: 90% wrong in 1993. Another two years, however, and the error rate pushed below 50%. More importantly, Dragon Systems unveiled its Naturally Speaking software in 1997 which recognized normal human speech. Years of talking to the computer like a speech therapist seemingly paid off.

However, the core language machinery that crushed sounds into words actually dated to the 1950s and ‘60s and had not changed. Progress mainly came from freakishly faster computers and a burgeoning profusion of digital text.

Speech recognizers make educated guesses at what is being said. They play the odds. For example, the phrase “serve as the inspiration,” is ten times more likely than “serve as the installation,” which sounds similar. Such statistical models become more precise given more data. Helpfully, the digital word supply leapt from essentially zero to about a million words in the 1980s when a body of literary text called the Brown Corpus became available. Millions turned to billions as the Internet grew in the 1990s. Inevitably, Google published a trillion-word corpus in 2006. Speech recognition accuracy, borne aloft by exponential trends in text and transistors, rose skyward. But it couldn’t reach human heights.

Source: National Institute of Standards and Technology Benchmark Test History (borrowed from: http://robertfortner.posterous.com/the-unrecognized-death-of-speech-recognition)

Take a Moment to Sound off:

Logo

We Benefit from Your Opinions

Page One
1.
2.
3.
4. Become a member of this blog. We will not allow anyone access to your name or other personal information. It is for the purpose of occasional notifications of this blog's updates. Thanks!


 Required Question
Online Survey Software
Online Surveys powered by SurveyGizmo

My Tweets in Twitterville

Twitter Updates

    follow me on Twitter

    How Vital Medical Transcription Is

    <a href="http://www.linkedtube.com/JPYI_pqKwEw0689635ab3ffcab54c6c7ea57c51099e.htm">LinkedTube</a>

    Some of My Favorite Music--Stay and Listen a While


    Get a playlist! Standalone player Get Ringtones