Microsoft Is Listening: Vista Speech Recognition Is Worth Talking About

As a professional programmer who also happens to be afflicted with spinal muscular atrophy (a severe neuromuscular disorder), I find PC accessibility to be of paramount importance. Accessibility (or the lack of it) directly influences how efficiently I am able to work, which invariably influences my bottom line. More than that, it affects my state of mind. Being able to click that little red 'x' to close a window on your desktop may seem easy to most of you, but it can become quite tiresome, or even impossible, for many users with disabilities. So when the world's most influential software maker introduces a new or updated accessibility feature, I take notice. And test driving Windows Vista's speech recognition engine most certainly opened my eyes, er, mouth!

Why am I so excited? Well, for one, speech recognition has finally become a first-class citizen in Windows. Before Vista, speech recognition was never installed by default in Windows (and for good reason). It used to be effective only in a very limited number of scenarios, like dictating in Microsoft Word, but now it is useful almost everywhere. Why is that? The short answer: It's truly integrated into the OS, which gives it much more power than ever before. The long answer: Nearly all Windows controls (text boxes, dropdown lists, menus, etc.) now interface with the Text Services Framework, but you can learn the details elsewhere from the experts.

So what does all of this really mean? Now, I can surf the web by voice without touching a mouse; I can click a point on the screen by speech alone; and I can dictate this article without typing on a keyboard. Pretty cool!

Of course, most of this has been available before in third-party applications, like Nuance Dragon NaturallySpeaking (DNS), but, in my opinion, never so elegantly and effectively across the entire user experience. Just try comparing Windows Speech Recognition and DNS when surfing the web in Internet Explorer or finding a file in Windows Explorer, and you'll quickly understand what I mean.

Windows Speech Recognition (WSR) still has room for improvement. One significant shortcoming of WSR is that there is no macro support yet. Also, my dictation is still more accurate in DNS, but the difference is minimal, and, with more use, WSR may very well eliminate that gap. Command-and-control is significantly superior with WSR, though, and the price is right (it's included in the OS). All in all, the speech recognition competition will definitely benefit consumers.

I, for one, appreciate all the effort Microsoft has put into speech recognition and am grateful it has become a mainstream feature in Windows. Indeed, I may have actually experienced a genuine "wow" moment because of it. ;-)
