IoT Agenda

Jun 29 2017   5:06PM GMT

The next big innovation in industrial enterprise is voice: What’s the holdup?

Brian Ballard

Think about the ways we’ve been navigating around computers and information systems for the past four decades: keyboards, mice, touchscreens, even data gloves. What do they have in common? They are all designed for people who can use their hands as part of their information task. But what about the workforce that needs to keep their hands on their tools and equipment, the people working on assembly lines, in warehouses or at customer locations and job sites?

Voice offers hands-on professionals a much-needed interface

Help for hands-on workers may be on the way from a technology that has seen breakout success in the consumer world. Digital assistants like Amazon’s Alexa and Apple’s Siri let people interact with devices, find information online and perform complex tasks, all through the power of voice. Imagine how that could help workers who need to be connected to information but don’t have hands free to operate a keyboard or touchpad.

In the industrial environment, that capability needs to follow workers around the factory or warehouse. Fortunately, voice is being bundled with another technology to assist hands-on workers: smart glasses. Smart glasses offer a viewing experience that doesn’t require people to look away from what they’re doing to look at a display screen or paper document. Instead, these head-mounted displays put information like checklists, maps, product documentation, data outputs from connected machines and even instructional videos in the worker’s field of view. This approach belongs to a subset of augmented/mixed reality technologies that we call assisted reality.

The utility of AR and smart glasses raises a question of UX design: How should hands-on workers interact with information presented to them on a display device that is not equipped with traditional inputs, in work scenarios that do not allow for them to hold or interact with even a simple mobile phone or touchscreen?

Voice interaction solves this problem. Workers can issue simple voice commands like “mark all steps complete” or “open next task” to invoke the powerful capabilities of the system. Some software allows for voice-to-text transcription, turning spoken words into documents, annotations on a picture or process, or communications with a remote colleague or expert. As artificial intelligence capabilities mature, some software will be able to accommodate context-based queries (“Where does this part go?” or “Am I doing this right?”) that will allow people to learn faster and work faster with less effort, greater confidence and fewer errors.
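As a rough illustration of the simple-command case described above, the following Python sketch maps an already-transcribed phrase to an action on a work checklist. Everything in it is hypothetical: the `Checklist` class, the `handle_utterance` function and the reply strings are invented for this example, and it assumes a separate speech engine has already turned audio into text. It is not how any particular product works.

```python
from dataclasses import dataclass, field


@dataclass
class Checklist:
    """Hypothetical work-instruction checklist shown on a head-mounted display."""
    steps: list[str]
    done: set[int] = field(default_factory=set)
    current: int = 0

    def mark_all_complete(self) -> str:
        # Record every step index as completed.
        self.done = set(range(len(self.steps)))
        return f"Marked {len(self.steps)} steps complete"

    def open_next_task(self) -> str:
        # Advance to the next step, staying on the last one if already there.
        if self.current + 1 < len(self.steps):
            self.current += 1
        return f"Now showing: {self.steps[self.current]}"


def normalize(utterance: str) -> str:
    """Lowercase the transcript, drop punctuation and collapse whitespace."""
    cleaned = "".join(ch for ch in utterance.lower() if ch.isalnum() or ch.isspace())
    return " ".join(cleaned.split())


def handle_utterance(checklist: Checklist, utterance: str) -> str:
    """Look up a transcribed phrase in a fixed command table (an assumption;
    real systems may use fuzzier matching) and run the matching action."""
    commands = {
        "mark all steps complete": checklist.mark_all_complete,
        "open next task": checklist.open_next_task,
    }
    action = commands.get(normalize(utterance))
    if action is None:
        return "Command not recognized"
    return action()
```

A fixed phrase table like this keeps the command set small and predictable, which suits noisy environments; the context-based queries mentioned above would need something considerably more capable than an exact-match lookup.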

What’s muzzling voice in the enterprise?

The technology that enables this isn’t science fiction. Companies like Apple, Amazon and Microsoft have already invested billions in making it real and useful for consumers around the world. So why hasn’t it taken off in the enterprise?

There are two main reasons. First, voice recognition is largely irrelevant to the desk-based knowledge workforce, which commands the majority of IT spending and attention, so it hasn’t been high on the agendas of CIOs tasked with provisioning business systems. That should change: as companies turn their attention to Industry 4.0 and an era of smart, connected machines, investments that empower the frontline workforce in manufacturing, logistics and field service will start to increase as well.

The second big stumbling block is the technical architecture of the systems themselves. Voice recognition is powered by machine learning systems that are constantly updating based on millions of user interactions. These systems, and the accuracy they afford, require the kind of massive processing power and back-end data that reside in the cloud. For some customers, the idea of any public cloud implementation raises concerns about data security, access control, user privacy and legal risk. Consequently, enterprises that are reluctant, for whatever reason, to migrate business applications to the cloud cut themselves off from these kinds of advanced capabilities.

Speaking up for higher productivity

Businesses that take a go-slow approach on voice-enabled work processes are missing a big opportunity. They may be putting themselves, their partners, their customers and their workers at risk of being out-produced by competitors that have already learned to work faster with voice as a tool available to the workforce.

To the vendors supplying hardware and AR devices for the industrial enterprise, like Google, Vuzix and RealWear, we say: offer more robust support for voice functionality, with better-quality microphones, rugged design and on-board noise reduction that meet the needs of the industrial workplace.

To our partners developing speech engines and system software, we communicate enterprise concerns about data security and privacy, and we push for features that let customers configure and control how their data is processed and where it resides in the cloud.

Finally, to industrial enterprise customers, we say: approach the issues of cloud-based systems with an open mind, especially when those systems promise to unlock capabilities that can help your business and employees operate more productively, with greater safety and quality control.

We have been using voice input as a keystone of our AR technology approach for years and see it delivering real results in customer scenarios every day. Eventually, when paired with future technologies like computer vision and more immersive augmented reality, it will open a whole new set of possibilities for advanced manufacturing and other industries.

So let’s start the conversation.

All IoT Agenda network contributors are responsible for the content and accuracy of their posts. Opinions are of the writers and do not necessarily convey the thoughts of IoT Agenda.

1  Comment on this Post

  • ciparkinson

    We at RealWear have for many years considered voice to be the primary interface for any successful hands-free tool such as our HMT-1.


    We designed the HMT-1 for voice from the beginning. We chose our microphones carefully. We chose the microphone locations carefully. We then selected the best-of-breed voice recognition and noise cancellation solutions, and spent years marrying all of these parts together, tuning and optimizing at every level. And in return we have created what we believe is the most robust, intuitive, always-listening hands-free interface available today. Further, it works just as well in noise; lots of noise!


    On the software front we also addressed the biggest challenges around voice: we do not offer up an SDK; we do not ask our enterprise users to rewrite their applications to work with voice. All of this happens automatically. Just drop an existing Android application onto our headset and it can immediately be driven by voice.


    We did not add voice as an afterthought. We do not have navigation buttons or a swipe panel. We truly are hands-free. It takes a lot of know-how to get here, but we are here and I invite you all to take our HMT-1 for a test drive and sample an enterprise-ready voice interface for yourselves.


    Dr. Chris Parkinson

    Chief Technology Officer

    RealWear Inc.
