Winning enterprises over with speech to text
MyCaption combines voice recognition technology with human ears and deep integration into the handset to create an enterprise class text mobile speech-to-text platform
Industry News
Blogs
Briefing Room
advertisement
MyCaption Chief Executive Officer Vipul Bhatt thinks it’s time to get serious about speech to text. The technology has been around for years powering auto-dictation and voice-search applications like Vlingo’s and voicemail-to-text services such as SpinVox’s, but most of those services have been targeted at consumers and prosumers. The most obvious target for speech-to-text technology, the enterprise, has been reluctant to embrace it for the simple reason that there is no enterprise-caliber platform to embrace, Bhatt said.
MyCaption is hoping to change that. The Bay Area startup has launched what it claims to be the first speech-to-text for the serious business user. It combines the network-based voice-recognition technology of all speech-to-text engines with human editors to refine translation. But Bhatt said its platform is more than just an auto-dictation service, describing the technology as voice-to-data rather than speech-to-text.
MyCaption relies on a robust middleware client that integrates closely with the productivity applications of a BlackBerry smartphone as well as the Outlook exchange server back in the office. A MyCaption user can set calendar entries and initiate new tasks as well as dictate email and personal memos up to 3 minutes in length. Once a voice message is converted, it is sent back to the phone for review, but the user can perform in-message editing all by voice. And once approved, the message or calendar entry isn’t sent through a proxy server or saved in a separate client but slotted right back into the BlackBerry productivity app and synched back to the enterprise server.
The BlackBerry client records speech and instructions and sends them as a file over the packet data network, where it is run through a voice-recognition engine and then passed on to a human editor. Due to the complexity of the messages, the vast majority of them have to be heard by human ears, Bhatt said.
“Honestly, we are considering doing away with the voice recognition,” Bhatt said. “Business users’ demands for accuracy are so high, we need the precision of a human editor to meet those standards.”
All speech-to-text applications rely on human translators to varying degrees to handle garbled phrases and nuances of speech that a speech engine simply can’t handle, but Bhatt said MyCaption has de-emphasized its speech-recognition technology for all but the most basic of messages. Whenever the speech engine encounters a sound or word order it does not recognize or even a homonym it cannot contextualize, it immediately shoots the message up to translator. For an enterprise user sending an email to an important client or booking a crucial meeting, that level of redundancy is key, Bhatt said.
But while the human element meets the accuracy requirements of an enterprise, it can also compromise its security requirements. A completely automated voice-to-text system is as secure as its transmission channels and server protection. With human translators, a live person is listening to what could be confidential communications. MyCaption has taken precautions to limit that exposure: Its human translators only receive the raw voice files and aren’t given any clue as to the individual identity, company or phone number of the person speaking. No messages are stored either in voice or text form—all are erased as soon as a transaction is finalized. But Bhatt acknowledged that companies in highly sensitive industries will continue to have security concerns. In the future, he said, MyCaption hopes to implement even further technological and methodological safeguards that will mask the context of the message without masking the content itself.Want to use this article? Click here for options!
© 2012 Penton Media Inc.
advertisement
Learning Library
Webcasts
Using Real-Time Offers, Alerts and Interactions To Improve the Mobile Broadband Experience
In this Webinar you will learn how to create a real-time relationship with your customers, how to proactively improve the customer experience, and how to successfully target and cross-sell services to boost incremental revenue.
- Megabytes to Megabucks, Bandwidth to Business Models: How 4G Is Changing Everything
- How to Unplug Your Redundant Telco Apps To Save Money and Improve Efficiency
- When IaaS Isn't Enough: Service Provider Business Models to Drive Growth and Build Margin
- How to Transform Your Aging Telco Voice Network to Drive New Profits and Revenue
- Creative Licensing Approaches for Telcos & Their Network Equipment Vendors
- Smart Home Opportunity: Balancing Customer Data & Privacy
White Papers
The Role of Diameter in All-IP, Service-Oriented Networks
This paper discusses the rise of Diameter and benefits of Diameter Protocol.
- Conducting The Orchestration – Order Management at the Speed of Business
- Toward a Converged Network Edge
- Beyond Spam – Email Security in the Age of Blended Threats
- 6 Important Steps to Evaluating a Web Filtering Solution
- The Expertise to Protect You from Botnet and DDoS Attacks
- Seeing is Believing – Bridging the Order Visibility Gap
Featured Content
A time and money saving approach to fiber deployment
Service providers are under tremendous pressure to turn up new services faster then before and, at the same time,
to do it at less expense - and intra-office fiber is one of the biggest challenges in terms of both cost and service
turn-up.
of interest
The Latest
News
From the Blog
Briefingroom
Join the Discussion
Resources
Get more out of Connected Planet by visiting our related resources below:
Connected Planet highlights the next generation of service providers, as well as how their customers use services in new ways.
Subscribe Now







