Deepgram continues to release news about extending its voice AI into new areas. Today’s news concerns a collaboration with IBM. As with almost all new technologies, I see the potential uses but also things I don’t care for. Voice AI, meaning speech-to-text (STT) and text-to-speech (TTS), has many uses, but I become annoyed when I call customer service only to discover I’m talking with a computer that has limited resources for help rather than with a human. I guess that’s the future.
IBM and Deepgram announced a collaboration to integrate Deepgram’s speech-to-text (STT) and text-to-speech (TTS) capabilities into IBM’s watsonx Orchestrate generative AI solution.
To address client needs for highly performant, enterprise-grade transcription and real-time captioning, IBM will embed Deepgram’s capabilities into watsonx Orchestrate. This collaboration makes Deepgram IBM’s first voice partner, bringing voice AI technology that helps enterprises automate their operations and meet the growing demand for conversational AI technology, including advanced speech-to-text voice recognition so users can interact with digital agents using natural speech.
Many organizations are adopting AI-powered speech-to-text systems to automate transcription while handling real-world audio conditions, including background noise, diverse accents, and real-life dialog. This integration addresses these challenges by offering a wider range of languages and dialects, including dozens of Arabic and Indian variants, along with voices that reflect regional accents. It also adds options for custom tuning, real-time captioning, and natural-sounding speech.
These technologies open new possibilities for enhanced automated customer care and support, call analysis, and voice-driven data entry in fields like healthcare and finance.