Realtime Voice transcription with Assembly AI & Deepgram

lindsay_knowcode · February 27, 2023, 9:55pm

The plugin lets you embed Assembly AI into your Bubble app and do real-time voice transcription.

Demo page: https://planbbackups.io/assemblyai
Editor: AssemblyAI | Bubble Editor
Marketplace: Assembly.ai Realtime Transcription Plugin | Bubble

The Plugin has two controls - start and stop.
The Plugin has some useful exposed states:

transcription - the transcribed audio as text.
isRecording - yes/no - useful for showing to your users that they are recording.
status - a clue as to what is happening internally with the recording/transcription status.
length seconds - how long the recording/transcription is so far - useful for throttling the length of transcription.
is final transcript? - whether the final transcript has been sent.

And three events - useful for wiring into your workflows:

recording starting
recording stopping
final Transcript Received

Optionally, when you start a recording you can raise the level of console logging. The default is “no”, which logs fewer debug messages.

The plugin also has a backend action, Assembly AI - get temp token, which returns a temporary token that is safe to use within the web browser. This means your Assembly AI key is not revealed in the browser, and that only requests from your Bubble app for a temporary token are processed.
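For anyone curious what a temp-token call looks like outside Bubble, here is a minimal sketch. It is not the plugin's code. The endpoint URL, the `expires_in` field, and the helper name `build_temp_token_request` are assumptions based on AssemblyAI's realtime API as it was documented around the time of this thread, so check the current docs before relying on them. The function only builds the request. Sending it belongs on the server side, so the real key never reaches the browser.

```python
import json
import urllib.request

# Assumption: AssemblyAI's realtime token endpoint at the time of this thread.
TOKEN_URL = "https://api.assemblyai.com/v2/realtime/token"


def build_temp_token_request(api_key, expires_in=3600):
    """Build (but do not send) a POST request asking for a short-lived token.

    api_key is the long-lived secret key and must stay on the server.
    expires_in is the token lifetime in seconds (assumed field name).
    """
    body = json.dumps({"expires_in": expires_in}).encode("utf-8")
    return urllib.request.Request(
        TOKEN_URL,
        data=body,
        headers={"authorization": api_key, "content-type": "application/json"},
        method="POST",
    )
```

A server would pass the request to `urllib.request.urlopen` and hand only the returned token to the browser.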

Purchase of this plugin comes with reasonable support to get you up and running.

lindsay_knowcode · March 3, 2023, 2:15pm

If you are looking for the Deepgram Realtime Transcription Plugin, see here:
https://planbbackups.io/deepgram

Deepgram is very similar to AssemblyAI - perhaps with slightly lower pricing and a more generous “free” account. Both have very good accuracy.

rod.danan · March 17, 2023, 2:31am

Hey Lindsay. Thanks for setting this up. Any idea why the recording wouldn’t start? My input says “Starting…” but I don’t get a pop-up to allow recording and no audio is transcribed.

EDIT: Had to add “Token” in front of the API key. Working well now!
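For reference, the “Token” prefix is part of how Deepgram expects the Authorization header to look. Below is a small sketch of a helper that adds the prefix whether or not it was already typed in. The function name `deepgram_auth_header` is made up for illustration, and this is not how the plugin itself handles the key.

```python
def deepgram_auth_header(api_key):
    """Return an Authorization header in the form 'Token <key>'.

    Accepts the key with or without a leading 'Token ' so the prefix
    is never missing and never doubled.
    """
    key = api_key.strip()
    if key.lower().startswith("token "):
        # Drop an existing prefix before re-adding it in canonical form.
        key = key[len("token "):].strip()
    return {"Authorization": f"Token {key}"}
```

Calling it with either `"abc123"` or `"Token abc123"` gives the same header.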

EDIT 2: How can I clear the Deepgram transcript? I am building a mock interview tool where users record their responses to be transcribed. Each answer should start fresh, but right now it continues the previous conversation.

lindsay_knowcode · March 17, 2023, 7:09am

Fixed - refresh your Editor to pick up the new Plugin version. Thanks for reporting.

billyzhou271 · March 29, 2023, 8:12am

Hi Lindsay! I am very interested in the plugin. I want to build an app that can transcribe video in real time, so I may need the plugin to record the system output audio, instead of the mic input. I don’t know if that is possible. Thanks

lindsay_knowcode · March 29, 2023, 8:20am

Interesting! Yes, I can imagine a use case where you are in the browser doing a video call and the real-time transcription appears underneath - like a few of the Teams/Zoom/Gmeet tools do it.

It’s likely feasible - just grab the audio track from the video stream … I say “just”. Browsers have HTML5 audio and video support built in - basically the browser does all the heavy lifting.

billyzhou271 · March 29, 2023, 8:39am

I’m thinking about opening zoom / video on one side and the web app on the other side to see the transcription happening in real time. Might be a bit hard to achieve I suppose. I could merge the mic input with the system output channel to see if that works

rod.danan · April 28, 2023, 8:32pm

Question: I got an email from Deepgram saying that Nova is their new model and that it is better and cheaper. Has this plugin been upgraded to use it?

lindsay_knowcode · April 28, 2023, 9:46pm

No specific upgrade, I’ll check in the morning. However, I thought that by default Deepgram was set to use the latest model.

Depends also on whether the model is enabled for real-time.

I’ll check tomorrow.

tursun.alkam · April 28, 2023, 11:38pm

I am using Deepgram via the API Connector. Their transcription is way faster than others, but the speed compromises the accuracy of the transcription.

lindsay_knowcode · September 1, 2023, 8:17am

I’m adding diarization (aka speaker identification) to realtime Deepgram shortly. Email lindsay@knowcode.tech or DM me to encourage me to do it sooner.

piriyakarthik · November 22, 2023, 3:58pm

I am using Deepgram Realtime Transcription. I am not able to start transcribing. I am getting the following in debug mode:

Transcription status - Ready to start transcribing

lindsay_knowcode · November 22, 2023, 4:13pm

This means the plugin was installed in your Bubble app - but is no longer installed. Which is weird.

piriyakarthik · November 22, 2023, 4:21pm

Deepgram Realtime Transcription appears under paid plugins.
Here are my settings:

I got an API key from Deepgram. I am using the same key for API Key and API Key- dev. Could this be the problem?

susindra1430 · December 1, 2023, 6:31pm

Hey Lindsay! Is there any chance we can set up a specific language?

lindsay_knowcode · December 5, 2023, 10:21pm

Good question - Assembly AI only supports English for real-time transcription (at this point; see “Supported languages” in the AssemblyAI documentation).

Deepgram supports more real-time languages (see “Models & Languages Overview” in the Deepgram documentation), and the plugin supports passing in the language parameter.
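Outside the plugin, the language is just a query parameter on Deepgram's live-streaming URL. Here is a rough sketch of building that URL. The helper name `build_listen_url` is hypothetical, and the example values (`es`, `nova`) are only illustrations. Whether a given language or model works in real time depends on Deepgram's current docs.

```python
from urllib.parse import urlencode

DEEPGRAM_LISTEN_URL = "wss://api.deepgram.com/v1/listen"


def build_listen_url(language="en", **options):
    """Build a Deepgram streaming URL with a language and optional extras.

    Extra keyword arguments (for example model="nova" or diarize=True)
    are passed through as query parameters unchanged.
    """
    params = {"language": language, **options}
    # Query strings want lowercase true/false, not Python's True/False.
    encoded = {
        k: str(v).lower() if isinstance(v, bool) else v
        for k, v in params.items()
    }
    return f"{DEEPGRAM_LISTEN_URL}?{urlencode(encoded)}"
```

For example, `build_listen_url("es")` asks for Spanish, and adding `model="nova"` would request a specific model.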

lindsay_knowcode · January 5, 2024, 6:10pm

Now also Speechmatics for real-time voice translation - translates into up to 5 different languages as you speak.

https://planbbackups.io/speechmatics

lindsay_knowcode · March 11, 2024, 11:27am

The Speechmatics translation is really fun when you then send it into ElevenLabs https://elevenlabs.bubbleapps.io/version-test? These translation & synthetic voices are getting incredibly good. It’s blowing my mind.

lindsay_knowcode · March 31, 2024, 10:50am

I’ve added an event for a Deepgram feature called “speech final”. This event happens when a speaker stops or pauses speaking. Optionally you can adjust the number of milliseconds to wait. Deepgram explains it in “End of Speech Detection While Live Streaming”.

You could use this, for example, to detect when a speaker has finished answering a question and then stop the recording.
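If you are handling Deepgram's streaming messages yourself rather than through the plugin, the check looks roughly like this. It is a sketch that assumes the JSON shape of Deepgram's live results (a `speech_final` flag alongside `channel.alternatives[0].transcript`). The function name `finished_utterance` is made up, and the plugin's internals may differ.

```python
import json


def finished_utterance(raw_message):
    """Return the transcript if this message marks the end of speech, else None.

    raw_message is one JSON text frame received from the streaming socket.
    """
    msg = json.loads(raw_message)
    if msg.get("type") != "Results" or not msg.get("speech_final"):
        return None
    alternatives = msg.get("channel", {}).get("alternatives", [])
    return alternatives[0].get("transcript", "") if alternatives else ""


# Example frame, trimmed to the fields used above.
sample = json.dumps({
    "type": "Results",
    "is_final": True,
    "speech_final": True,
    "channel": {"alternatives": [{"transcript": "that is my answer"}]},
})
```

A non-None return value is the point where you would stop the recording or move on to the next question.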

lindsay_knowcode · April 3, 2024, 9:56am

I’ve added speaker detection to the Deepgram real-time plugin. When Deepgram thinks a speaker has finished speaking, the plugin emits a “speaker change” event, and there is an exposed state of “current speaker id”.

Deepgram can accurately detect speaker changes. Even more amazing: if I put on a funny voice to try to trick Deepgram into thinking I am a different speaker, Deepgram knows it’s still me.
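To give an idea of how a “speaker change” can be derived, here is a sketch that walks the word list Deepgram returns when diarization is on and reports each point where the speaker id changes. It assumes each word carries a `speaker` number, as in Deepgram's diarized results. The function name `speaker_changes` is hypothetical and this is not the plugin's actual code.

```python
def speaker_changes(words, current_speaker=None):
    """Find the points in a word list where the speaker id changes.

    words is a list of dicts such as {"word": "hi", "speaker": 0}.
    current_speaker is the id carried over from the previous message.
    Returns (changes, current_speaker), where changes is a list of
    (new_speaker_id, first_word_of_that_speaker) tuples.
    """
    changes = []
    for w in words:
        speaker = w.get("speaker")
        # Words without a speaker id are skipped rather than treated as a change.
        if speaker is not None and speaker != current_speaker:
            changes.append((speaker, w["word"]))
            current_speaker = speaker
    return changes, current_speaker
```

Passing the returned `current_speaker` back in on the next message keeps the tracking continuous across messages.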
