✍ ᴺᴱᵂ ᴾᴸᵁᴳᴵᴺ Google Video AI - Transcribe Video (incl. Automated Google Environment Setup!)

redvivi · February 22, 2021, 2:09pm

Hi Bubblers !

With this plugin, you can transcribe spoken audio in a video into text and returns blocks of text for each portion of the transcribed audio, along with the speaker within a .MOV, .MPEG4, .MP4, .AVI, or any ffmpeg decodable video file format, provided as input.

The use-case ranges from automated captioning, simple archiving, categorising, enhanced search purposes of your video portfolio to SEO improvement.

The supported language are specified here: Speech-to-Text 支持的语言 | Cloud Speech-to-Text 文档 | Google Cloud

The plugin provides :

a Visual Element to detect the video duration,
a first Workflow Action to trigger the analysis.
a second Workflow Action to return the analysis progress rate, completion status, and when completed, a list of transcriptions. For each, it returns a list of words with related timestamps, confidence rate, and the speaker(s).

You can test out our Google Video AI - Transcribe Video Plugin with the live demo here.

Enjoy !
Made with by wise:able
Discover our other Artificial Intelligence-based Plugins

lankri.erez · February 23, 2021, 1:20pm

Hello,
Is there an option for a simplified version in which we just send a request with a http address and get back just the transcript?(without all the extra data) without the do every 5 seconds and all of that?
It’s almost impossible to start adding a do every 5 seconds workflow into a big system, will make things heavy.
Thanks

redvivi · February 23, 2021, 1:42pm

Hello,

Thanks for your message.

As mentioned in the instructions, the implementation of this plugin is asynchronous, which means that a request is sent first, processed in Google Cloud Platform, and once completed, is sent back on requestor request.

The asynchronicity is required for large media such as long audio and especially video, because Bubble.io platform allows an action to run a specific amount of time before being killed by Bubble engine, also known as timing out, typically around 30 seconds.
This duration is simply too short for GCP to process the video file and get the transcript back, hence the asynchronous operation.

Even if it would be synchronous (e.g. what you are requesting) and if there would be no timeout, the action would run and hang the application for dozen of seconds if not minutes, pending completion from the GCP platform, which is not sustainable and against architectural best practices.

You can find more information about these operations here: Long-running operations | Cloud Video Intelligence API Documentation

Alternatively, feel free to change the polling interval, we used 5 seconds in our demo but it can be any other value.

Should you required any further info, feel free to DM us to investigate specifically your case.

lankri.erez · February 23, 2021, 2:49pm

Thanks for the elaborated reply!
Is there an option to make a call and then schedule a workflow say in 2 minutes time(by then most likely the api response is ready) and retrieve the transcription into the desired field?

redvivi · February 23, 2021, 3:02pm

Sure, simply enter a different value in the Action start interval, as mentioned before:

If you do not want some actions to run when no analysis is expected, use the “Only when” field in the workflow. Then the action will execute only for the test you define, as done in our demo.

lankri.erez · February 23, 2021, 3:34pm

I’ll just elaborate my situation and maybe it will clear it out.
I have a screen where a user can upload several videos into, for each of these videos i would like to get a transcribe saved into a field of a data type holding the video url and a transcribe(text) field. The amount of videos is dynamic and changing.
How can that be achieved?

redvivi · February 23, 2021, 4:53pm

Hey @lankri.erez,

Please provide us your editor link and access to your app in DM, we would like to have a look.

Thanks.

redvivi · December 27, 2021, 9:31am

And yes, only for you Bubblers, this plugin now supports speaker diarizarion

rick.mooberry · January 30, 2022, 1:04am

This is super exciting and just want I am looking for!

I was wondering if there is a cost (presumingly from google) to make calls out to their API? I see the cost for the plugin but am unfamiliar with google’s side of the house.

redvivi · January 30, 2022, 9:44am

Hey @rick.mooberry !

You will find the associated pricing here: Pricing | Cloud Speech-to-Text | Google Cloud

Topic		Replies	Views
:writing_hand: ᴺᴱᵂ ᴾᴸᵁᴳᴵᴺ AWS Transcribe - Audio & Video (incl. Speech Recorder & Automated AWS Environment Setup!) Showcase	4	830	November 29, 2022
:abcd: [New Plugin] Google Video AI - Detect Text (OCR) Showcase	0	589	March 26, 2021
:speech_balloon: ᴺᴱᵂ ᴾᴸᵁᴳᴵᴺ Google Cloud - Speech to Text (incl. Speech Recorder + Automated Google Environment Setup!) Showcase	31	4148	November 29, 2022
Video Tutorial - Low Code AI - Transcription - with Bubble and AWS Showcase	36	3409	November 1, 2021
[New Plugin] Groq AI with Groq multimodal, Vision, Chat and Speech to text with a lightweight audio recorder Plugins	4	93	February 3, 2025

✍ ᴺᴱᵂ ᴾᴸᵁᴳᴵᴺ Google Video AI - Transcribe Video (incl. Automated Google Environment Setup!)

Related topics