LLM streaming in repeating group

Hi,

Has anyone figured out how to implement LLM streaming in a repeating group (RG) for a chatbot use case?

I’d like to display the partial response in the last message as it streams in, and then replace it with the final version saved in the DB once it’s completed. But I haven’t found the right way to do it…

Thanks for any advice!

Michal

When streaming was first released, my approach was to stream the response directly to the page, then once the stream finished → hide that and display the same message from the RG or database. It works, but as you mentioned it’s not ideal…

The core issue seems to be that the data is streamed over a live connection from an external API (like OpenAI, which uses server-sent events), whereas you’re trying to stream it from an RG. RGs don’t poll data continuously in the same way, which complicates things.

Also, I imagine the streamed data is saved only after the full response is received (otherwise it would be constantly writing to the database, and WU +++🚀). If that’s the case, then there’s technically nothing to stream from the RG during the process; the data only becomes available once the stream is complete.
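
For anyone curious what that pattern looks like outside of Bubble, here’s a minimal sketch in plain TypeScript against OpenAI’s streaming chat endpoint: render every chunk as it arrives, but write to the database exactly once, after the stream completes. `render` and `saveMessage` are hypothetical stand-ins for the on-page update and the single DB save, and the model name is just a placeholder.

```typescript
// Hypothetical stand-ins for the on-page update and the single DB write.
declare function render(partialText: string): void;
declare function saveMessage(fullText: string): Promise<void>;

async function streamCompletion(prompt: string): Promise<void> {
  const res = await fetch("https://api.openai.com/v1/chat/completions", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
    },
    body: JSON.stringify({
      model: "gpt-4o-mini", // placeholder; any chat model works
      stream: true,
      messages: [{ role: "user", content: prompt }],
    }),
  });

  const reader = res.body!.getReader();
  const decoder = new TextDecoder();
  let fullText = "";

  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    // Each SSE line looks like: data: {"choices":[{"delta":{"content":"…"}}]}
    // (A robust parser would buffer lines split across chunks; skipped here.)
    for (const line of decoder.decode(value, { stream: true }).split("\n")) {
      if (!line.startsWith("data: ") || line.includes("[DONE]")) continue;
      const delta = JSON.parse(line.slice(6)).choices[0]?.delta?.content ?? "";
      fullText += delta;
      render(fullText); // update the visible element with the partial text
    }
  }

  await saveMessage(fullText); // exactly one DB write, after the stream ends
}
```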

Hopefully someone else has figured out a cleaner approach to this.


This is how I approached it in my workflow after the user hits submit on the chat. The first thing you need is a group on the page to act as the “Stream Handler”, with a data type of “Text Stream”.

  1. "Create a new message" or however you’re populating the RG (You might need additional workflows here to populate the RG with this message, depends on how your DB is set up)
  2. I call the OpenAI API
  3. “Display data in a group/popup” with the “Stream Handler” as the group populating it with Result of Step 2's current Stream (this makes it so that the stream handler always contains the current stream of text)
  4. Make a change to a Message (Step 1) with Result of Step 2's completed Stream (What this does is basically write to the DB with the completed stream)

Now all the data streamed back will populate that element, which we can read from in our RG.

Finally, within the conditions for the last cell in the RG, we can conditionally populate its contents based on whether the “Stream Handler’s Stream is streaming” or the “Stream Handler’s Stream is not streaming”.

If it is streaming, we choose the “Stream Handler” as the data source; otherwise we choose the “Current cell’s message”, or however you set up your message objects.
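
If it helps to see that condition as plain code, here’s a rough TypeScript sketch of the last-cell logic; `StreamHandler` and `Message` just mirror the Bubble element and thing, and the field names are assumptions.

```typescript
// Mirrors of the Bubble pieces; names and fields are assumptions.
interface StreamHandler {
  isStreaming: boolean;  // Bubble's "Stream is streaming" state
  currentStream: string; // the text received so far
}

interface Message {
  text: string; // the final text saved to the DB in step 4
}

// The last cell's data source: live stream while streaming, saved text after.
function lastCellText(handler: StreamHandler, message: Message): string {
  return handler.isStreaming ? handler.currentStream : message.text;
}
```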


Thank you so much!

For those who aren’t great at English (like me), here is the guide rewritten by ChatGPT:

  • User clicks “Send”
    As soon as the user submits their question or prompt, you kick off two things at once:
  1. You create a new message record in your database (so you have a spot to save the final answer later).
  2. You start calling the OpenAI API with streaming turned on.
  • Set up your “Stream Handler”
    On your page, add a hidden or off-screen group (let’s call it the Stream Handler). Give it a custom data type called Text Stream. Its job is simply to hold whatever bits of text the API spits out as they arrive.
  • Populate the Repeating Group (RG)
    You probably already have a Repeating Group showing your chat history. Make sure that when you “create a new message” in step 1, that blank message appears immediately in the RG (even though its text is still empty).
  • Feed partial text into the Stream Handler
    As each little chunk of text comes back from the API:
  • Use a workflow action like “Display data in a group”, targeting your Stream Handler.
  • Set its data to Result of Step 2’s Current Stream.
    This means the Stream Handler always holds exactly what’s arrived so far—and nothing more.
  • Show the partial response on-screen
    Inside your last RG cell (the one for the new message), put a text element whose data source is conditional:
  • If the Stream Handler’s “is streaming” flag is TRUE, show the Stream Handler’s text.
  • Otherwise, show that message’s saved text (from the database).
  • Once streaming completes, save the full answer
    When the API finishes:
  • In your workflow, do “Make a change to a thing”, targeting the message you created in step 1.
  • Set its text field to Result of Step 2’s Completed Stream (i.e. the entire answer).
    Now the full answer is safely stored in your database.
  • Switch from streaming to saved text
    After you save the full answer, the Stream Handler’s “is streaming” flag turns OFF. That automatically flips your RG cell’s condition: it stops showing the temporary stream text and instead shows the final, saved message text.
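
To tie the whole sequence together, here’s the same flow as one hedged TypeScript sketch; `createMessage`, `openAiStream`, and `updateMessage` are hypothetical glue functions, not Bubble actions.

```typescript
// Hypothetical helpers standing in for the Bubble actions described above.
declare function createMessage(): Promise<{ id: string }>;               // step 1
declare function openAiStream(prompt: string): AsyncIterable<string>;    // step 2 deltas
declare function updateMessage(id: string, text: string): Promise<void>; // final save

// The hidden "Stream Handler" group.
const streamHandler = { isStreaming: false, text: "" };

async function onSend(prompt: string): Promise<void> {
  const message = await createMessage(); // blank message appears in the RG
  streamHandler.isStreaming = true;
  streamHandler.text = "";

  for await (const delta of openAiStream(prompt)) {
    streamHandler.text += delta; // "Current Stream": everything so far
  }

  await updateMessage(message.id, streamHandler.text); // "Completed Stream" to DB
  streamHandler.isStreaming = false; // flips the RG cell back to the saved text
}
```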

In case it’s helpful, here is my approach:

  1. API Configuration

    • Set up your API connector and initialize it to integrate with OpenAI or a similar AI service. Ensure you format the input text as JSON-safe to avoid syntax errors (see the escaping sketch after this list).
  2. Workflow Setup

    • Create a workflow for user interaction that will handle sending messages and receiving AI responses:
      • Initiate Message: On a user action (e.g., clicking send), create a user message in the database.
      • Create Assistant Response: Create an empty assistant response in the database with a yes/no field for tracking if streaming is in progress.
  3. Send Request

    • Send the formatted request to OpenAI with the user messages included. Use the API connector to handle this.
  4. Temporary Text Storage Group

    • Set up a temporary group (you can name it text_stream) on the page to hold the incoming text from the AI. This temporary group will display the stream before saving it to the database.
  5. Display Streamed Data

    • In the workflow:
      • Assign incoming stream data to the temporary group.
      • Continuously update this group as data streams in.
  6. Save Streaming Data

    • At the end of the streaming:
      • Save the full text to the database.
      • Update the streaming field to “no”.
  7. Conditional Display Logic

    • Set up a repeating group to show messages:
      • Default state: Show text saved in the database.
      • Streaming state: When streaming is “yes”, show text from the temporary group.
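
On the JSON-safe point from step 1: raw user text pasted into a JSON body breaks on quotes and newlines. Here’s a small TypeScript illustration of the escaping, which is roughly what Bubble’s `:formatted as JSON-safe` operator handles for you (the model name is just a placeholder):

```typescript
// JSON.stringify escapes quotes, backslashes, and newlines in the user text,
// keeping the request body valid JSON.
function buildRequestBody(userText: string): string {
  return JSON.stringify({
    model: "gpt-4o-mini", // placeholder model
    stream: true,
    messages: [{ role: "user", content: userText }],
  });
}

// A prompt containing a quote and a newline still produces valid JSON:
console.log(buildRequestBody('He said "hi"\nthen left'));
```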

Thank you everyone, this sheds some light on handling the streaming response.

I’m curious if anyone has tried to implement the streaming using backend workflows. Is this even possible?

I believe this is covered in Bubble’s docs on streaming. I’m not sure how useful it would be, though: streaming doesn’t write each character to the database in turn, so based on the docs I think a stream handled in a backend workflow would appear to the user as non-streamed content.
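
As a rough illustration of why (under the assumption that the backend only writes once): the server can consume the stream chunk by chunk, but if nothing reaches the page until the final database write, the user just sees the finished text appear at once. `openAiStream` and `updateMessage` are the same kind of hypothetical helpers as in the sketches above.

```typescript
// Hypothetical helpers; the point is where the chunks go, not the API shape.
declare function openAiStream(prompt: string): AsyncIterable<string>;
declare function updateMessage(id: string, text: string): Promise<void>;

async function backendWorkflow(messageId: string, prompt: string): Promise<void> {
  let fullText = "";
  for await (const delta of openAiStream(prompt)) {
    fullText += delta; // chunks arrive server-side; the browser never sees them
  }
  await updateMessage(messageId, fullText); // one write; the RG updates once
}
```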

I have a further question regarding this. I got the streaming and everything working, but when the “Display data” action runs in the workflow, the whole repeating group scrolls to the top. For a chat this is really annoying. My workflow is a bit messy, but it works in this order:

  1. Make a new message
  2. Scroll to the last message (otherwise the user always has to scroll; if you have a better solution for this, I’d appreciate it)
  3. Create embeddings of the message
  4. Search Pinecone for matching data
  5. Set streaming to “yes”, so my message doesn’t start streaming too early and show old stream data
  6. Request ChatGPT, including old messages for history
  7. Display data (here the RG scrolls to the top for some reason)
  8. Convert Markdown to BB-Code so it looks pretty (I can’t get ChatGPT to write BB-Code itself; it just ignores me for some reason; see the conversion sketch after this list)
  9. Make changes to the message from step 1: set streaming to “no” and use the BB-Code answer
  10. Create another message containing ChatGPT’s response, which is hidden but important for the history
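
On the step-8 conversion, here’s a rough TypeScript sketch of Markdown → BB-Code using a few regex passes. Real Markdown needs a proper parser; this only covers bold, italics, and inline code as an illustration.

```typescript
// A few regex passes turning common Markdown into BB-Code.
// Bold runs first so its double asterisks aren't eaten by the italic rule.
function markdownToBBCode(md: string): string {
  return md
    .replace(/\*\*(.+?)\*\*/g, "[b]$1[/b]")    // **bold**  -> [b]bold[/b]
    .replace(/\*(.+?)\*/g, "[i]$1[/i]")        // *italic*  -> [i]italic[/i]
    .replace(/`([^`]+)`/g, "[code]$1[/code]"); // `code`    -> [code]code[/code]
}

console.log(markdownToBBCode("**Hello** *world*, `x = 1`"));
// -> [b]Hello[/b] [i]world[/i], [code]x = 1[/code]
```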

Got it to work a minute after I sent this in despair: I just used a plugin to reverse the repeating group. Not sure if this is optimal; it still seems like a lot of wasted WU… hopefully Bubble makes this easier somehow.
