[New Free Plugin] Tokenizer for GPT-3.5 & GPT-4

Hey fam, this is my first published plugin. It's super simple but useful: it counts the tokens in an inputted text.

Link: Tokenizer Plugin | Bubble

What does the Tokenizer do?
It accurately calculates the token count of a given text input using GPT-3 and GPT-4’s byte pair encoding (BPE) method. With seamless integration into your Bubble.io app’s workflows, this plugin helps you better understand text complexity and manage API usage when working with GPT-3 and GPT-4 powered applications.

Built using this package suggested by OpenAI - gpt-3-encoder - npm
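Not the plugin's actual BPE encode, but if you just want a quick ballpark without installing anything, OpenAI's rough rule of thumb of ~4 characters per token (for English text) can be sketched in plain JS:

```javascript
// Rough token estimate using the ~4 characters per token rule of thumb.
// This is NOT the plugin's BPE count -- just a cheap sanity-check figure.
function estimateTokens(text) {
  return Math.ceil(text.length / 4);
}

console.log(estimateTokens('Hello, Bubble world!')); // 20 chars -> ~5 tokens
```

For an exact count you still want the BPE method the plugin uses; the heuristic drifts for code, non-English text, and unusual punctuation.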

Example of use:

  • Inform users/your workflows when the text input exceeds a certain token count, prompting them/your workflows to shorten the text or break it into smaller chunks.
  • Estimate the size of an API call before sending it, helping you manage API usage, avoid unexpected costs, and prevent API errors.
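The first use case can be sketched as a simple guard, assuming you already have the token count (e.g. from the plugin's output). The 4096 limit here is just an example; use your model's actual context size:

```javascript
// Hypothetical pre-flight check before sending text to a GPT API.
// tokenCount would come from the plugin's 'Get token count' step.
const TOKEN_LIMIT = 4096; // example only -- check your model's real context window

function checkTokenBudget(tokenCount, limit = TOKEN_LIMIT) {
  if (tokenCount > limit) {
    // Caller should shorten the text or split it into chunks.
    return { ok: false, excess: tokenCount - limit };
  }
  return { ok: true, excess: 0 };
}

console.log(checkTokenBudget(5000)); // over budget by 904 tokens
```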

How to use:

  • Install the plugin.
  • In workflows, find ‘Get token count’.
  • The input takes your text.
  • The output is available in the following workflow step via the expression “Result of step ‘#’ (Get token count)'s token count”.

Putting more than 10k words into it can potentially time out the plugin, so test really large docs before going to production and have measures in place in case it fails. But for most use cases 10,000 words is enough!

The plugin page doesn’t have a direct link yet, so just search for it.

Hope someone finds it useful, but I built it for me so I’m happy regardless :grin:


I love it when someone has already done the thing I was looking for… will test it out now!


Great to hear, I hope it helps! It’s not super fast, but it’s been very useful over the 2 weeks I’ve used it.

Separate question… but since you’ve done the counting.

What do you use to split docs, if you’ve needed to do this for ‘chunking’?

This post is what you want I reckon - Langchain with vector database connected to Bubble - #46 by jeffbuze

The whole thread is good
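If you just need something quick in the meantime, a naive word-based split is easy to sketch (not token-aware; the thread above covers proper chunking strategies):

```javascript
// Naive word-based chunking -- a rough stand-in for token-aware splitting.
// maxWords is a hypothetical per-chunk budget you'd tune for your model.
function chunkByWords(text, maxWords) {
  const words = text.split(/\s+/).filter(Boolean);
  const chunks = [];
  for (let i = 0; i < words.length; i += maxWords) {
    chunks.push(words.slice(i, i + maxWords).join(' '));
  }
  return chunks;
}

console.log(chunkByWords('one two three four five', 2));
// -> [ 'one two', 'three four', 'five' ]
```

It ignores sentence boundaries and token counts entirely, so treat it as a placeholder until you wire up something like the Langchain approach from the thread.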

Many thanks!!!