N8n Community Node: GPT-Tokenizer

geckse · May 12, 2023, 8:23am

Hey, I’ve released a new node that should pair nicely with the OpenAI node:

GPT-Tokenizer
https://www.npmjs.com/package/n8n-nodes-gpt-tokenizer

So basically, I often found it hard to determine exactly how many tokens a prompt will take before submitting it to OpenAI. The textLength/4 rule of thumb didn’t really cut it for me.
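For reference, the length-divided-by-four rule of thumb mentioned above can be sketched in a few lines of TypeScript. The function name `estimateTokensByLength` is made up for illustration, and the ratio of roughly 4 characters per token is only an average for English text, so real BPE counts can differ noticeably.

```typescript
// Rough heuristic only: assumes about 4 characters per token on average.
// `estimateTokensByLength` is a hypothetical helper, not part of any library.
function estimateTokensByLength(text: string, charsPerToken: number = 4): number {
  if (charsPerToken <= 0) {
    throw new Error("charsPerToken must be positive");
  }
  // Round up so that a short, non-empty string still counts as one token.
  return Math.ceil(text.length / charsPerToken);
}

const samplePrompt = "Summarize the following article in three bullet points.";
console.log(estimateTokensByLength(samplePrompt));
```

The estimate depends only on string length, so it cannot account for how a BPE vocabulary actually splits code, non-English text, or unusual punctuation.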

Naturally we want to make that as efficient as possible, so I created this node.

With this node you can:

Encode a string into BPE tokens (may be cool for custom training)
Decode an array of BPE tokens back to a string (for funzies?)
Determine a string’s token length before submitting to the OpenAI API
Calculate costs before submitting to the OpenAI API
Split a text into chunks that exactly match a definable token limit
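To make the last two items more concrete, here is a minimal TypeScript sketch of cost estimation and token-based chunking. It is not the node's actual implementation: the helper names (`estimateCost`, `splitByTokenLimit`), the price argument, and the toy tokenizer are all assumptions for illustration. The tokenizer is passed in as a pair of functions so the logic stays independent of any particular library.

```typescript
// Tokenizer functions are injected so the sketch stays library-agnostic.
type Encode = (text: string) => number[];
type Decode = (tokens: number[]) => string;

// Estimate the cost of a token count, given a price per 1,000 tokens.
// The price is an input; look up the current rate for your model.
function estimateCost(tokenCount: number, pricePer1kTokens: number): number {
  return (tokenCount / 1000) * pricePer1kTokens;
}

// Split text into chunks of at most `maxTokens` tokens each.
function splitByTokenLimit(
  text: string,
  maxTokens: number,
  encode: Encode,
  decode: Decode
): string[] {
  if (maxTokens <= 0 || Math.floor(maxTokens) !== maxTokens) {
    throw new Error("maxTokens must be a positive integer");
  }
  const tokens = encode(text);
  const chunks: string[] = [];
  for (let i = 0; i < tokens.length; i += maxTokens) {
    chunks.push(decode(tokens.slice(i, i + maxTokens)));
  }
  return chunks;
}

// Toy stand-in tokenizer (one token per UTF-16 code unit), for demonstration only.
// It is NOT BPE; swap in a real encoder/decoder for actual token counts.
const toyEncode: Encode = (text) => text.split("").map((ch) => ch.charCodeAt(0));
const toyDecode: Decode = (tokens) => tokens.map((t) => String.fromCharCode(t)).join("");

const demoChunks = splitByTokenLimit("hello world", 4, toyEncode, toyDecode);
console.log(demoChunks);
console.log(estimateCost(toyEncode("hello world").length, 0.002));
```

In practice you would pass a real tokenizer instead of the toy one, for example the `encode` and `decode` functions exported by the gpt-tokenizer package, assuming they fit these signatures. One caveat with any byte-level BPE tokenizer: cutting at an arbitrary token boundary may split a multi-byte character across two chunks, so production code may want to cut at sentence or paragraph boundaries instead.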

It uses this npm package under the hood:
https://www.npmjs.com/package/gpt-tokenizer

Let me know what you think of it!

onurbolaca · May 12, 2023, 9:34am

Hey @geckse,

Thank you for your efforts Marcel!

Jayavel · May 12, 2023, 11:15am

This is great! Useful for many tasks that utilize OpenAI’s API.

I am on the lookout for using the Embeddings model as a node (creating embeddings, storing them in vector databases, and searching the embeddings). If you are aware of any options, pls point me to them.

geckse · May 15, 2023, 4:55pm

It might indeed be interesting to give the OpenAI node embedding capabilities, but I’m afraid there’s currently no node for that.

You could maybe get that done with the HTTP Request node.

It might also be worth noting that Weaviate comes with a built-in OpenAI vectorizer.

Maybe that helps with what you’re trying to do.

Jayavel · May 16, 2023, 7:19am

Hey, thanks for this. I got it working with the HTTP Request node, as you suggested. I was testing it with a free version of Pinecone (also via the HTTP Request node). I am able to generate, store, and query embeddings.

Thanks,
Jayavel S
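For anyone wanting to try the same route, here is a rough sketch of what the two HTTP requests might look like, written as plain TypeScript that only builds the request objects. This is not Jayavel's actual workflow: the default model name, the index host, the metadata shape, and the helper names are all placeholder assumptions, so check the OpenAI and Pinecone API docs for current details.

```typescript
// Minimal description of an HTTP request, e.g. to copy into an HTTP Request node.
interface HttpRequestSpec {
  method: "POST";
  url: string;
  headers: Record<string, string>;
  body: Record<string, unknown>;
}

// Request for OpenAI's embeddings endpoint.
// The default model name is an assumption; use whichever embedding model you have access to.
function buildEmbeddingRequest(
  apiKey: string,
  input: string,
  model: string = "text-embedding-ada-002"
): HttpRequestSpec {
  return {
    method: "POST",
    url: "https://api.openai.com/v1/embeddings",
    headers: { Authorization: "Bearer " + apiKey, "Content-Type": "application/json" },
    body: { model: model, input: input },
  };
}

// Request for a Pinecone index's upsert endpoint.
// `indexHost` is a placeholder for the host of your own index.
function buildPineconeUpsertRequest(
  apiKey: string,
  indexHost: string,
  id: string,
  values: number[],
  metadata: Record<string, string>
): HttpRequestSpec {
  return {
    method: "POST",
    url: "https://" + indexHost + "/vectors/upsert",
    headers: { "Api-Key": apiKey, "Content-Type": "application/json" },
    body: { vectors: [{ id: id, values: values, metadata: metadata }] },
  };
}
```

The embedding vector comes back in the `data[0].embedding` field of the OpenAI response, and that array is what goes into `values` for the upsert.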

geckse · May 16, 2023, 8:04am

Awesome! I’m sure there will be community nodes for the common vector databases in the future. If not, I’ll implement some myself once I finally take a deep dive into vector DBs.

Jayavel · May 16, 2023, 10:31am

Would love to see it.

pooria · September 1, 2023, 4:02pm

@geckse, is it possible to use your community node to automate turning text/data into embeddings and upserting them into Pinecone with particular metadata?

For example: take an RSS feed, CSV, or text, turn it into embeddings, assign metadata, and upload to Pinecone.

If so, do you have any sample workflows or tips on how to do it?

Kevin_Pilgrim · May 14, 2025, 9:19pm

Jayavel, is there any chance that you could share your solution for retrieving data from Weaviate in n8n? I’m running into similar issues and I’m not finding solutions online.
