Retrieve LLM Token Usage in AI Agents

Hello!

I would like to extract the completion and prompt token counts when using AI Agents. It would be highly beneficial for keeping track of my costs.

I have tried the different workarounds presented in Similar post 1 and Similar post 2, but they don’t work for AI Agents. There is no direct way to do it with n8n nodes.

Please let me know if anyone has managed to successfully retrieve this token info.

Thanks!

Joining the request. I have no idea how to count app usage per user.
Even though the number is shown in the Chat Model section, it’s unclear how to get it from there.

2 Likes

Same here. This isn’t a part of the AI Agent node from what I’ve seen… However, I think it should be. It could be handled similarly to the “tools” leg; maybe it should be a “log” leg or similar. For instance, I would like to know the input/output tokens per request, or maybe some logs around each request.

2 Likes

#following
This is a feature I would need too.
Suggestion to the n8n_team: Have the AI Agent output its token usage

3 Likes

Voting up here too. For many use cases it would make a real difference to know the token usage. For some projects I just dropped the default LLM model node, or even the agent for that matter.

6 Likes

Yes, this is a very important feature for those looking to control the operational costs of the flow or create a log of what the client is consuming in tokens. The use of AI is closely related to performance and efficiency, and through tokens, we can make many optimizations.

3 Likes

I got here looking for the exact same thing 🙂 — so another vote for this, please.

2 Likes

Another vote here ✋

2 Likes

Essential feature to track costs!

5 Likes

This would also really help me out; currently looking for a workaround!

This would be great, along with a way to send calls to LangSmith or Helicone.

1 Like

Have we seen the n8n team provide features requested by the community in the past?

I mean, I’ve only been using n8n for a few months, and of course it has its limitations, but I don’t know the devs’ level of responsiveness. In your experience, does it happen? Often? And with what kind of delay on average?

I want to second this request as well, but maybe take a simpler approach:
You can use an LLM proxy such as LiteLLM to do a lot of things like measuring usage, associating costs etc.
I think with the focus shift of n8n from an integration platform to an AI platform, having any sort of observability will make n8n increasingly valuable in the enterprise tier as well.
Consider integrating with Datadog LLM observability.

It should be relatively straightforward to collect token usage from an AI agent and expose it as metrics; that would at least be the simplest form.
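For anyone wanting to try the LiteLLM route mentioned above: the rough idea is to run the LiteLLM proxy in front of your model provider and point n8n's OpenAI credential base URL at it, so the proxy records usage and cost per key. A minimal config sketch (model names and the environment-variable reference are illustrative; check the LiteLLM proxy docs for the current syntax):

```yaml
# litellm_config.yaml — minimal proxy config sketch (assumed values)
model_list:
  - model_name: gpt-4o            # name n8n will request
    litellm_params:
      model: openai/gpt-4o        # upstream provider/model
      api_key: os.environ/OPENAI_API_KEY
```

Then start the proxy (e.g. `litellm --config litellm_config.yaml`) and set the n8n OpenAI credential's base URL to the proxy address; LiteLLM's spend-tracking endpoints can then report tokens and cost per virtual key.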

We hope this feature releases soon.

I hope this will be done soon

I’m also looking for this feature

+1 for the feature

I’m looking for a solution like this too

It is worth noting here that n8n can send all data to LangSmith, which can track costs, and we are using it that way.

However, the downside is that I have no idea how to distinguish the data by workflow, or by anything other than the LLM model.
Basically it tracks “everything per n8n instance”.

Here’s a workaround:

You can use an n8n: Get Execution node with the “Include Execution Details” option enabled.

This will retrieve information about the entire execution, including Chat Model usage details in the following format:

{
  "tokenUsage": {
    "completionTokens": <number>,
    "promptTokens": <number>,
    "totalTokens": <number>
  }
}

Therefore, you can set up a separate workflow that you trigger with the execution ID after your AI Agent workflow completes.

It might sound a bit cumbersome, but it should do the trick.
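To build on this workaround: once the Get Execution node returns the execution details, the tokenUsage objects are nested somewhere inside the run data, so a recursive walk that sums every tokenUsage it finds avoids depending on the exact nesting. A sketch for an n8n Code node (the execution-object shape and the helper name `sumTokenUsage` are illustrative, based only on the tokenUsage format shown above):

```javascript
// Recursively sum every tokenUsage object found in an execution's data.
// Assumes tokenUsage entries look like the format shown above:
// { completionTokens, promptTokens, totalTokens }.
function sumTokenUsage(node, totals = { completionTokens: 0, promptTokens: 0, totalTokens: 0 }) {
  if (node && typeof node === "object") {
    if (node.tokenUsage && typeof node.tokenUsage === "object") {
      totals.completionTokens += node.tokenUsage.completionTokens ?? 0;
      totals.promptTokens += node.tokenUsage.promptTokens ?? 0;
      totals.totalTokens += node.tokenUsage.totalTokens ?? 0;
    }
    // Walk arrays and nested objects alike.
    for (const value of Object.values(node)) sumTokenUsage(value, totals);
  }
  return totals;
}
```

In a Code node you would call this on the Get Execution node's output item (e.g. `sumTokenUsage($input.first().json)`) and return the totals, giving one usage record per execution that you can log or write to a database for per-client cost tracking.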

5 Likes