Describe the problem/error/question
I am using the AI Agent node with the Claude API. One of my requests to Claude is about 140k tokens, which hits the rate limit. I want to do two things to improve this.
- Can I remove the user prompt from every request after the first? The screenshot below shows the sequential requests the agent makes, and each one seems to include the user prompt, which I imagine is redundant and takes up valuable tokens. Since this session has memory, I think the user prompt, or even the system prompt, only needs to be sent once.
- Also referring to the screenshot below: I make 4 requests, and all of them pass except the last one. Is there a node I can use to batch the calls?
In short, I make 4 requests and the last 2 are above 140k tokens each. What options do I have?
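To illustrate what I think is happening, here is a rough sketch. It assumes the chat API is stateless, so every call in the agent loop resends the system prompt, the user prompt, and all earlier tool results. The function name and every token count are hypothetical placeholders I picked for illustration, not measurements from my workflow, and the sketch ignores the tokens used by the assistant's own tool-call messages.

```python
# Rough model of an agent loop against a stateless chat API: each call
# resends the prompts plus every tool result produced so far.
# All numbers are hypothetical placeholders, not real measurements.

def input_tokens_per_call(system_tokens, user_tokens, tool_result_tokens):
    """Return the input size of each sequential call in the loop.

    The first call carries only the prompts; each later call also
    carries the results of all tool calls made before it.
    """
    history = system_tokens + user_tokens
    sizes = [history]
    for result in tool_result_tokens:
        history += result
        sizes.append(history)
    return sizes


sizes = input_tokens_per_call(
    system_tokens=1_000,                          # hypothetical
    user_tokens=2_000,                            # hypothetical
    tool_result_tokens=[20_000, 120_000, 5_000],  # hypothetical
)

# Share of the final request that is just the user prompt.
user_share = 2_000 / sizes[-1]

print(sizes)
print(f"user prompt share of last call: {user_share:.1%}")
```

If the real numbers look anything like this, the accumulated tool results would matter more than the repeated user prompt, but I have not verified that for my workflow.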
What is the error message (if any)?
{ "status": 429, "headers": { "anthropic-organization-id": "56f9697a-4d0c-4893-8667-9b9512dbd0b7", "anthropic-ratelimit-input-tokens-limit": "40000", "anthropic-ratelimit-input-tokens-remaining": "0", "anthropic-ratelimit-input-tokens-reset": "2025-06-04T17:47:28Z", "anthropic-ratelimit-output-tokens-limit": "8000", "anthropic-ratelimit-output-tokens-remaining": "8000", "anthropic-ratelimit-output-tokens-reset": "2025-06-04T17:45:41Z", "anthropic-ratelimit-requests-limit": "50", "anthropic-ratelimit-requests-remaining": "50", "anthropic-ratelimit-requests-reset": "2025-06-04T17:45:41Z", "anthropic-ratelimit-tokens-limit": "48000", "anthropic-ratelimit-tokens-remaining": "8000", "anthropic-ratelimit-tokens-reset": "2025-06-04T17:45:41Z", "cf-cache-status": "DYNAMIC", "cf-ray": "94a93ecf9804f09a-DFW", "connection": "keep-alive", "content-length": "529", "content-type": "application/json", "date": "Wed, 04 Jun 2025 17:45:41 GMT", "request-id": "req_011CPobeKF8LnxAZnGFUqakc", "retry-after": "52", "server": "cloudflare", "strict-transport-security": "max-age=31536000; includeSubDomains; preload", "via": "1.1 google", "x-robots-tag": "none", "x-should-retry": "true" }, "request_id": "req_011CPobeKF8LnxAZnGFUqakc", "error": { "type": "error", "error": { "type": "rate_limit_error", "message": "This request would exceed the rate limit for your organization (56f9697a-4d0c-4893-8667-9b9512dbd0b7) of 40,000 input tokens per minute. For details, refer to: https://docs.anthropic.com/en/api/rate-limits. You can see the response headers for current usage. Please reduce the prompt length or the maximum tokens requested, or try again later. You may also contact sales at https://www.anthropic.com/contact-sales to discuss your options for a rate limit increase." } }, "lc_error_code": "MODEL_RATE_LIMIT", "attemptNumber": 7, "retriesLeft": 0 }
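For reference, a minimal sketch of how the wait time could be read from the headers in the error above. The header names and values are copied from that response; the helper name `seconds_until_retry` and the fallback to the reset timestamp are my own assumptions, not something n8n or the Anthropic SDK provides.

```python
from datetime import datetime, timezone


def seconds_until_retry(headers, now=None):
    """Pick a wait time (in seconds) from 429 response headers.

    Prefers `retry-after`; otherwise falls back to the
    `anthropic-ratelimit-input-tokens-reset` timestamp.
    Hypothetical helper, for illustration only.
    """
    retry_after = headers.get("retry-after")
    if retry_after is not None:
        return float(retry_after)

    reset = headers.get("anthropic-ratelimit-input-tokens-reset")
    if reset is None:
        return 0.0

    # The header uses a trailing "Z"; convert it to an explicit UTC offset.
    reset_at = datetime.fromisoformat(reset.replace("Z", "+00:00"))
    now = now or datetime.now(timezone.utc)
    return max(0.0, (reset_at - now).total_seconds())


# Values taken from the error response above.
headers = {
    "retry-after": "52",
    "anthropic-ratelimit-input-tokens-reset": "2025-06-04T17:47:28Z",
}
print(seconds_until_retry(headers))
```

A wait like this could go before the retry, instead of retrying immediately until `retriesLeft` reaches 0.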
Please share your workflow
Share the output returned by the last node
Information on your n8n setup
- n8n version: 1.94.1
- Database (default: SQLite): SQLite
- n8n EXECUTIONS_PROCESS setting (default: own, main): own
- Running n8n via (Docker, npm, n8n cloud, desktop app): Docker
- Operating system: macOS


