Is it possible to use newly added sonnet models with Azure AI Foundry using Azure OpenAI Chat Model?

Nitish_Gupta · November 19, 2025, 7:07pm

Hi, Microsoft just made Anthropic models available in Azure AI Foundry. I would like to utilize these models within n8n AI Agents but we currently only have option for Azure OpenAI Chat Model. Is there a way to use Azure-hosted Claude models within n8n AI Agents or we have to wait for the integration?

James_Pardoe · November 26, 2025, 5:21am

Seconding this request.

edsingin · December 1, 2025, 12:29pm

Third. Do we know if they are working on this?

Jherico_Medina · December 5, 2025, 2:12am

Fourth. Anthropic Chat Model Node doesnt work either when your credentials came from Azure AI Foundry

vlad-tsoy · December 10, 2025, 3:31pm

Fifth, really hope the team can work on this in the near future.

Jones_y · December 16, 2025, 4:47pm

Sixth, would be amazing to support all Foundry endpoints, especially non-OpenAI model selection.

test3 · January 13, 2026, 11:08am

Seventh, i really need this

Jorn · January 30, 2026, 4:11pm

Please allow us to connect to a claude model hosted on Azure Foundry.

Brendan_Gooden · February 9, 2026, 3:42am

Really would appreciate this! Happy to support any integration effort

matthewjensen · February 17, 2026, 12:01pm

Eighth this, really need this for my work.

PatricLee-SMKT · February 25, 2026, 10:00am

I agree, the Azure “OpenAI” Chat Model should support Foundry Models and Endpoints. Currently it is way to limited for enterprise customers.

Matthew_DuVal · March 19, 2026, 5:29pm

Would really appreciate this being addressed!

Matthew_DuVal · April 16, 2026, 7:44pm

Hi all, I got this working. Add an Anthropic credential and set the baseUrl like the screenshot below. Note the connection will fail, but it will work in your flow.

Select an Anthropic Chat Model for your AI agent node as opposed to an Azure Open AI node. Choose your cred and use the “By ID” setting in the dropdown. Type your model deployment name in the text box. Execute and it should work. I only tried with API key auth, but this is working for me.

TomHughesSAXTech · April 16, 2026, 11:43pm

Can confirm that this does work, just connected to my Opus 4.6 and Sonnet models in Azure AI Foundry. Thanks!!!

TomHughesSAXTech · May 1, 2026, 9:20am

I lied… after two weeks of using Azure anthropic claude 4.7 and sonnet 4.6 with this hack job connection, streaming broke out of nowhere and now it takes minutes for any responses. Streaming would give me a preamble in my front end, and then 2min later, data dump the response in the chatbot. Switching back to AOAI 5.4 and 5.4-mini, streaming was back and fast

TomHughesSAXTech · May 1, 2026, 10:10am

So the workaround I found works best is that you build a LiteLLM and point the native OpenAI nodes to that:The n8n bug — Issue #28635 (April 17, 2026):

The @n8n/n8n-nodes-langchain.lmChatAnthropic node v1.3 (which is exactly what you’re using) emits the legacy thinking format thinking: { type: "enabled", budget_tokens: N }. Anthropic has removed that format on Claude Opus 4.7 the API returns a 400 the moment thinking is on. The same format is also deprecated and scheduled for removal on Opus 4.6 and Sonnet 4.6. Anthropic now requires thinking.type.adaptive and output_config.effort to control thinking behavior. GitHub + 2

This is in n8n’s Linear queue with the AI team assigned, not yet shipped.

There’s also a separate n8n streaming bug — Issue #23851:

Enabling the “Streaming” toggle on the AI Agent does not pass the stream: true parameter to the underlying Anthropic SDK. This causes requests with large input token counts (~20k+) to fail with the error: “Streaming is required for operations that may take longer than 10 minutes.” This one’s been open longer and is exactly your shape long input contexts (your CPA agent’s system prompt alone is huge, plus 20-turn Redis memory, plus your prep context block) flip the request into a path where stream-flag propagation breaks. GitHub

And Anthropic confirmed model-side behavior changes:

All users now default to xhigh effort for Opus 4.7, and high effort for all other models as of April 7. Opus 4.7 also has a notable behavioral quirk relative to its predecessor: as we wrote about at launch, it tends to be quite verbose. Combined with extended thinking running silently in the background, that explains why long answers feel like they pause forever and then dump Opus 4.7 is doing more thinking than 4.6 did at the same effort tier.

LiteLLM proxy in your AKS cluster. 30-minute setup. Stand up a LiteLLM pod, point it at Anthropic with your real key, configure it to expose an OpenAI-compatible endpoint. Then in n8n, use the OpenAI Chat Model node with a custom base URL pointing at your LiteLLM service. LiteLLM handles all the streaming translation and adaptive thinking parameter shapes correctly — it’s literally maintained for this exact problem. You bypass the broken n8n Anthropic node entirely. This gets you Opus 4.7 back with proper streaming.

config.yaml

model_list:

model_name: claude-opus-4-7
litellm_params:
model: anthropic/claude-opus-4-7
api_key: os.environ/ANTHROPIC_API_KEY
model_name: claude-sonnet-4-6
litellm_params:
model: anthropic/claude-sonnet-4-6
api_key: os.environ/ANTHROPIC_API_KEY

general_settings:
master_key: sk-your-internal-litellm-key

Topic		Replies	Views
Unable to connect Opus 4.5 model on Azure AI Foundry to n8n AI Agent node Questions openai , ai	2	314	January 2, 2026
Add Claude 3.7 Sonnet model support to n8n AI nodes Feature Requests (done) core , node	9	1633	April 1, 2025
How to connect an Azure Foundry AI model to the AI agent as a Chat Model? Questions	0	439	August 18, 2025
Support for Azure OpenAI provider Feature Requests custom-node , openai	1	155	October 13, 2025
Azure OpenAI in new Foundry Feature Requests	0	74	December 2, 2025

Is it possible to use newly added sonnet models with Azure AI Foundry using Azure OpenAI Chat Model?

config.yaml

Related topics