Organise Agent response in Json format

diegoschister · February 20, 2025, 1:25pm

Describe the problem/error/question

I’m developing an automation process composed of three steps for a technology research agent:

Agent Query:
A chat is used to ask a Gemini-based agent to perform a specific search.
Information Organization:
A second agent takes the information obtained in the first step and organizes it into a structured JSON format.
Upload to Google Sheets:
The structured JSON is uploaded to a Google Sheets document for further analysis.

The issue arises in the second step. I am unable to get the agent to organize the information into an appropriate JSON format that can be uploaded to Google Sheets. Below, I detail the components involved and the error that appears:

What is the error message (if any)?

Error in sub-node ‘Google Gemini Chat Model1‘

[GoogleGenerativeAI Error]: Error fetching from https://generativelanguage.googleapis.com/v1beta/models/gemini-2.0-flash-lite-preview:streamGenerateContent?alt=sse: [400 Bad Request] * GenerateContentRequest.tools[0].function_declarations[0].parameters.properties[output].type: must be specified when not using one_of

Please share your workflow

Promt for the second agent:
You are an expert information extraction agent. You must perform the following tasks:

Extract relevant information from the research.
Structure the information in JSON format.
Update an Excel file.

1. Extract the following data for each research task:

Company name:
Location:
Solution name:
Solution description:
Solution features:
Where was it tested?:
Main clients (oil & gas companies):
Economic impact:
Efficiency:
Emissions reduction:
Link to the solution:
Dimension:
Discipline: (Subsurface, Drilling, Completion, Production, Maintenance, Data, HSE)
Technology status: (Idea, Prototype, Scaling, Ready to develop, Mature)

2. Generate the JSON file with the extracted information:

[
{
“Company name”: “”,
“Location”: “”,
“Solution name”: “”,
“Solution description”: “”,
“Solution features”: ,
“Where was it tested?”: “”,
“Main clients (oil & gas companies)”: ,
“Economic impact”: “”,
“Efficiency”: “”,
“Emissions reduction”: “”,
“Link to the solution”: “”,
“Dimension”: “”,
“Discipline”: “”,
“Technology status”: “”
}
]

Share the output returned by the last node

Information on your n8n setup

n8n version: standar
Database (default: SQLite): no
n8n EXECUTIONS_PROCESS setting (default: own, main): standar
Running n8n via (Docker, npm, n8n cloud, desktop app): standar
Operating system: win 11

n8n · February 20, 2025, 1:25pm

It looks like your topic is missing some important information. Could you provide the following if applicable.

n8n version:
Database (default: SQLite):
n8n EXECUTIONS_PROCESS setting (default: own, main):
Running n8n via (Docker, npm, n8n cloud, desktop app):
Operating system:

solomon · February 21, 2025, 3:47pm

Many people are having the same problem. See here.

Using another LLM model solves the issue. Wait for the n8n team to fix and use OpenAI in the mean time. It’s super cheap.

If my reply solves your problem, please remember to mark it as a solution.

diegoschister · February 21, 2025, 7:34pm

Thanks I will try that. I fix a couple of errors but sometimes I got and answer and sometimes not. Very rare!

solomon · February 21, 2025, 7:38pm

That’s interesting. I bet it’s bug with the tool. The team might even be on it, already.

ofs · February 21, 2025, 8:25pm

Here how can I convert a scraped text data to json formatting with gemini.

I am assuming you can get your output properly from 1st agent and only problem is structuring the second agent.

In my case I tried to parse second hand e-bike listing for ebay. For second hand listings, users doesn’t provide same structured info and also content is German. So I used Gemini to understand listing and structure it json. On browser I tested and got JSON correctly. When I done on API with n8n, I can get the same result. I add my parser after agent with JS code. I will share my chat and JS code.

CHAT:

You are a professional data extraction assistant for eBike listings. Your task is to analyze the provided description and extract structured information about the eBike. If the description is in a foreign language (e.g., German), translate it internally to English before processing.

Your key objectives are:

1-Extract relevant details about the eBike in a structured JSON format.

2-Identify any negative information such as damage, issues, missing components, abnormal wear, or any statement that may suggest a potential concern regarding the bike's condition, documentation, or usage.

3-If the model year is described in relative terms (e.g., "five years old" or "3 years ago"), calculate the Year_Model by subtracting the duration from the current year. If a specific year is not mentioned and cannot be inferred, return "Not Provided."

4-If any essential information is missing (e.g., battery capacity, motor model), return "Not Provided."

5-Include any important details like warranty status, maintenance history, available documents, or reported issues in the "Additional_Details" field.

6-Specifically list any negative information in the "Negative_Details" field.

7-Output only the structured JSON with English values for each key. Do not add any explanation or additional comments.


**Output Format**

{

"Document": {

"Available": "Yes/No",

"Type": "e.g., Purchase Receipt, Warranty, User Manual, etc."

},

"Brand_Model": "Brand and Model Name (e.g., Haibike XDURO AllMtn)",

"Year_Model": "Year (e.g., 2022 or Not Provided)",

"Motor_Model": "Motor Model (e.g., Bosch Performance Line CX)",

"Motor_Volt": "Motor Volt (e.g., 36V, 48V, 56V or Not Provided)",

"Battery_Power": "Battery Capacity (e.g., 500Wh, 625Wh, or Not Provided)",

"Milage_KM": "Milage in KM (e.g., 1200 km or Not Provided)",

"Additional_Details": "Any other important details such as warranty status, maintenance history, available accessories, or anything else noteworthy.",

"Negative_Details": "Any negative information, such as damage, issues, missing components, abnormal wear, or concerns raised by the seller in the description. If no negative details are found, return 'None.'"

}


Guidelines:

1- Translation: If the description is in a foreign language, translate it to English before extracting the data. Record the extracted details in English in the JSON output.

2- Completeness: Ensure the JSON output covers all key details. If information is not provided in the description, explicitly mark it as "Not Provided."

3- Negative Information: Pay special attention to any statements indicating potential problems or concerns with the eBike, including missing documentation, signs of damage, or abnormal wear. Capture these details in the "Negative_Details" field.

4- Document Verification: Check if the seller mentions having important documents (e.g., purchase receipts, warranty, user manuals). If such documents are missing, this should also be noted in the output.

5- Exact Year Mentioned: If the description includes a specific year like 2018 or 2020, use that directly for Year_Model.

6- Relative Time Mentioned: If the description includes phrases like "five years old" or "bought 3 years ago", calculate the year by subtracting the time from the current year (e.g., if it's 2025 and the bike is "five years old," the model year is 2020).

7- No Time or Year Mentioned: If neither a specific year nor a relative time reference is provided, return "Not Provided" for Year_Model.


TITLE of LISTING: E-Bike Flyer c 5.1, 26 Räder, 45 Rahmen, ges.500km gefahren


**LISTING Description**

Schweizer Markenfahrrad !

Neuwertiger Zustand ohne Probleme,

Tiefeinsteiger mit Bordkomputer, abgelesene 508km.

Mittelmotor 36 V, 432 Wh, 8 Gang Narbenschaltung,

freilauf, hydraulische Felgenbremsen v.+hi.

Schiebehilfe, Berganfahrhilfe, Felgenschloß,



Komfortabel zu fahren, mit Anleitung

und alle Papiere......Fahrradpaß

5 Jahre alt, wegen Krankheit nur wenig genutzt,

Batterie immer im warmen Wohnzimmer gelagert.

Fahrleistung bis zu 80 Km nie genutzt.

Anschauen lohnt sich, "sieht aus wie neu"

LP: 2800

OUTPUT from AI

JS Code for Formating the output

// Loop over input items and transform the output JSON
const outputArray = [];

for (const item of $input.all()) {
    // Clean up the JSON string before parsing
    let cleanOutput = item.json.output.replace(/```json|```/g, "").trim();
    let extractedJson = JSON.parse(cleanOutput);

    // Format and modify the extracted JSON
    outputArray.push({
        "Document": {
            "Available": extractedJson.Document.Available,
            "Type": "Fahrradpaß, Anleitung"
        },
        "Brand_Model": extractedJson.Brand_Model,
        "Year_Model": "2020", // Manually set the new Year_Model
        "Motor_Model": "Not Provided", // Manually set default value
        "Motor_Volt": extractedJson.Motor_Volt,
        "Battery_Power": extractedJson.Battery_Power,
        "Milage_KM": extractedJson.Milage_KM,
        "Additional_Details": "Mittelmotor, 8 Gang Narbenschaltung, freilauf, hydraulische Felgenbremsen v.+hi., Schiebehilfe, Berganfahrhilfe, Felgenschloß",
        "Negative_Details": extractedJson.Negative_Details
    });
}

return outputArray;

Result:

[
  {
    "Document": {
      "Available": "Yes",
      "Type": "Fahrradpaß, Anleitung"
    },
    "Brand_Model": "Flyer c 5.1",
    "Year_Model": "2020",
    "Motor_Model": "Not Provided",
    "Motor_Volt": "36V",
    "Battery_Power": "432Wh",
    "Milage_KM": "508 km",
    "Additional_Details": "Mittelmotor, 8 Gang Narbenschaltung, freilauf, hydraulische Felgenbremsen v.+hi., Schiebehilfe, Berganfahrhilfe, Felgenschloß",
    "Negative_Details": "None."
  }
]

Same test result on UI too:

Workflow:

Problem probably you are not cleaning up json format. Because AI returns this:

{
  "response": {
    "generations": [
      [
        {
          "text": "```json\n{\n\"Document\": {\n\"Available\": \"Yes\",\n\"Type\": \"User Manual, All Papers, Bicycle Passport\"\n},\n\"Brand_Model\": \"Flyer c 5.1\",\n\"Year_Model\": \"2019\",\n\"Motor_Model\": \"Not Provided\",\n\"Motor_Volt\": \"36V\",\n\"Battery_Power\": \"432Wh\",\n\"Milage_KM\": \"508 km\",\n\"Additional_Details\": \"Low entry bike with on-board computer, 8-speed hub gears, freewheel, hydraulic rim brakes front and rear, push assistance, hill start assist, rim lock. Comfortable to ride. Battery always stored in the warm living room. Mileage up to 80 km never used. Original price: 2800\",\n\"Negative_Details\": \"None.\"\n}\n```",
          "generationInfo": {
            "finishReason": "STOP"
          }
        }
      ]
    ]
  },
  "tokenUsageEstimate": {
    "completionTokens": 164,
    "promptTokens": 978,
    "totalTokens": 1142
  }
}

NOTE:
My perspective is that let the AI generate JSON format and respons should be only the JSON format like my example, then try to extract that json from “text” (you can see above json AI return) with cleaning extras with code (like JS that I provided above)

If my reply solves your problem, please remember to mark it as a solution.

diegoschister · February 24, 2025, 1:53pm

UPDATE:

AFTER considering all the feedback, I am able to process the data with the Output parser. However, I do not get any output. As you can see i get no error.

I believe that this is a Gemini’s problem or n8n?

Thanks

ofs · February 24, 2025, 2:43pm

Is that 1st or 2nd agent output? Or parser?

diegoschister · February 24, 2025, 3:20pm

2 agent ouput

ofs · February 24, 2025, 3:59pm

Can you get response from Gemini?
Did you check your log?

If it is parsing, probably it is returning empty object.

Do you still using same prompt and parser?

ofs · February 24, 2025, 4:23pm

Can you modify this depends on your need. This works well. Carefully design system prompt for proper output. Also I realized that your JSON example structure is wrong. You should start/end with {}, not .

Input data for this example:

Title: E-Bike Flyer c 5.1, 26 Räder, 45 Rahmen, ges.500km gefahren Description

Schweizer Markenfahrrad ! Neuwertiger Zustand ohne Probleme, Tiefeinsteiger mit Bordkomputer, abgelesene 508km. Mittelmotor 36 V, 432 Wh, 8 Gang Narbenschaltung, freilauf, hydraulische Felgenbremsen v.+hi. Schiebehilfe, Berganfahrhilfe, Felgenschloß,

Komfortabel zu fahren, mit Anleitung und alle Papiere......Fahrradpaß 5 Jahre alt, wegen Krankheit nur wenig genutzt, Batterie immer im warmen Wohnzimmer gelagert. Fahrleistung bis zu 80 Km nie genutzt. Anschauen lohnt sich, "sieht aus wie neu" LP: 2800

diegoschister · February 24, 2025, 7:30pm

Hi, I correct the json structure that you mention. However, sometimes the model excecutes with no problem and other has an error. When the excution has no problem, the output of the second agent is empty but in the log session I can see only one register that was done ok.

ofs · February 24, 2025, 7:43pm

Can you please give me an example chat input to test it.

diegoschister · February 25, 2025, 12:51pm

“Please do a research for oil and gas digital solutions in spanish”

ofs · February 25, 2025, 5:08pm

Your parser didn’t work because n8n not able to parse it. Use this one:

diegoschister · February 25, 2025, 7:50pm

Hi, Thanks for the help. I run the code you mention but it din’t work. I modify one of the arrays and use the function slice() but also didn’t work.

Here’s the code i run.

// Loop over input items and transform the output JSON
const outputArray = [];

for (const item of $input.all()) {
    // Clean up the JSON string before parsing
    let cleanOutput = item.json.output.replace(/```json|```/g, "").trim();
    let extractedJson = JSON.parse(cleanOutput);

    // Format and modify the extracted JSON
    outputArray.push({
        "nombre_empresa": extractedJson.nombre_empresa,
        "ubicacion": extractedJson.ubicacion,
        "nombre_solucion": extractedJson.nombre_solucion,
        "descripcion_solucion": extractedJson.descripcion_solucion,
        "caracteristicas_solucion": extractedJson.caracteristicas_solucion ? extractedJson.caracteristicas_solucion.slice(0, 4) : [],
        "donde_se_probo": extractedJson.donde_se_probo,
        "principales_clientes_oil_gas": extractedJson.principales_clientes_oil_gas ? extractedJson.principales_clientes_oil_gas.slice(0, 3) : [],
        "impacto_economico": extractedJson.impacto_economico,
        "reduccion_emisiones": extractedJson.reduccion_emisiones,
        "link_solucion": extractedJson.link_solucion,
        "dimension": extractedJson.dimension,
        "disciplina": extractedJson.disciplina,
        "estado_tecnologia": extractedJson.estado_tecnologia
    });
}

return outputArray;

Here’s the output:

I’m frustrated with this. It should be simple to work.

Also, when you said that din’t work for n8n. Why? Is it something on their side? Is the Gemini model? I would like to underestand so I can know what to do in future workflows. Thanks for the help!

ofs · February 26, 2025, 7:19am

I used what you provided. I tested same input couple times and it was working. I only changed Gemini 2.0 Flash and also created that JS code. Never failed me.

Why it is not working on n8n: Gemini’s response needs to be parsed. n8n parse not able to get json format properly, I guess because of extras at the beginning of json response.

If you tested with different input, make sure 2nd agent responds expected json properly. If that is returning properly, make sure your JS file alligns with response. What I mean, each key name and if it is array their values amount like what you done with slice needs to match.

My guess is currently you are not able to get proper output from 2nd agent. It might be changed by Gemini. If your key names always same and getting proper output, then problem is parser.

Also:
Can you please check what are you using in second agent model.
This is what I used in 2nd one:

This is first one:

This is example result for second agent response:

This is after json parser:

This red markeds are the problem: (didn’t marked all but you got the idea)

system · May 27, 2025, 7:20am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.