Hello,
We are trying to create an AI agent that can summarize a book that we upload and answer questions about it. However, it is not working at the level we expect: it cannot produce a summary, and the answers it returns seem to be only partial.
Here are more details about our workflows:
Objective
Upload any document (100 pages or more) and have the AI workflows ingest it so that the agent is able to summarize the book and answer questions about its content.
Workflow 1:
- Download the document from Google Drive (https://drive.google.com/file/d/1iAi-bCNpde2BdoEhNzZ3gTW038wubauI/view?usp=sharing)
- Use Google Gemini to process the document and upload it to the Pinecone Vector Store
- Recursive Character Text Splitter Settings: Chunk Size: 500 and Chunk Overlap: 20
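To make the splitter settings concrete, here is a rough Python sketch of how we understand chunk size and chunk overlap to interact. It is a simplified fixed-window splitter, not the actual Recursive Character Text Splitter (which, as far as we know, also tries to break on separators such as paragraphs and sentences). The function name chunk_text is ours and is for illustration only.

```python
def chunk_text(text: str, chunk_size: int = 500, chunk_overlap: int = 20) -> list[str]:
    """Split text into fixed-size character windows that overlap slightly.

    Simplified illustration only: it ignores paragraph and sentence
    boundaries, which a recursive splitter would try to respect.
    """
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")

    step = chunk_size - chunk_overlap  # how far each new window advances
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reached the end of the text
    return chunks


# Usage with the settings from Workflow 1 (chunk size 500, overlap 20).
sample = "".join(chr(97 + i % 26) for i in range(1000))
pieces = chunk_text(sample, chunk_size=500, chunk_overlap=20)
print(len(pieces), [len(p) for p in pieces])
```

With these settings, consecutive chunks share only 20 characters, so a list of steps that spans more than one chunk may end up split across several separate vectors.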
Workflow 2:
- When asked a question, use Google Gemini to process the question
- Then look up relevant information from the Pinecone Vector Store and provide an answer
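As we understand it, the lookup in Workflow 2 is a top-k similarity search: only the few chunks closest to the question are passed to the model. Below is a minimal Python sketch of that idea, assuming cosine similarity and tiny hand-made vectors. The real workflow uses Gemini embeddings and Pinecone; the names top_k, index, and the chunk IDs are hypothetical.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec: list[float], chunk_vecs: dict[str, list[float]], k: int = 4) -> list[str]:
    """Return the IDs of the k chunks most similar to the query vector."""
    scored = [(cosine_similarity(query_vec, vec), chunk_id)
              for chunk_id, vec in chunk_vecs.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk_id for _, chunk_id in scored[:k]]


# Toy stand-in for the vector store (made-up 3-dimensional "embeddings").
index = {
    "chunk-001": [1.0, 0.0, 0.0],
    "chunk-002": [0.9, 0.1, 0.0],
    "chunk-003": [0.0, 1.0, 0.0],
    "chunk-004": [0.0, 0.0, 1.0],
}
query = [1.0, 0.05, 0.0]

print(top_k(query, index, k=2))  # ['chunk-001', 'chunk-002']
```

In this sketch the model would only ever see the k returned chunks, never the whole book, which may be relevant to the summary problem described below.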
Steps:
- Upload Dot Com Profits (https://drive.google.com/file/d/1iAi-bCNpde2BdoEhNzZ3gTW038wubauI/view?usp=sharing) in Workflow 1
- Ask questions to the chat agent in Workflow 2 related to the document uploaded.
Problems:
- When asked to provide a summary of the book, the agent appears to just look for a summary that already exists in the document. It doesn’t generate a summary based on the book's content, and it responded that “no summary was found”.
- When asked what the steps for generating wealth online are, the agent is not able to describe the steps (chapters) covered in the book. Instead, it responds that the book mentions creating multiple income streams, without giving the specific steps that can be found in the book.
- When asked “how many steps are there?”, the agent correctly returns that there are 8 steps. But when asked “what are the eight steps?”, it’s not able to list them.
Questions:
- It appears that the agent only looks for information that is explicitly stated in the document and is not able to aggregate information or produce a summary. Is any extra processing required to prepare the document so that it can do that?
- Do the settings we use when uploading the document allow the agent to process it properly? Would a larger chunk size and chunk overlap be better for preserving continuous context?
- Sometimes the Q&A agent does seem to have the correct answer. For example, when asked “how do I sell a high-priced product?”, it responded “The provided text states that selling high-priced products depends on perceived value. Effectively communicating the value of your product, including the time and money invested in its creation, and highlighting your knowledge and experience are key. A well-built marketing funnel is also important.”
- What could be the reasons that it gets some answers right but gets many others wrong or incomplete?