web
You’re offline. This is a read only version of the page.
close
Skip to main content
Community site session details

Community site session details

Session Id :
Copilot Studio - General
Suggested answer

Ambiguity in Search Results

(1) ShareShare
ReportReport
Posted on by 6
I am creating a Chatbot that will help answer questions related to Standard Operating Procedures.
There are over 6000 SOP word & PDf documents.

I am running into interesting issue of how to resolve Ambiguity.
1. It find answers from irrelevant documents.
    There are Multiple SOP that would have information around What PPE(Personal Protective Equipment) a Machine Operator should wear when troubleshooting a speciif Machine.
    It will randonlmy find answer that from different documents.
    How can make Chatbot to ask some more questions to user before answering a question where the information is found in multiple documents and use that new information to only search specific document.

    So, for question like
    User: What are the PPE requirements ?
    Chatbot: Can you please tell me more about which Machine & Assembly line are you talking about ?
    User : MAchine A on Assembly line 1.
        <Out of 6000 Share Point document it should only search the documents that are relevant for MAchine A and assembly Line 1>

   How do achieve that ?

Also i only want Chatbot to ask user if he has not already provided that information. If he has already provided that information, Chat bot should not ask for it again.
 
Categories:
I have the same question (0)
  • Michael E. Gernaey Profile Picture
    51,852 Super User 2025 Season 2 on at
    Ambiguity in Search Results
     
    I am moving this to the Copilot area as this is not a Power Up Challenge question :-0)
  • Michael E. Gernaey Profile Picture
    51,852 Super User 2025 Season 2 on at
    Ambiguity in Search Results
     
    Do you still need help with this?
  • Suggested answer
    Romain The Low-Code Bearded Bear Profile Picture
    1,439 Super User 2025 Season 2 on at
    Ambiguity in Search Results
    there are two way : 
    First one : the one i have often difficulty to make it run 100% of the time : with the global instruction windows, tell it to ask question. (but it's not enough precise, you have to teach the agent "what is a not enough precise question" : the agent is not trained ingenier and don't know if the question contain enough information, since the exemple you give could be a question : so you have to give sample to the agent to understand what and why.  Agent are just maths, they are not sentient intelligence :) so they need a lot of help when it's about complexe matter (i have similar problem like yours ATM i m on a bot for industrial ingenier and some question... are very complicated to be understand by the agent)
    Annexe option for this solution : you could provide glossary inside the instruction or with glossary.xlsx file as knowledge to help the agent :)
     
    Second one : more reliable : use the topics ability to create topic and ask question. You could create topic for "generic" question and force to give detail, store progressivly all answer and concatenate them to create a big question and send it to a generative AI node (inside advanced tab of the node choice inside a topic)
  • Michael E. Gernaey Profile Picture
    51,852 Super User 2025 Season 2 on at
    Ambiguity in Search Results
     
    So you wrote this
     
    I am creating a Chatbot that will help answer questions related to Standard Operating Procedures.
    There are over 6000 SOP word & PDf documents.
     
    --I am guessing you are storing these in SharePoint? Is your knowledge source(s) setup at the as the root? How many levels of documents do you have? How many documents in a given document library and or subfolder do you have?
     
    --Firstly there is a limit to the # of documents that an Agent will index. It will also limit the amount of data, based on size and complexity of your documents. My point is, an Agent by itself is not really intended to do this many. Training an LLM or extending a Model is another story, but simply pointing at 6,000 goes beyond its limits. In many cases you will need to partition your data into subfolders etc and then put the documents in there specifically and make them knowledge sources versus having it try to use SharePoint at (any level) try to find it deeper down.


    I am running into interesting issue of how to resolve Ambiguity.
    1. It find answers from irrelevant documents.
        There are Multiple SOP that would have information around What PPE(Personal Protective Equipment) a Machine Operator should wear when troubleshooting a specific Machine.
        It will randomly find answer that from different documents.
    --This would be expected due to the volume of data you have. The issue is, think about it like this. You have a zillion books in a room, the person who is trying to find the answer (let's call them a librarian), they have no idea which books have the information you want because they cannot summarize (index) them all. so they try and find something and when they do... they go oh this is it.

        How can make Chatbot to ask some more questions to user before answering a question where the information is found in multiple documents and use that new information to only search specific document.

        So, for question like
        User: What are the PPE requirements ?
        Chatbot: Can you please tell me more about which Machine & Assembly line are you talking about ?
        User : MAchine A on Assembly line 1.
            <Out of 6000 Share Point document it should only search the documents that are relevant for Machine A and assembly Line 1>

       How do achieve that ?
     
    --So to mix up the answer a little bit (from the above questions), essentially you need to use Scoring. In the scoring, if the agent scores the answers too low, then you would want it to ask more questions. This is harder to do, not impossible but its not an OOB agent thing exactly, its a Prompt Against a real model.
     
    If you want it to only search documents that are related to Machin A, then you have to separate your data in a way where the knowledge sources can be Indexed properly against the topics (and by topic I do not mean Agent topics) but the focus area they are asking about. It is never going to be "just smart" enough to take your thousands of documents dropped somewhere and OOB do that. 

    Also i only want Chatbot to ask user if he has not already provided that information. If he has already provided that information, Chat bot should not ask for it again.
    -----Conversations are only so large, no matter whether your Topic ends a conversation or not, an Agent will no automatically remember everything forever. It will reset at some point. Which means you would have to leverage global variables to track everything and even then, they can be wiped over time, simply meaning there is no gauranteed way (in memory) to make it remember it all. There are ways to cache it, write it to a file etc, and read it only the fly etc. But its not as simple as it would be if it was going against an actual model, versus Knowledge sources.
     

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

Responsible AI policies

As AI tools become more common, we’re introducing a Responsible AI Use…

Chiara Carbone – Community Spotlight

We are honored to recognize Chiara Carbone as our Community Spotlight for November…

Leaderboard > Copilot Studio

#1
Michael E. Gernaey Profile Picture

Michael E. Gernaey 475 Super User 2025 Season 2

#2
Romain The Low-Code Bearded Bear Profile Picture

Romain The Low-Code... 425 Super User 2025 Season 2

#3
Giraldoj Profile Picture

Giraldoj 55 Super User 2025 Season 2

Last 30 days Overall leaderboard