You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The output of SelectorAgent, corrected_input, is supposed to be a cleaned version of ASR text to helps subsequent agents in the workflow to process user's requests
However, the actual output can contain
Body text from system message: You are an intelligent laparoscopic cholecystectomy surgical assistant who must select from the available agents based on the user's request.
Truncated output if the user message is long, e.g. ending a sentence with "..."
Reproduce
With the uncertainty of LLMs in mind, one can try prompts like "Check the text in the image" / "Repeat the text". The VLM may be triggered to provide incorrect corrected_output
Alternative Solution
Engineered prompts of the subsequent agents, and let those agents know the user text can be from ASR and may contain errors.