Distilling insights with ReasoningBank
ReasoningBank distills international reasoning patterns into high-level, structured recollections. Every structured reminiscence merchandise accommodates the next:
- Title: A concise identifier summarizing the core technique.
- Description: A short abstract of the reminiscence merchandise.
- Content material: The distilled reasoning steps, determination rationales, or operational insights extracted from previous experiences.
The reminiscence workflow operates in a steady, closed loop of retrieval, extraction, and consolidation. Earlier than taking motion, the agent attracts upon the ReasoningBank to collect related recollections into its context. It then interacts with the surroundings and makes use of an LLM-as-a-judge to self-assess the ensuing trajectory and extracts success insights or failure reflection. Notably, this self-judgement doesn’t have to be completely correct, as we discover ReasoningBank to be fairly sturdy in opposition to judgment noise. Throughout extraction, the agent distills workflows and generalizable insights from the trajectory into new recollections. For simplicity, we instantly append these to the ReasoningBank, leaving extra refined consolidation methods for future work.
Crucially, in contrast to present workflow reminiscence methods that solely concentrate on profitable runs, ReasoningBank actively analyzes failed experiences to supply counterfactual indicators and pitfalls. By distilling these errors into preventative classes, ReasoningBank builds highly effective strategic guardrails. For instance, as a substitute of merely studying a procedural rule like “click on the ‘Load Extra’ button”, the agent would possibly study from a previous failure to “at all times confirm the present web page identifier first to keep away from infinite scroll traps earlier than trying to load extra outcomes”.

