A Step is a unit of execution within a Session. It captures both a snapshot of the agent’s current state and the action taken to transition to the next state. Steps are the backbone of a session timeline and form the nodes in your workflow graph. Each Step may contain multiple underlying Events, such as tool calls, API requests, or LLM invocations.
Steps may include a screenshot that visually captures what the agent was seeing or interacting with at that moment. These can be critical for understanding UI-based agents or browser actions.
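To make the relationship between Steps, Events, and screenshots concrete, here is a minimal sketch of how this data could be modeled. The `Step` and `Event` classes and all of their field names are hypothetical, chosen for illustration only; they are not the SDK's actual types.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Event:
    # Hypothetical: one low-level operation that happened inside a step.
    kind: str          # e.g. "tool_call", "api_request", "llm_invocation"
    description: str


@dataclass
class Step:
    # Hypothetical illustration, not the SDK's real Step type.
    state: str                              # snapshot of where the agent is
    action: str                             # what it did to reach the next state
    events: List[Event] = field(default_factory=list)
    screenshot_path: Optional[str] = None   # optional visual capture


# One step that contains two underlying events and no screenshot.
step = Step(state="Inbox tab open", action="Click 'Compose'")
step.events.append(Event(kind="llm_invocation", description="Chose next action"))
step.events.append(Event(kind="tool_call", description="browser.click"))
```

Keeping Events nested under a Step mirrors the description above: the Step is the node you see in the timeline, and its Events are the lower-level detail behind it.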
Steps can be evaluated in three ways:
User-provided scores: log your own logic-based rating of a step
LLM-generated scores (planned): use limited context to judge a step’s quality
Default heuristics: highlight stuck or repeated states
lai.update_step(eval_score=0.3, eval_description="Tried to switch tabs but failed.")
Warning: Evaluating a step in isolation is often difficult, because its quality usually depends on the broader task context. We recommend pairing step-level scores with session-wide scoring or rubrics when possible.
When multiple Steps share the same state and similar actions (e.g., retries or confusion loops), they’re automatically grouped in the graph view. This helps surface:
Stuck behavior (e.g., trying the same thing repeatedly)
Revisiting states (e.g., reloading a site, flipping between tabs)
Structural patterns (e.g., loops, branches, dead ends)
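The grouping idea can be sketched in a few lines of Python. This is an illustrative approximation only, not the platform's actual grouping logic: it assumes each step is a `(state, action)` pair of strings, treats states as equal only on exact match, and uses the standard library's `difflib.SequenceMatcher` with an arbitrary 0.8 threshold to decide whether two actions are "similar". The names `group_steps` and `stuck_groups` are hypothetical.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from typing import Dict, List, Tuple

StepRecord = Tuple[str, str]  # (state, action); an assumed, simplified shape


def similar(a: str, b: str, threshold: float = 0.8) -> bool:
    # Case-insensitive fuzzy match between two action strings.
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold


def group_steps(steps: List[StepRecord], threshold: float = 0.8) -> List[List[int]]:
    """Group step indices that share a state and have similar actions."""
    groups_by_state: Dict[str, List[List[int]]] = defaultdict(list)
    ordered: List[List[int]] = []  # groups in order of first appearance
    for i, (state, action) in enumerate(steps):
        for group in groups_by_state[state]:
            representative_action = steps[group[0]][1]
            if similar(action, representative_action, threshold):
                group.append(i)
                break
        else:
            # No existing group matched, so start a new one.
            new_group = [i]
            groups_by_state[state].append(new_group)
            ordered.append(new_group)
    return ordered


def stuck_groups(groups: List[List[int]], min_repeats: int = 3) -> List[List[int]]:
    # A group with many members suggests the agent kept retrying the same thing.
    return [g for g in groups if len(g) >= min_repeats]


steps = [
    ("search page", "click tab 'Results'"),
    ("search page", "click tab 'Results'"),
    ("search page", "Click tab 'Results'"),
    ("results page", "scroll down"),
    ("search page", "type query 'shoes'"),
]
groups = group_steps(steps)
print(groups)
print(stuck_groups(groups))
```

In this example the first three steps collapse into a single group, which is the kind of retry loop the graph view is meant to surface. The last step revisits the same state with a different action, so it stays in its own group.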