Agent callbacks #1316

tkattkat · 2025-11-26T23:58:08Z

Summary

Adds callback support to the Stagehand agent for both streaming and non-streaming execution modes, allowing users to hook into various stages of agent execution.

Changes

New Types (`lib/v3/types/public/agent.ts`)

Added callback interfaces for agent execution:

AgentCallbacks - Base callbacks shared between modes:
- prepareStep - Modify settings before each LLM step
- onStepFinish - Called after each step completes
AgentExecuteCallbacks - Non-streaming mode callbacks (extends AgentCallbacks)
AgentStreamCallbacks - Streaming mode callbacks (extends AgentCallbacks):
- onChunk - Called for each streamed chunk
- onFinish - Called when stream completes
- onError - Called on stream errors
- onAbort - Called when stream is aborted
AgentExecuteOptionsBase - Base options without callbacks
AgentExecuteOptions - Non-streaming options with AgentExecuteCallbacks
AgentStreamExecuteOptions - Streaming options with AgentStreamCallbacks

Handler Updates (`lib/v3/handlers/v3AgentHandler.ts`)

Modified createStepHandler to accept optional user callback
Updated execute() to pass callbacks to generateText
Updated stream() to pass callbacks to streamText

Type Safety

Added compile-time enforcement that streaming-only callbacks (onChunk, onFinish, onError, onAbort) can only be used with stream: true:

// ✅ Works - streaming callbacks with stream: true
const agent = stagehand.agent({ stream: true });
await agent.execute({
  instruction: "...",
  callbacks: { onChunk: async (chunk) => console.log(chunk) }
});

// ❌ Compile error - streaming callbacks without stream: true
const agent = stagehand.agent({ stream: false });
await agent.execute({
  instruction: "...",
  callbacks: { onChunk: async (chunk) => console.log(chunk) }
  // Error: "❌ This callback requires 'stream: true' in AgentConfig..."
});

Type Castings Explained

Several type assertions were necessary due to TypeScript's limitations with conditional types:

1. Callback extraction in handlers

const callbacks = (instructionOrOptions as AgentExecuteOptions).callbacks as
  | AgentExecuteCallbacks
  | undefined;

Why: instructionOrOptions can be string | AgentExecuteOptions. When it's a string, there are no callbacks. We cast after the prepareAgent call because at that point we know it's been resolved to options.

2. Streaming vs non-streaming branch in v3.ts

result = await handler.execute(
  instructionOrOptions as string | AgentExecuteOptions,
);

Why: The implementation signature accepts string | AgentExecuteOptions | AgentStreamExecuteOptions to satisfy both overloads, but within the non-streaming branch we know it's the non-streaming type. TypeScript can't narrow based on the isStreaming runtime check.

3. Error fallback in stream()

return {
  textStream: (async function* () {})(),
  result: resultPromise,
} as unknown as AgentStreamResult;

Why: When prepareAgent fails in streaming mode, we return a minimal object with just textStream and result. This doesn't satisfy all properties of StreamTextResult, but the result promise will reject with the actual error. The double cast (as unknown as) is needed because TypeScript knows this partial object doesn't match the full type.

Usage Example

const agent = stagehand.agent({
  stream: true,
  model: "anthropic/claude-sonnet-4-20250514",
});

const result = await agent.execute({
  instruction: "Search for something",
  maxSteps: 20,
  callbacks: {
    prepareStep: async (ctx) => {
      console.log("Preparing step...");
      return ctx;
    },
    onStepFinish: async (event) => {
      console.log(`Step finished: ${event.finishReason}`);
      if (event.toolCalls) {
        for (const tc of event.toolCalls) {
          console.log(`Tool used: ${tc.toolName}`);
        }
      }
    },
    onChunk: async (chunk) => {
      // Process each chunk
    },
    onFinish: (event) => {
      console.log(`Completed in ${event.steps.length} steps`);
    },
  },
});

for await (const chunk of result.textStream) {
  process.stdout.write(chunk);
}

const finalResult = await result.result;
console.log(finalResult.message);

Testing

Added agent-callbacks.spec.ts with tests for:
- Non-streaming callbacks (onStepFinish, prepareStep)
- Streaming callbacks (onChunk, onFinish, prepareStep, onStepFinish)
- Combined callback usage
- Tool call information in callbacks

Summary by cubic

Adds lifecycle callbacks to the Stagehand agent for both non-streaming and streaming modes, so users can hook into steps, chunks, finish, and errors. Strong type safety prevents using streaming-only callbacks without stream: true.

Why:

Give users fine-grained control over agent execution.
Prevent misuse of streaming callbacks with clear compile-time errors.

What:

Types
- Added AgentCallbacks, AgentExecuteCallbacks, AgentStreamCallbacks, AgentExecuteOptionsBase, AgentStreamExecuteOptions.
- Compile-time checks restrict onChunk/onFinish/onError/onAbort to stream: true.
Handlers
- createStepHandler now forwards user onStepFinish.
- execute() and stream() pass prepareStep/onStepFinish and streaming callbacks to generateText/streamText.
V3 and Cache
- V3 agent() overloads accept both execute and stream option shapes; routing updated accordingly.
- AgentCache.sanitizeExecuteOptions now uses AgentExecuteOptionsBase.
- Callbacks require experimental: true; agent() throws if callbacks are provided without experimental.
Tests
- Added agent-callbacks.spec.ts covering non-streaming, streaming, and combined callbacks, including tool call data.

Test Plan:

Automated: playwright tests in packages/core/lib/v3/tests/agent-callbacks.spec.ts verify:
- Non-streaming: prepareStep, onStepFinish, tool call info.
- Streaming: onChunk, onFinish, prepareStep, onStepFinish.
- Combined usage with multiple callbacks.
Type checks: public API type tests ensure new option and callback shapes are exported and correct.

^{Written for commit bc1d2a6. Summary will update automatically on new commits.}

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

changeset-bot · 2025-11-26T23:58:12Z

🦋 Changeset detected

Latest commit: bc1d2a6

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 2 packages

Name	Type
@browserbasehq/stagehand	Patch
@browserbasehq/stagehand-evals	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

greptile-apps · 2025-11-27T00:02:55Z

Greptile Overview

Greptile Summary

Adds lifecycle callback support to Stagehand agents, enabling users to hook into various stages of agent execution for both streaming and non-streaming modes.

Key Changes:

New callback interfaces: AgentCallbacks (shared), AgentExecuteCallbacks (non-streaming), and AgentStreamCallbacks (streaming)
Compile-time type safety preventing streaming callbacks (onChunk, onFinish, onError, onAbort) without stream: true
Handler updates to extract and forward callbacks to AI SDK's generateText and streamText
Callback exclusion from cache keys via AgentExecuteOptionsBase
Comprehensive test coverage for all callback scenarios

Implementation Quality:

Clean separation of concerns with callbacks properly typed and routed
Type assertions are justified and documented in PR description
Cache integration correctly excludes callbacks from cache keys
Tests verify callback execution, tool call information capture, and multi-callback usage

Confidence Score: 5/5

This PR is safe to merge with minimal risk
The implementation is well-architected with strong type safety, comprehensive test coverage, and proper separation of concerns. The callback system correctly integrates with the AI SDK, and the cache exclusion logic prevents callback pollution in cache keys. Type assertions are necessary due to TypeScript limitations with conditional types and are well-documented.
No files require special attention

Important Files Changed

File Analysis

Filename	Score	Overview
packages/core/lib/v3/types/public/agent.ts	5/5	Adds comprehensive callback type definitions with compile-time safety guards preventing streaming callbacks in non-streaming mode
packages/core/lib/v3/handlers/v3AgentHandler.ts	5/5	Updates handler to forward user callbacks to AI SDK, properly extracting and passing callbacks to generateText and streamText
packages/core/lib/v3/v3.ts	4/5	Updates agent() overloads and routing logic to support callback parameters in both streaming and non-streaming execution paths

Sequence Diagram

sequenceDiagram
    participant User
    participant V3
    participant V3AgentHandler
    participant LLMClient
    participant AISDKCallbacks

    User->>V3: agent({ stream: true/false })
    V3-->>User: AgentInstance

    User->>V3: agent.execute({ instruction, callbacks })
    V3->>V3: prepareAgentExecution()
    V3->>V3AgentHandler: new V3AgentHandler()
    
    alt Non-streaming (stream: false)
        V3->>V3AgentHandler: execute(instructionOrOptions)
        V3AgentHandler->>V3AgentHandler: prepareAgent()
        V3AgentHandler->>V3AgentHandler: extract callbacks from options
        V3AgentHandler->>LLMClient: generateText({ prepareStep, onStepFinish })
        loop Each step
            LLMClient->>AISDKCallbacks: prepareStep(context)
            AISDKCallbacks-->>LLMClient: modified context
            LLMClient->>LLMClient: execute step
            LLMClient->>AISDKCallbacks: onStepFinish(event)
            AISDKCallbacks->>V3AgentHandler: createStepHandler(event)
            V3AgentHandler->>User: callbacks.onStepFinish(event)
        end
        V3AgentHandler-->>V3: AgentResult
        V3-->>User: AgentResult
    else Streaming (stream: true)
        V3->>V3AgentHandler: stream(instructionOrOptions)
        V3AgentHandler->>V3AgentHandler: prepareAgent()
        V3AgentHandler->>V3AgentHandler: extract callbacks from options
        V3AgentHandler->>LLMClient: streamText({ prepareStep, onStepFinish, onChunk, onFinish, onError })
        loop Each step
            LLMClient->>AISDKCallbacks: prepareStep(context)
            AISDKCallbacks-->>LLMClient: modified context
            loop Each chunk
                LLMClient->>User: callbacks.onChunk(chunk)
            end
            LLMClient->>AISDKCallbacks: onStepFinish(event)
            AISDKCallbacks->>V3AgentHandler: createStepHandler(event)
            V3AgentHandler->>User: callbacks.onStepFinish(event)
        end
        LLMClient->>User: callbacks.onFinish(event)
        V3AgentHandler-->>V3: AgentStreamResult
        V3-->>User: AgentStreamResult
        User->>User: iterate textStream
        User->>User: await result promise
    end

greptile-apps

_{6 files reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

cubic-dev-ai

No issues found across 6 files

cubic-dev-ai

No issues found across 7 files

tkattkat and others added 30 commits November 21, 2025 14:24

agent steaming

861224d

put behind experimental

1fa6093

update typing

a0c8ddf

add streaming example

8d4f29d

lint

e9c63f6

update types

f0bfffe

change default max steps to 20

b21cd59

Merge remote-tracking branch 'origin/main' into agent-steaming

c5c60f1

manually bump version

57ec8c1

Update packages/core/examples/agent_stream_example.ts

e31d2df

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

changeset

ef9a78e

change example to verbose 0

5b5619c

add caching

e782674

format

1b7d126

add wrapStreamForCaching

51877dd

consolidate shared logic

ebd814f

remove version change

0d06078

agents streaming

eeee0dd

update example

7f88860

update changeset

ae01423

update example

269d7fc

greptile suggestions

1e7af15

format

4ad4efc

throw error when stream is used with cua

05bdf3c

add test for streaming

3744a71

update test

c14d736

lint

c1e3bae

add support for callbacks in agent

b832be9

format

41aa84b

add try catch

1b403df

tkattkat added 2 commits November 26, 2025 15:52

Merge remote-tracking branch 'origin/main' into agent-callbacks

ab26f6c

format

3166208

add changeset

eed6abd

greptile-apps bot reviewed Nov 27, 2025

View reviewed changes

put callbacks behind experimental

e4cc3ec

cubic-dev-ai bot reviewed Nov 27, 2025

View reviewed changes

update types

bc1d2a6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Agent callbacks #1316

Agent callbacks #1316

Uh oh!

tkattkat commented Nov 26, 2025 •

edited by cubic-dev-ai bot

Loading

Uh oh!

changeset-bot bot commented Nov 26, 2025 •

edited

Loading

Uh oh!

greptile-apps bot commented Nov 27, 2025

Uh oh!

greptile-apps bot left a comment

Uh oh!

cubic-dev-ai bot left a comment

Uh oh!

cubic-dev-ai bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Agent callbacks #1316

Are you sure you want to change the base?

Agent callbacks #1316

Uh oh!

Conversation

tkattkat commented Nov 26, 2025 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

New Types (lib/v3/types/public/agent.ts)

Handler Updates (lib/v3/handlers/v3AgentHandler.ts)

Type Safety

Type Castings Explained

1. Callback extraction in handlers

2. Streaming vs non-streaming branch in v3.ts

3. Error fallback in stream()

Usage Example

Testing

Summary by cubic

Why:

What:

Test Plan:

Uh oh!

changeset-bot bot commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

greptile-apps bot commented Nov 27, 2025

Greptile Overview

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tkattkat commented Nov 26, 2025 •

edited by cubic-dev-ai bot

Loading

New Types (`lib/v3/types/public/agent.ts`)

Handler Updates (`lib/v3/handlers/v3AgentHandler.ts`)

changeset-bot bot commented Nov 26, 2025 •

edited

Loading