streaming tools

流式工具执行允许多个工具并发执行，并实时反馈进度。

设计理念

问题

串行执行工具效率低：

readFile(file1) → 等待 → readFile(file2) → 等待 → readFile(file3)
总时间: 300ms + 300ms + 300ms = 900ms

解决方案

并发执行：

readFile(file1) ┐
readFile(file2) ├→ 并发执行
readFile(file3) ┘
总时间: max(300ms, 300ms, 300ms) = 300ms

核心实现

StreamingToolExecutor

class StreamingToolExecutor {
  constructor(
    private tools: Map<string, Tool>,
    private options: {
      maxConcurrency: number;
      onProgress?: (progress: Progress) => void;
    }
  ) {}
  
  async *execute(
    toolCalls: ToolCall[]
  ): AsyncGenerator<ToolResult> {
    const queue = [...toolCalls];
    const executing = new Set<Promise<ToolResult>>();
    
    while (queue.length > 0 || executing.size > 0) {
      // 启动新的工具执行（不超过并发限制）
      while (queue.length > 0 && executing.size < this.options.maxConcurrency) {
        const call = queue.shift();
        const promise = this.executeTool(call);
        executing.add(promise);
        
        // 完成后从集合中移除
        promise.finally(() => executing.delete(promise));
      }
      
      // 等待任意一个完成
      const result = await Promise.race(executing);
      yield result;
    }
  }
  
  private async executeTool(call: ToolCall): Promise<ToolResult> {
    const tool = this.tools.get(call.name);
    
    try {
      const result = await tool.execute(call.input, this.context);
      
      // 通知进度
      this.options.onProgress?.({
        tool: call.name,
        status: 'completed',
      });
      
      return result;
    } catch (error) {
      this.options.onProgress?.({
        tool: call.name,
        status: 'failed',
        error: error,
      });
      
      return {
        success: false,
        error: error.message,
      };
    }
  }
}

并发控制

最大并发数

const MAX_CONCURRENCY = 5;

// 限制同时执行的工具数量
const semaphore = new Semaphore(MAX_CONCURRENCY);

async function executeTool(call: ToolCall): Promise<ToolResult> {
  await semaphore.acquire();
  
  try {
    return await tool.execute(call.input, context);
  } finally {
    semaphore.release();
  }
}

工具分组

某些工具必须串行执行：

function groupToolCalls(calls: ToolCall[]): ToolGroup[] {
  const groups = [];
  let currentGroup = [];
  
  for (const call of calls) {
    if (isWriteTool(call.name)) {
      // 写操作开始新组
      if (currentGroup.length > 0) {
        groups.push(currentGroup);
      }
      groups.push([call]);
      currentGroup = [];
    } else {
      // 读操作可以并发
      currentGroup.push(call);
    }
  }
  
  if (currentGroup.length > 0) {
    groups.push(currentGroup);
  }
  
  return groups;
}

错误处理

错误级联

一个工具失败可能影响其他工具：

async function executeWithCascade(
  calls: ToolCall[]
): Promise<ToolResult[]> {
  const results = [];
  
  for await (const result of executor.execute(calls)) {
    results.push(result);
    
    // 如果关键工具失败，中断后续执行
    if (!result.success && isCritical(result.tool)) {
      executor.abort();
      break;
    }
  }
  
  return results;
}

部分失败处理

function handlePartialFailure(
  results: ToolResult[]
): ToolResult {
  const successful = results.filter(r => r.success);
  const failed = results.filter(r => !r.success);
  
  if (failed.length === 0) {
    // 全部成功
    return {
      success: true,
      output: aggregateResults(successful),
    };
  }
  
  if (successful.length === 0) {
    // 全部失败
    return {
      success: false,
      error: 'All tools failed',
    };
  }
  
  // 部分成功
  return {
    success: true,
    output: aggregateResults(successful),
    warnings: failed.map(f => f.error),
  };
}

进度反馈

实时进度

const executor = new StreamingToolExecutor(tools, {
  maxConcurrency: 5,
  onProgress: (progress) => {
    // 更新 UI
    updateProgressBar(progress);
  },
});

for await (const result of executor.execute(toolCalls)) {
  console.log(`✓ ${result.tool} completed`);
}

进度条

function updateProgressBar(progress: Progress) {
  const { completed, total } = progress;
  const percentage = (completed / total * 100).toFixed(0);
  
  process.stdout.write(`\rProgress: ${percentage}% [${completed}/${total}]`);
}

投机性执行

预测下一步

async function speculativeExecute(
  messages: Message[]
): Promise<void> {
  // 预测可能的工具调用
  const predictions = predictNextTools(messages);
  
  // 后台预执行（不阻塞）
  predictions.forEach(pred => {
    executeTool(pred.tool, pred.input)
      .then(result => {
        // 缓存结果
        speculativeCache.set(pred.key, result);
      })
      .catch(() => {
        // 预测错误，忽略
      });
  });
}

使用预执行结果

async function executeTool(
  call: ToolCall
): Promise<ToolResult> {
  const key = `${call.name}:${JSON.stringify(call.input)}`;
  
  // 检查是否有预执行结果
  if (speculativeCache.has(key)) {
    return speculativeCache.get(key);
  }
  
  // 正常执行
  return await tool.execute(call.input, context);
}

性能测量

执行时间

async function measureToolExecution(
  call: ToolCall
): Promise<{ result: ToolResult; duration: number }> {
  const start = performance.now();
  const result = await executeTool(call);
  const duration = performance.now() - start;
  
  // 记录指标
  recordMetric(call.name, duration);
  
  return { result, duration };
}

下一步

了解并发控制的限制策略
探索缓存策略的实现
查看启动优化的技术

PreviousREADME Nextstartup

hashtag设计理念

hashtag问题

hashtag解决方案

hashtag核心实现

hashtagStreamingToolExecutor

hashtag并发控制

hashtag最大并发数

hashtag工具分组

hashtag错误处理

hashtag错误级联

hashtag部分失败处理

hashtag进度反馈

hashtag实时进度

hashtag进度条

hashtag投机性执行

hashtag预测下一步

hashtag使用预执行结果

hashtag性能测量

hashtag执行时间

hashtag下一步

设计理念

问题

解决方案

核心实现

StreamingToolExecutor

并发控制

最大并发数

工具分组

错误处理

错误级联

部分失败处理

进度反馈

实时进度

进度条

投机性执行

预测下一步

使用预执行结果

性能测量

执行时间

下一步