feat: 支持多语言提示词本地化和界面优化
- 添加 prompt_locale 参数支持简体中文、繁体中文和英文提示词本地化 - 移除内置 agents 配置以简化系统架构 - 更新 ContextBuilder 使用动态提示词模板而非硬编码内容 - 在 AgentLoop、Web 接口和 AgentService 中传递 locale 参数 - 添加输出语言指令确保用户界面内容按指定语言生成 - 扩展前端 LanguageSwitcher 组件支持三种语言选项 - 优化 Header 和侧边栏组件的响应式布局和文本截断处理 - 更新测试用例验证不同语言环境下的提示词正确性
This commit is contained in:
@ -2,6 +2,12 @@
|
||||
|
||||
> **For agentic workers:** REQUIRED SUB-SKILL: Use superpowers:subagent-driven-development (recommended) or superpowers:executing-plans to implement this plan task-by-task. Steps use checkbox (`- [ ]`) syntax for tracking.
|
||||
|
||||
Related product planning artifacts:
|
||||
|
||||
- [Product Discovery Report](../../product-discovery/skill-replay-eval/product-discovery-report.md)
|
||||
- [PRD](../../product-discovery/skill-replay-eval/PRD-skill-replay-eval.md)
|
||||
- [Launch And Maintenance Runbook](../../product-discovery/skill-replay-eval/launch-maintenance-runbook.md)
|
||||
|
||||
**Goal:** Replace heuristic-only skill draft evaluation with replay-style reports that cover all tools through either safe execution or LLM surrogate judgment, while preserving base skill content during draft synthesis.
|
||||
|
||||
**Architecture:** Extend the existing skill learning pipeline instead of replacing it. Add replay report fields to `SkillDraftEvalReport`, introduce focused helper modules under `beaver/skills/learning/`, then wire the enhanced evaluator through the existing `SkillLearningPipelineService.evaluate_draft()` path and Skills UI.
|
||||
|
||||
@ -1,5 +1,11 @@
|
||||
# Skill Replay Eval Design
|
||||
|
||||
Related product planning artifacts:
|
||||
|
||||
- [Product Discovery Report](../../product-discovery/skill-replay-eval/product-discovery-report.md)
|
||||
- [PRD](../../product-discovery/skill-replay-eval/PRD-skill-replay-eval.md)
|
||||
- [Launch And Maintenance Runbook](../../product-discovery/skill-replay-eval/launch-maintenance-runbook.md)
|
||||
|
||||
## Goal
|
||||
|
||||
Improve skill draft evaluation so it measures real task behavior instead of relying on heuristic draft scoring. The new evaluation must cover every tool involved in a skill, while separating tools that can be executed safely from tools that require LLM surrogate judgment.
|
||||
|
||||
Reference in New Issue
Block a user