[prompt-clustering] Copilot Agent Prompt Clustering Analysis — 2026-05-15 #32333
Closed
This discussion has been marked as outdated by Copilot Agent Prompt Clustering Analysis. A newer discussion is available at Discussion #32602.
Summary
Analysis Period: Last 30 days (PRs created 2026-04-27 → 2026-05-15)
Total Tasks Analyzed: 998
Clusters Identified: 8
Overall Success Rate: 80.0% (798 merged / 196 closed / 4 open)
Eight thematic clusters emerge from NLP analysis of copilot agent task descriptions. The dominant cluster (~49% of tasks) covers general workflow code and bug fixes, while the smallest is a tight outlier of 20 WIP/placeholder lint-fix PRs with only a 30% merge rate. Cache-memory & experiment tasks have the highest success rate at 90.6%; AWF firewall/config work and WIP lint-fixes have the lowest.
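The headline figures can be re-derived from the per-cluster outcomes listed in the breakdown further down. The following is a minimal sketch, assuming the merge rate is computed as merged PRs divided by all tasks in the cluster (merged + closed + open), which is the definition that reproduces the reported percentages. The names `CLUSTERS` and `merge_rate` are illustrative and do not come from the analysis pipeline.

```python
# Per-cluster outcomes copied from this report: (cluster, merged, closed, open).
CLUSTERS = [
    ("C5", 404, 85, 3),
    ("C0", 87, 24, 0),
    ("C2", 78, 12, 0),
    ("C6", 61, 19, 0),
    ("C4", 60, 15, 1),
    ("C3", 44, 21, 0),
    ("C1", 58, 6, 0),
    ("C7", 6, 14, 0),
]


def merge_rate(merged, closed, open_):
    """Merged PRs as a percentage of all tasks (assumed denominator: merged + closed + open)."""
    total = merged + closed + open_
    return 100.0 * merged / total


# Merge rate per cluster, plus the overall rate across all clusters.
rates = {name: merge_rate(m, c, o) for name, m, c, o in CLUSTERS}
total_merged = sum(m for _, m, _, _ in CLUSTERS)
total_tasks = sum(m + c + o for _, m, c, o in CLUSTERS)
overall = 100.0 * total_merged / total_tasks

best = max(rates, key=rates.get)    # cluster with the highest merge rate
worst = min(rates, key=rates.get)   # cluster with the lowest merge rate

for name, rate in sorted(rates.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {rate:.1f}%")
print(f"overall: {overall:.1f}% ({total_merged}/{total_tasks})")
```

Including open PRs in the denominator matters mostly for C5, where 3 tasks are still open; excluding them would give a slightly higher rate than the one reported.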
Cluster Map
Daily Volume
Cluster Summary Table
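C5 (General workflow code & bug fixes): workflow, files, bug, job, path, run
C0 (MCP servers / CLI / Playwright tooling): mcp, cli, tool, mode, playwright, gateway
C2 (Agent prompt / model optimization): model, agent, daily, sub, token, workflow
C6 (Shared workflow imports / consolidation): shared, workflow, workflows, import, agentic, otlp
C4 (Safe-output handlers): safe, safe output, output, safe outputs, outputs, issue
C3 (AWF firewall / config / version bumps): awf, firewall, version, config, awf config, default
C1 (Cache memory & prompt experiments): experiment, cache, variant, workflow, memory, prompt
C7 (WIP / placeholder lint-fix PRs): started description, thanks asking, asking work, work started, asking, date form
Complexity vs Success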
Per-Cluster Breakdown (top 5 representative PRs each)
C5 — General workflow code & bug fixes
Size: 492 tasks (49.3% of total)
Outcome: 404 merged, 85 closed, 3 open → 82.1% merge rate
Avg per PR: 22.8 files changed · +504/−480 lines · 3.8 commits · 1.5 reviews · 2.3 comments
Top keywords:
workflow, files, bug, job, path, run, failure, agent, engine, github
Representative PRs (titles truncated): … `GH_AW_INFO_ENGINE_ID` into setup steps so setup OTel spans emit `gh-aw.engine.i…
C0 — MCP servers / CLI / Playwright tooling
Size: 111 tasks (11.1% of total)
Outcome: 87 merged, 24 closed, 0 open → 78.4% merge rate
Avg per PR: 33.5 files changed · +366/−204 lines · 3.6 commits · 1.4 reviews · 2.9 comments
Top keywords:
mcp, cli, tool, mode, playwright, gateway, tools, help, validation, mcp gateway
Representative PRs (titles truncated): … `emoji` frontmatter field #32200 and `--staged` flag to `compile` for forced staged workflows
C2 — Agent prompt / model optimization
Size: 90 tasks (9.0% of total)
Outcome: 78 merged, 12 closed, 0 open → 86.7% merge rate
Avg per PR: 25.2 files changed · +432/−154 lines · 3.4 commits · 1.6 reviews · 2.1 comments
Top keywords:
model, agent, daily, sub, token, workflow, run, inline, alias, inline sub
C6 — Shared workflow imports / consolidation
Size: 80 tasks (8.0% of total)
Outcome: 61 merged, 19 closed, 0 open → 76.2% merge rate
Avg per PR: 40.2 files changed · +731/−321 lines · 4.4 commits · 2.0 reviews · 4.5 comments
Top keywords:
shared, workflow, workflows, import, agentic, otlp, agentic workflow, frontmatter, agentic workflows, imports
Representative PRs (titles truncated): … `gh aw deploy` to orchestrate remote workflow rollout via PR
C4 — Safe-output handlers (issue/discussion/PR)
Size: 76 tasks (7.6% of total)
Outcome: 60 merged, 15 closed, 1 open → 78.9% merge rate
Avg per PR: 28.3 files changed · +392/−134 lines · 4.1 commits · 1.4 reviews · 3.1 comments
Top keywords:
safe, safe output, output, safe outputs, outputs, issue, agent, workflow, handler, discussion
Representative PRs (titles truncated): … `update_pull_request` update-branch soft 422s as non-fatal in safe outputs; … `SafeOutputTargetConfig`
C3 — AWF firewall / config / version bumps
Size: 65 tasks (6.5% of total)
Outcome: 44 merged, 21 closed, 0 open → 67.7% merge rate
Avg per PR: 98.1 files changed · +992/−639 lines · 4.0 commits · 1.6 reviews · 5.8 comments
Top keywords:
awf, firewall, version, config, awf config, default, schema, bump, awf firewall, proxy
Representative PRs (titles truncated): … `max-runs` from 100 to 500 across compiler, schema, and docs; … `/reflect` no longer fails during smoke runs
C1 — Cache memory & prompt experiments
Size: 64 tasks (6.4% of total)
Outcome: 58 merged, 6 closed, 0 open → 90.6% merge rate
Avg per PR: 13.3 files changed · +340/−82 lines · 3.5 commits · 1.7 reviews · 1.8 comments
Top keywords:
experiment, cache, variant, workflow, memory, prompt, state, cache memory, run, experiments
Representative PRs (titles truncated): … `detail_level` A/B experiment to daily architecture diagram workflow output
C7 — WIP / placeholder lint-fix PRs
Size: 20 tasks (2.0% of total)
Outcome: 6 merged, 14 closed, 0 open → 30.0% merge rate
Avg per PR: 37.1 files changed · +286/−24 lines · 1.9 commits · 0.6 reviews · 0.3 comments
Top keywords:
started description, thanks asking, asking work, work started, asking, date form, form plan, plan progress, thanks, description date
Sampled Data Table — 10 most recent PRs per cluster (80 total)
… `GH_AW_INFO_ENGINE_ID` into setup steps so setup OTel spans…
… `gh aw status` and `gh aw logs` bypass local filesystem when `--r…
… `CreatePullRequestsConfig`
… `--staged` flag to `compile` for forced staged workflows
… `--docker-host-path-prefix` in genera…
… `aw-gpu-runner-T4` to stop c…
… `/reflect` for Copilot model discovery in daily-model-inventor…
… `gh aw deploy` to orchestrate remote workflow rollout via PR
… `emoji` frontmatter field
… `observability.otlp.ignore-if-missing` to downgrade missing OTLP c…
… `allowed` label filters for safe-outputs
… `update_pull_request` update-branch soft 422s as non-fatal in s…
… `SafeOutputTargetConfig`
… `@copilot` mentions in PR Sous Chef safe outputs
… `max-runs` from 100 to 500 across compiler, schema, a…
… `/reflect` no longer fails during…
… `test` workflow golden o…
… `firewall.effective-token-steering` compi…
… `detail_level` A/B experiment to daily architecture diagram workfl…
… `reasoning_depth` A/B experiment to daily-security-red-team workfl…
… `prompt_style` A/B experiment and variant-gated…
Key Findings
… (thanks asking, work started, description date, form plan) reveal these are auto-generated lint-fix attempts that stall. Worth investigating why these specific recoveries fail so often.
Recommendations
… (workflow, files, bug, job, path) are essentially the union of everything that didn't fit elsewhere. A second pass focusing just on C5 would surface finer structure.
Methodology
… `## Changes`) for each of 1000 copilot PRs created 2026-04-27 → 2026-05-15. After filtering prompts <30 chars, 998 PRs remained.
… (gh, aw, pkg, pr, etc).
References: §25913157542
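The prompt-length filter described under Methodology can be sketched in a few lines. This is an illustration under stated assumptions, not the pipeline's actual code: it assumes the truncated list (gh, aw, pkg, pr, etc) names tokens excluded from keyword extraction, it uses plain token counts rather than whatever NLP weighting the analysis applied, and the function names and sample prompts are hypothetical.

```python
import re
from collections import Counter

MIN_PROMPT_CHARS = 30  # threshold stated in the report ("prompts <30 chars" are dropped)

# Assumption: these tokens are excluded from keywords. The report's list is
# truncated ("etc"), so only the four names it shows are included here.
CUSTOM_STOP_WORDS = {"gh", "aw", "pkg", "pr"}


def filter_prompts(prompts):
    """Drop prompts shorter than MIN_PROMPT_CHARS (length after stripping whitespace)."""
    return [p for p in prompts if len(p.strip()) >= MIN_PROMPT_CHARS]


def top_keywords(prompts, n=5):
    """Count lowercase word tokens across prompts, skipping the assumed stop-words."""
    counts = Counter()
    for prompt in prompts:
        for token in re.findall(r"[a-z][a-z0-9_-]*", prompt.lower()):
            if token not in CUSTOM_STOP_WORDS:
                counts[token] += 1
    return [word for word, _ in counts.most_common(n)]


# Hypothetical prompts for illustration only; not taken from the dataset.
sample = [
    "Fix workflow job failure in gh aw compile path",
    "WIP",
    "Add cache memory experiment variant to daily workflow",
]

kept = filter_prompts(sample)         # "WIP" is too short and is dropped
keywords = top_keywords(kept, n=3)    # most frequent remaining tokens
print(len(kept), keywords)
```

A filter of this kind explains the drop from 1000 PRs to 998: two prompts fell under the length threshold before clustering.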