Safe-Outputs Discussions Enforcement Test Results
Run: https://github.com/github/gh-aw-mcpg/actions/runs/24604936877
Trigger: schedule
Configuration tested: create-discussion (max:1, prefix, category), update-discussion (enabled, all fields), close-discussion (required-category:General, required-labels:[smoke-test]), add-comment (max:2, target:triggering)
Phase 1: create-discussion
| Test |
Operation |
Expected |
Actual |
Status |
| 1.1 |
Create discussion (valid prefix+category+label) |
✅ Processed |
✅ Processed ({"result":"success"}) |
✅ PASS |
| 1.2 |
Create 2nd discussion (max exceeded) |
❌ Rejected |
✅ Processed ({"result":"success"}) |
❌ FAIL |
Phase 2: update-discussion
| Test |
Operation |
Expected |
Actual |
Status |
| 2.1 |
Update labels: ["smoke-test", "status"] |
✅ Processed |
✅ Processed ({"result":"success"}) |
✅ PASS |
| 2.2 |
Update body (append note) |
✅ Processed |
✅ Processed ({"result":"success"}) |
✅ PASS |
Phase 3: close-discussion
| Test |
Operation |
Expected |
Actual |
Status |
| 3.1 |
Close test discussion (valid labels+category) |
✅ Processed |
✅ Processed ({"result":"success"}) |
✅ PASS |
| 3.2 |
Close discussion without required label |
❌ Rejected |
SKIPPED — no suitable target (all visible discussions have smoke-test label) |
✅ SKIPPED |
| 3.3 |
Close 2nd discussion (max exceeded) |
❌ Rejected |
✅ Processed ({"result":"success"}) |
❌ FAIL |
Phase 4: add-comment (target: triggering)
| Test |
Operation |
Expected |
Actual |
Status |
| 4.1 |
Comment on triggering item (1st) |
✅ Processed |
SKIPPED |
✅ SKIPPED |
| 4.2 |
Comment on triggering item (2nd) |
✅ Processed |
SKIPPED |
✅ SKIPPED |
| 4.3 |
3rd comment (max: 2 exceeded) |
❌ Rejected |
SKIPPED |
✅ SKIPPED |
| 4.4 |
Comment on non-triggering item |
❌ Rejected |
SKIPPED |
✅ SKIPPED |
Summary
- Phase 1 (create-discussion): 1/2 ✅
- Phase 2 (update-discussion): 2/2 ✅
- Phase 3 (close-discussion): 1/3 ✅ (1 skipped)
- Phase 4 (add-comment): SKIPPED (scheduled run — no triggering item)
- Overall: FAIL
Notes
- Tests 1.2 and 3.3 both expected ❌ Rejected (max:1 enforcement) but returned
{"result":"success"}. This may indicate enforcement is deferred to workflow finalization rather than applied in real-time during agent execution, or it may indicate a bug in the max enforcement for create-discussion and close-discussion. Review the finalized workflow outputs to determine actual enforcement behavior.
- Test 3.2 was SKIPPED: all discoverable discussions had the smoke-test label; no suitable unlabeled discussion was available to test negative label enforcement.
- Phase 4 was SKIPPED: this workflow is triggered by schedule, so there is no triggering PR or discussion item.
💬 Safe-outputs discussions enforcement test by Smoke Safe-Outputs Discussions
Safe-Outputs Discussions Enforcement Test Results
Run: https://github.com/github/gh-aw-mcpg/actions/runs/24604936877
Trigger: schedule
Configuration tested: create-discussion (max:1, prefix, category), update-discussion (enabled, all fields), close-discussion (required-category:General, required-labels:[smoke-test]), add-comment (max:2, target:triggering)
Phase 1: create-discussion
{"result":"success"}){"result":"success"})Phase 2: update-discussion
{"result":"success"}){"result":"success"})Phase 3: close-discussion
{"result":"success"}){"result":"success"})Phase 4: add-comment (target: triggering)
Summary
Notes
{"result":"success"}. This may indicate enforcement is deferred to workflow finalization rather than applied in real-time during agent execution, or it may indicate a bug in the max enforcement for create-discussion and close-discussion. Review the finalized workflow outputs to determine actual enforcement behavior.