Conversation
Contributor
There was a problem hiding this comment.
Pull request overview
Adds richer observability and auditing capabilities for agentic workflow runs, including baseline comparison, execution-path insights, a policy evaluation CLI, and an opt-in GitHub Actions job summary section.
Changes:
- Introduces new JSON schemas for observability reports and observability guardrail policies.
- Adds baseline comparison to the audit report (compare against last successful run) and surfaces observability insights in audit/logs output.
- Adds an opt-in workflow compilation step to generate an “Observability” GitHub Actions job summary section, plus a new
observability-policyCLI command for policy evaluation.
Reviewed changes
Copilot reviewed 27 out of 27 changed files in this pull request and generated 5 comments.
Show a summary per file
| File | Description |
|---|---|
| schemas/observability-report.json | New schema describing run/fleet observability payloads (lineage, execution, reasoning, tooling/network/actuation, insights/findings). |
| schemas/observability-policy.json | New schema for policy rules evaluated against observability payloads. |
| pkg/workflow/observability_job_summary_test.go | Tests ensuring an opt-in job-summary step is injected (and absent by default). |
| pkg/workflow/frontmatter_types_test.go | Adds frontmatter parsing coverage for observability.job-summary. |
| pkg/workflow/frontmatter_types.go | Adds ObservabilityConfig and exposes it via FrontmatterConfig. |
| pkg/workflow/compiler_yaml_main_job.go | Hooks observability summary generation into main job step synthesis. |
| pkg/workflow/compiler_yaml_ai_execution.go | Implements opt-in observability summary step and mode extraction. |
| pkg/parser/schemas/main_workflow_schema.json | Extends workflow frontmatter schema to allow observability.job-summary on/off. |
| pkg/parser/schema_test.go | Validates schema acceptance of observability.job-summary. |
| pkg/cli/observability_schema_test.go | Verifies the observability report schema includes key sections/defs. |
| pkg/cli/observability_policy_test.go | Adds unit tests for policy evaluation + schema parsing. |
| pkg/cli/observability_policy_command_test.go | Tests CLI evaluation output, JSON emission, and blocking behavior. |
| pkg/cli/observability_policy_command.go | Implements new observability-policy eval command and rendering/error behavior. |
| pkg/cli/observability_policy.go | Adds policy/payload types and matching logic for rule evaluation. |
| pkg/cli/observability_insights_test.go | Adds tests for audit/logs observability insight generation. |
| pkg/cli/observability_insights.go | Implements observability insight generation + console rendering helper. |
| pkg/cli/mcp_schema_test.go | Ensures schema generation works for ObservabilityPolicy. |
| pkg/cli/logs_report.go | Adds aggregated observability insights to logs data and console rendering. |
| pkg/cli/audit_test.go | Updates audit tests to account for new Comparison field. |
| pkg/cli/audit_report_render.go | Renders new “Comparison” and “Observability Insights” sections in audit output. |
| pkg/cli/audit_report.go | Adds Comparison + ObservabilityInsights to audit JSON and populates insights. |
| pkg/cli/audit_comparison_test.go | Adds tests for comparison behavior (no baseline / risky / stable). |
| pkg/cli/audit_comparison.go | Implements baseline lookup, snapshot extraction, delta/classification, and recommendations. |
| pkg/cli/audit.go | Computes current snapshot + attaches baseline comparison into audit output. |
| cmd/gh-aw/main.go | Registers the new observability-policy command under the analysis group. |
| actions/setup/js/generate_observability_summary.test.cjs | Adds vitest coverage for summary generation and opt-out behavior. |
| actions/setup/js/generate_observability_summary.cjs | Implements job summary synthesis from runtime artifacts (aw_info, agent_output, gateway logs). |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
Contributor
|
@copilot review comments, lint go |
Co-authored-by: pelikhan <[email protected]> Agent-Logs-Url: https://github.com/github/gh-aw/sessions/3c9ee3d6-0a27-4f53-810e-025fc6e26234
Contributor
Addressed all review comments in commit
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR adds baseline-aware auditing, policy-driven observability checks, and ready-to-use monitoring workflows so teams can detect risky agentic behavior earlier and operationalize it across repositories.
This is important since with agentic workflows, we need to be looking at behavior not just outcomes: a workflow can keep passing while becoming more expensive, less controlled, or less predictable. Standard GitHub Actions status does not tell you that.
Logs: better
gh aw logsreporting across multiple runsAudits: smarter
gh aw auditoutput that explains what happened in a runAgentic Observability Kit