security: sanitize ledger error details from unauthenticated endpoints #84

rwilliamspbg-ops wants to merge 13 commits into main
Conversation
Pull request overview
This PR adds “digital twin mesh” audit artifacts and observability upgrades, including a hash-chained proof ledger (with optional Cockroach/Postgres SQL storage), a new Grafana audit dashboard, and supporting scripts/CI tweaks.
Changes:
- Introduce hash-chained ledger entries with per-stream sequencing, idempotency/replay detection, checkpoints, and a `/api/v1/ledger/reconcile` endpoint; add optional SQL-backed ledger storage.
- Add a "Sovereign Audit Gold Standard" Grafana dashboard plus improved PromQL query validation.
- Add a continuous digital-twin mesh traffic generator script and commit a bundle of captured audit artifacts; switch some GitHub workflows to install a minimal `requirements-ci.txt`.
Reviewed changes
Copilot reviewed 26 out of 29 changed files in this pull request and generated 8 comments.
| File | Description |
|---|---|
| scripts/run_continuous_digital_twin_mesh.sh | Adds a looped traffic generator that emits offers/attestations/TPM events and polls key endpoints. |
| scripts/provision-grafana-dashboards.js | Registers a new “audit” dashboard definition for script-based provisioning/deploy. |
| scripts/check_dashboard_queries.py | Improves PromQL parsing to avoid misclassifying group labels as metrics. |
| requirements-ci.txt | Introduces a minimal CI Python dependency set (currently only NumPy). |
| internal/api/ledger.go | Expands the in-memory ledger to include stream sequencing, hash chaining, checkpoints, reconcile reporting, and an interface for pluggable backends. |
| internal/api/ledger_sql.go | Adds a Cockroach/Postgres-compatible SQL ledger backend with schema bootstrap, record, query, checkpoint, and readiness logic. |
| internal/api/handlers.go | Switches handler to ledger interface, adds readiness endpoints, idempotency header support, ledger reconcile route, and richer health/readiness metadata. |
| internal/api/handlers_test.go | Updates capability contract expectations and adds tests for replay/idempotency, reconcile, readiness, and fallback metadata. |
| grafana/provisioning/dashboards/audit_overview.json | Adds a new audit-focused Grafana dashboard JSON (uid sovereign-audit-gold-standard). |
| go.mod | Adds github.com/lib/pq for SQL ledger connectivity. |
| go.sum | Updates sums for github.com/lib/pq and removes older unused sums. |
| audit_results/digital_twin_continuous_20260408T001440Z/trust_snapshot_three_node.json | Captured trust snapshot evidence from the continuous mesh run. |
| audit_results/digital_twin_continuous_20260408T001440Z/train_status_final.json | Captured final training status evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/train_start.json | Captured training start response evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/README.md | Documents the contents of the captured artifact bundle. |
| audit_results/digital_twin_continuous_20260408T001440Z/ops_events_three_node.json | Captured recent ops/events evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/offers_after_three_node.json | Captured marketplace offers snapshot evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/offer_lidar.json | Captured lidar offer response evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/offer_image.json | Captured image offer response evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/offer_gps.json | Captured GPS offer response evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/continuous_digital_twin_mesh.log | Captured runtime log from the continuous generator. |
| audit_results/digital_twin_continuous_20260408T001440Z/attestations_after_three_node.json | Captured attestations feed snapshot evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/attest_lidar.json | Captured lidar attestation response evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/attest_image.json | Captured image attestation response evidence. |
| audit_results/digital_twin_continuous_20260408T001440Z/attest_gps.json | Captured GPS attestation response evidence. |
| .gitignore | Ignores /bin/ artifacts. |
| .github/workflows/fedavg-benchmark-compare.yml | Switches workflow to install dependencies from requirements-ci.txt. |
| .github/workflows/deploy.yml | Switches workflow to install dependencies from requirements-ci.txt. |
| .github/workflows/build.yml | Switches workflow to install dependencies from requirements-ci.txt. |
```diff
+	ledgerReady, ledgerErr := h.ledger.Readiness()
+	status := "healthy"
+	if !ledgerReady || strings.TrimSpace(h.ledgerInitError) != "" {
+		status = "degraded"
+	}
+
-	response := map[string]string{
-		"status": "healthy",
+	response := map[string]interface{}{
+		"status":  status,
		"service": "sovereign-map-fl",
		"time":    time.Now().UTC().Format(time.RFC3339),
+		"ledger": map[string]interface{}{
+			"ready":        ledgerReady,
+			"storage_mode": h.ledger.StorageMode(),
+			"init_error":   h.ledgerInitError,
+			"error":        ledgerErr,
+		},
```
/health, /readyz, and /api/v1/capabilities are unauthenticated endpoints, but the response now includes ledger.init_error and ledger.error. Those error strings can include internal connection details (and potentially credentials, depending on driver/DSN errors). Consider omitting these fields on open endpoints, or returning a sanitized/high-level status code/message and exposing details only on an auth-protected debug endpoint/logs.
Applied in commits e094c08 and d2b686e. The init_error and error string fields have been removed from /health, /readyz, and /api/v1/capabilities. These unauthenticated endpoints now expose only a boolean has_error field (using consistent strings.TrimSpace on both error sources) to indicate whether a ledger error exists without revealing connection details or credentials. Full error details remain on the auth-protected /api/v1/ledger endpoint.
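The sanitized response shape described above can be sketched as follows. The function name `sanitizedLedgerStatus` and its parameters are illustrative, not the PR's actual code; only the `has_error`, `ready`, and `storage_mode` field names and the `strings.TrimSpace` check come from the thread.

```go
package main

import (
	"fmt"
	"strings"
)

// sanitizedLedgerStatus builds the ledger block returned on unauthenticated
// endpoints. Only a boolean has_error is exposed; the raw error strings stay
// on the auth-gated /api/v1/ledger endpoint. (Illustrative sketch.)
func sanitizedLedgerStatus(ready bool, storageMode, initErr, lastErr string) map[string]interface{} {
	// TrimSpace both sources so whitespace-only errors don't flip the flag.
	hasError := strings.TrimSpace(initErr) != "" || strings.TrimSpace(lastErr) != ""
	return map[string]interface{}{
		"ready":        ready,
		"storage_mode": storageMode,
		"has_error":    hasError, // never the error text itself
	}
}

func main() {
	// A DSN-style failure is reduced to a boolean on the open endpoint.
	fmt.Println(sanitizedLedgerStatus(false, "sql", "pq: password authentication failed", ""))
}
```

Callers that need the full error text must go through the authenticated ledger endpoint instead.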
```go
if i > 0 {
	wantSeq := entries[i-1].SeqNo + 1
	if entries[i].SeqNo != wantSeq {
		streamIssues = append(streamIssues, fmt.Sprintf("sequence gap at stream=%s seq=%d expected=%d", streamID, entries[i].SeqNo, wantSeq))
	}
}
```
Reconcile() currently treats any per-stream sequence gap or prev_hash mismatch between retained entries as corruption. With the global ring buffer eviction policy, it’s expected that older entries (including an entry’s immediate predecessor) can be evicted while later entries remain, which will reliably produce false sequence gap / prev_hash mismatch issues. Consider changing reconcile semantics to either (a) only verify contiguous segments (skip continuity checks when the previous seq/hash isn’t present), or (b) store per-stream ring buffers so retained entries stay contiguous per stream.
Applied in commit 9b05d0c. reconcileEntries now skips continuity checks for non-contiguous entries — sequence gaps from ring buffer eviction are no longer flagged as issues. Only contiguous adjacent entries have their prev_hash chain verified. A gaps counter is added to each stream's summary for observability. A new test TestReconcileSkipsGapsBetweenNonContiguousEntries validates this directly by calling reconcileEntries with a gapped entry set.
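A minimal sketch of the gap-tolerant semantics described above: continuity is only checked between adjacent retained entries with contiguous sequence numbers, and eviction gaps are counted rather than flagged. The name `reconcileStream` and its signature are hypothetical (the PR's function is `reconcileEntries`).

```go
package main

import "fmt"

// LedgerEntry is a simplified stand-in for the PR's entry type.
type LedgerEntry struct {
	SeqNo    int64
	PrevHash string
	Hash     string
}

// reconcileStream verifies the prev_hash chain only across contiguous
// neighbors; a missing predecessor (evicted by the ring buffer) increments
// gaps instead of producing a false corruption report.
func reconcileStream(entries []LedgerEntry) (issues []string, gaps int) {
	for i := 1; i < len(entries); i++ {
		if entries[i].SeqNo != entries[i-1].SeqNo+1 {
			gaps++ // predecessor evicted: skip the continuity check
			continue
		}
		if entries[i].PrevHash != entries[i-1].Hash {
			issues = append(issues, fmt.Sprintf("prev_hash mismatch at seq=%d", entries[i].SeqNo))
		}
	}
	return issues, gaps
}

func main() {
	retained := []LedgerEntry{
		{SeqNo: 1, Hash: "a"},
		{SeqNo: 2, PrevHash: "a", Hash: "b"},
		{SeqNo: 7, PrevHash: "evicted", Hash: "c"}, // seqs 3-6 evicted
		{SeqNo: 8, PrevHash: "c", Hash: "d"},
	}
	issues, gaps := reconcileStream(retained)
	fmt.Printf("issues=%d gaps=%d\n", len(issues), gaps) // prints: issues=0 gaps=1
}
```

The eviction gap between seq 2 and seq 7 is reported in the `gaps` counter for observability but does not appear as a corruption issue.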
```go
seqNo, prevHash, err := loadStreamHeadForUpdate(tx, streamID)
if err != nil {
	return LedgerEntry{}, false
}
nextSeq := seqNo + 1
```
Concurrency issue: when a stream has no row in mohawk_ledger_stream_heads, loadStreamHeadForUpdate returns (0, "", nil) and no lock is taken. Two concurrent transactions can both compute nextSeq=1 and race; one will hit the UNIQUE(stream_id, seq_no) constraint and the event will be dropped (currently with no retry and no surfaced error). Consider creating/locking the head row first (e.g., INSERT ... ON CONFLICT DO NOTHING then SELECT ... FOR UPDATE), and/or using SERIALIZABLE + retry-on-restart/unique-violation handling so concurrent inserts don’t lose events.
Applied in commit 9b05d0c. loadStreamHeadForUpdate now does INSERT INTO mohawk_ledger_stream_heads (stream_id, seq_no, entry_hash) VALUES ($1, 0, '') ON CONFLICT (stream_id) DO NOTHING before the SELECT ... FOR UPDATE. This ensures the row exists and is locked before either concurrent transaction computes nextSeq, eliminating the lost-update race on first write for a new stream.
```go
tx, err := l.db.Begin()
if err != nil {
	return LedgerEntry{}, false
}
```
RecordWithOptions returns an empty entry on SQL failures (Begin/query/insert/commit) and drops the underlying error. Even if the interface can’t return an error, this makes failures silent and hard to debug/alert on. Consider at least emitting a log/metric and/or storing the last error string in the ledger instance for /api/ledger (auth-gated) inspection.
Applied in commit 9b05d0c. SQL write failures in RecordWithOptions are now logged via log.Printf and stored with a timestamp via recordWriteErr(). A LastError() string method was added to both SQLProofLedger and the ProofLedgerStore interface, and the auth-gated /api/v1/ledger response now includes a last_write_error field for operator inspection.
```js
	},
	audit: {
		uid: 'sovereign-audit-gold-standard',
		title: 'Sovereign Audit Gold Standard',
		tags: ['audit', 'ledger', 'verification', 'trust', 'sovereign-map'],
		description: 'Ledger integrity, verification quality, replay defense, and control-plane traceability',
		panels: 13,
		rows: 4
	}
```
This script now defines an audit dashboard with uid sovereign-audit-gold-standard, but the repo also provisions a static dashboard JSON with the same uid (grafana/provisioning/dashboards/audit_overview.json). Using both approaches (file provisioning + script deploy/generate) will cause uid collisions and unintended overwrites. Consider either (a) having this script deploy the checked-in JSON dashboard, or (b) using a distinct uid/name here to avoid clobbering the provisioned dashboard.
Applied in commit 9b05d0c. The audit entry in DASHBOARDS now has a sourcePath pointing to grafana/provisioning/dashboards/audit_overview.json. generateDashboardJSON reads and wraps the checked-in JSON when sourcePath is set, so the script deploys the authoritative dashboard instead of generating a conflicting copy with the same uid. A descriptive error with the dashboard key and path is thrown if the file cannot be read.
```sh
local offer_list
offer_list="$(curl -sS "${BASE_URL}/marketplace/offers?seller_node_id=${seller}&limit=10")"
local existing_id
existing_id="$(jq -r '.offers[0].offer_id // empty' <<<"${offer_list}")"
```
ensure_offer depends on jq but the script doesn’t validate that jq is installed (unlike scripts/participant_bootstrap.sh). With set -euo pipefail, a missing jq will terminate the loop. Consider adding a small preflight check for required tools (curl, jq) near startup with a clear error message.
Applied in commit 9b05d0c. Added preflight checks for curl and jq using command -v immediately after variable initialization (matching the style of participant_bootstrap.sh). The script exits with a descriptive error message on stderr if either tool is missing, before entering the traffic-generation loop.
Unauthenticated endpoints (`/health`, `/readyz`, `/api/v1/capabilities`) were returning raw `init_error` and `error` strings from the ledger subsystem. These can contain SQL DSNs, driver error messages, or credentials depending on the backend configuration.

Summary

Replace the opaque error strings with a boolean `has_error` flag on all open endpoints. Full error details remain available on the auth-protected `/api/v1/ledger` endpoint.

Before (`/health` response, unauthenticated):

After:
Changes:
- `HealthCheck` (`/health`): removed `init_error`/`error`, replaced with a `has_error` bool
- `ReadinessCheck` (`/readyz`): same
- `GetCapabilities` (`/api/v1/capabilities`): removed `ledger_state.init_error`, replaced with a `has_error` bool
- `GetLedger` (`/api/v1/ledger`, auth-gated): unchanged; full details are still exposed here
- Applied `strings.TrimSpace` to both error sources in the `has_error` evaluation
- Updated `TestCockroachBackendFallbackMetadata` to assert `has_error == true`

Validation

`go test ./internal/api/...`

Evidence
N/A — no runtime behavior change for authenticated callers; open endpoints lose error string fields only.
Checklist
- `make smoke` passes locally
- `make screenshots-check` passes locally (or N/A for non-runtime changes): N/A
- `.github/workflows/*.yml` `uses:` refs are pinned to 40-char commit SHAs
- `deploy.yml` staging and production paths validated

Risk and Rollback
- Rollback is a revert of commits `e094c08` and `d2b686e`; no schema or config changes
- Clients parsing the `init_error`/`error` fields from open endpoints will see those keys absent; a `has_error` bool is added in their place