Address residual P3 docs drift from re-audit of PR #408 by igerber · Pull Request #424 · igerber/diff-diff

igerber · 2026-05-13T10:48:14Z

Summary

Audit follow-up to PR #408. The restored CI reviewer caught one actionable docs-drift item: stale mentions of "plug-in-only SE" and a `survey_design` mutex that #408 itself lifted.

CHANGELOG: The `paths_of_interest` entry's mutex list still named `survey_design` and `heterogeneity` in the by_path precondition gate. Both were lifted later in the same Unreleased cycle (Compose by_path / paths_of_interest with survey_design (Wave 4 #10) #408 lifted `survey_design`, dCDH by_path Wave 5 #11: + heterogeneity (predict_het per-by_level) #412 composed in `heterogeneity`). Update to reflect the shipped gate (`drop_larger_lower / L_max / design2 / honest_did` only) and call out the lifted mutexes explicitly.
`_compute_path_effects` docstring in `chaisemartin_dhaultfoeuille.py`: described SE only as "plug-in via `_plugin_se(...)`". Add a dedicated SE-formation section listing both the non-survey plug-in path and the new survey path (path-restricted per-period IF routed through `_survey_se_from_group_if`, replicate-weight `df_survey` propagation via `replicate_n_valid_list`, and the shared post-call `_refresh_path_inference` reconciling stored inference fields against the final `df_survey`).
`_compute_path_placebos` docstring: mirrored the same plug-in-only description. Add a parallel SE-formation section pointing back at `_compute_path_effects` for the shared survey contract.

No runtime behavior change - documentation accuracy only.

Test plan

CI - no test changes; no functional code changed.

🤖 Generated with Claude Code

Restored CI reviewer caught one actionable docs-drift item: stale mentions of "plug-in-only SE" and a `survey_design` mutex that #408 itself lifted. 1. CHANGELOG paths_of_interest entry listed `survey_design` in the by_path precondition gate alongside `heterogeneity` / `design2` / `honest_did`. Both `survey_design` (lifted by #408) and `heterogeneity` (composed in by #412) have been removed from that gate. Update the parenthetical to reflect the shipped gate and note the lifted mutexes explicitly to avoid misleading readers about current behavior. 2. `_compute_path_effects` docstring described SE only as "plug-in via _plugin_se(...)". Add a dedicated SE-formation section listing the non-survey plug-in path and the new survey path (path-restricted per-period IF routed through `_survey_se_from_group_if`, replicate-weight df propagation via `replicate_n_valid_list`, and the shared post-call `_refresh_path_inference` reconciling stored inference fields against the final `df_survey`). 3. `_compute_path_placebos` docstring mirrored the same plug-in-only description. Add a parallel SE-formation section pointing back at `_compute_path_effects` for the shared survey contract. No runtime behavior change. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions · 2026-05-13T10:51:25Z

Overall Assessment

✅ Looks good

Executive Summary

Docs-only PR; no executable estimator, weighting, variance, assumption, or default-behavior code changed.
Affected methodology surface: ChaisemartinDHaultfoeuille by_path / paths_of_interest per-path event-study and placebo survey-design SE documentation.
The updated _compute_path_effects / _compute_path_placebos docstrings match the shipped implementation and the registry note on per-path survey-design SE (diff_diff/chaisemartin_dhaultfoeuille.py:L5921-L5943, diff_diff/chaisemartin_dhaultfoeuille.py:L6299-L6316, diff_diff/chaisemartin_dhaultfoeuille.py:L6079-L6102, diff_diff/chaisemartin_dhaultfoeuille.py:L6428-L6447, diff_diff/chaisemartin_dhaultfoeuille.py:L7852-L7886, docs/methodology/REGISTRY.md:L643-L645).
No P0/P1 findings. One residual P3 docs nit remains in the edited changelog entry.

Methodology

No findings. The touched docs now accurately describe the existing non-survey plug-in path, survey cell-period allocator, replicate-weight n_valid accumulation, and final per-path inference refresh for ChaisemartinDHaultfoeuille.

Code Quality

No findings. Docs-only patch.

Performance

No findings. Docs-only patch.

Maintainability

No findings.

Tech Debt

No findings. No new untracked deferred work is introduced, and nothing here needs a TODO.md deferral.

Security

No findings.

Documentation/Tests

Severity: P3. Impact: CHANGELOG.md:L16-L16 still points to chaisemartin_dhaultfoeuille.py:1118 for the by_path / paths_of_interest precondition gate, but the current gate is at diff_diff/chaisemartin_dhaultfoeuille.py:L1216-L1266. This is minor residual docs drift only; it does not affect methodology or runtime behavior. Concrete fix: update the inline line reference to the current block, or drop the numeric line anchor and describe the gate textually.
No test-related findings. Since the PR is documentation-only, the lack of test changes is reasonable.

The CHANGELOG paths_of_interest entry referenced `chaisemartin_dhaultfoeuille.py:1118` for the by_path precondition gate, but later edits (lifting the survey_design mutex in #408 and composing in heterogeneity in #412) have drifted the actual gate location to ~L1216. Drop the numeric anchor entirely and describe the gate textually so future edits do not silently invalidate the line reference. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions · 2026-05-13T10:55:44Z

🔁 AI review rerun (requested by @igerber)

Head SHA: 0134fc6620fd88f3819c814429ddaea3f4b1ecb4

Overall Assessment

✅ Looks good

Executive Summary

Re-review scope is satisfied: the prior residual P3 on the stale paths_of_interest changelog line reference is addressed in CHANGELOG.md:L16-L16.
This is a docs-only PR; I found no executable changes to estimator logic, weighting, variance/SE, identification assumptions, or defaults.
Affected methodology surface: ChaisemartinDHaultfoeuille per-path survey-design SE / placebo documentation. The updated _compute_path_effects and _compute_path_placebos docstrings are consistent with the shipped implementation and the Methodology Registry.
No P0/P1 findings.
No new P2/P3 findings in the changed hunks.
No test changes are needed for this patch. I did not run the test suite.

Methodology

No findings. I cross-checked the touched docs against the live survey path and registry contract. The new _compute_path_effects / _compute_path_placebos docstrings now describe the non-survey plug-in path, _survey_se_from_group_if routing, replicate n_valid accumulation, and the final per-path _refresh_path_inference refresh in line with diff_diff/chaisemartin_dhaultfoeuille.py:L5921-L5943, diff_diff/chaisemartin_dhaultfoeuille.py:L6299-L6316, diff_diff/chaisemartin_dhaultfoeuille.py:L6079-L6102, diff_diff/chaisemartin_dhaultfoeuille.py:L6434-L6447, diff_diff/chaisemartin_dhaultfoeuille.py:L7852-L7886, and docs/methodology/REGISTRY.md:L643-L679.

Code Quality

No findings. Docs-only patch.

Performance

No findings. No runtime path changed.

Maintainability

No findings. The previous stale hard-coded gate reference in the paths_of_interest changelog entry was replaced with a textual description that matches the current fit() gate at diff_diff/chaisemartin_dhaultfoeuille.py:L1216-L1268.

Tech Debt

No findings. The PR does not introduce new deferred work that needs TODO.md.

Security

No findings.

Documentation/Tests

No findings. The patch removes the specific docs drift called out in the prior review, and the lack of test changes is reasonable for a documentation-only PR.

…ps + stale by_path gate list in CHANGELOG Holistic re-audit of merged igerber#408 (compose `by_path` × `survey_design` Wave 4) + igerber#424 (post-merge docs-drift cleanup). Per-PR CI on igerber#424 couldn't see the combined post-PR holistic state. Local agentic codex review surfaced 2 sibling-surface test gaps + 1 real `[Unreleased]` CHANGELOG drift. **Sibling-surface coverage**: igerber#408's Wave-4 PR shipped replicate-weight regressions for `by_path` (`test_per_path_replicate_se_finite`, `test_per_path_inference_refreshes_to_lower_final_df`) but the parallel `paths_of_interest` selector only had analytical / gate / unobserved-path tests under survey. Both selectors share the same `_compute_path_effects` and `_compute_path_placebos` IF code path, so the test gap was a selector-symmetry oversight, not a methodology gap. Added: - `test_paths_of_interest_replicate_weight_per_path_se_finite` — pins finite per-horizon SE under Rao-Wu (JK1) AND the `_refresh_path_inference` contract (every per-path entry's `t_stat` matches `safe_inference` at the FINAL `df_survey`, not the per-path snapshot from before replicate-weight fits appended to the shared `_replicate_n_valid_list`). - `test_paths_of_interest_survey_design_placebo_replicate_weight` — same invariants on the `_compute_path_placebos` branch. **CHANGELOG drift**: the original `[Unreleased]` `by_path` entry (added when by_path first shipped) said `trends_linear`, `trends_nonparam`, `heterogeneity`, `design2`, `honest_did`, and `survey_design` all raise `NotImplementedError`. Each subsequent gate-lift PR shipped its own `[Unreleased]` entry, but none of them went back to update this original entry's stale gated-features list. Users reading the changelog in order get contradictory upgrade guidance. Rewrote the gates list to reflect actual current state: only `design2` + `honest_did` remain gated. Pre-existing single-CHANGELOG-cycle hygiene gap, surfaced by the igerber#408 holistic audit but applies independently of any specific subsequent PR. Holistic pilot finding NOT addressed: phantom `heterogeneity was composed in` claim in igerber#424's CHANGELOG. That's correct in real main (igerber#412 lifted the heterogeneity gate), but appears as a code/doc mismatch in the pilot because pilot construction (per `feedback_holistic_pilot_true_merge_cherry_pick_pitfall`) is `igerber#408 + igerber#424 deltas only` and doesn't include intermediate sibling PRs like igerber#412. This is a structural limitation of the strict-delta pilot pattern — no fix-PR action needed in main.

igerber added the ready-for-ci Triggers CI test workflows label May 13, 2026

igerber merged commit 73c2391 into main May 13, 2026
31 of 32 checks passed

igerber deleted the fix-audit-408 branch May 13, 2026 23:02

igerber mentioned this pull request May 14, 2026

Fix #408 holistic audit residuals: sibling-surface replicate-weight test gaps + stale by_path gate list #435

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Address residual P3 docs drift from re-audit of PR #408#424

Address residual P3 docs drift from re-audit of PR #408#424
igerber merged 2 commits into
mainfrom
fix-audit-408

igerber commented May 13, 2026

Uh oh!

github-actions Bot commented May 13, 2026

Uh oh!

github-actions Bot commented May 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

igerber commented May 13, 2026

Summary

Test plan

Uh oh!

github-actions Bot commented May 13, 2026

Uh oh!

github-actions Bot commented May 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant