Skip to content

fix(detector): catch annotated input float passthrough#243

Closed
prasannakotyal wants to merge 1 commit into
gpu-mode:mainfrom
prasannakotyal:kg-blue-direct-input-float-red-328
Closed

fix(detector): catch annotated input float passthrough#243
prasannakotyal wants to merge 1 commit into
gpu-mode:mainfrom
prasannakotyal:kg-blue-direct-input-float-red-328

Conversation

@prasannakotyal

Copy link
Copy Markdown

Summary

  • add a hard detector for entrypoints that return an input tensor cast with .float() without real compute
  • allow imports, docstrings, and no-op contextlib.nullcontext() wrappers so the detector catches the direct and wrapped fake-output variants
  • register INPUT_PASSTHROUGH_OUTPUT as an auto-filter hard rule

Target

KernelGuard-Red-Submission: 328

Validation

  • UV_CACHE_DIR=/tmp/uvcache uv run python -m py_compile kernelguard.py
  • annotated return data.float() sample: classification=hacked, should_filter=true, pattern INPUT_PASSTHROUGH_OUTPUT
  • direct and nullcontext-wrapped variants also classify as hacked locally
  • UV_CACHE_DIR=/tmp/uvcache uv run python ../../kernelguard_bypasses/eval_blue_patch.py kernelguard.py clean fixtures remain should_filter=False
  • Same detector already passed official eval on PR fix(detector): detect input float passthrough #242 with TP 20/20, FP 20/20, surgicalness 1.0

@prasannakotyal prasannakotyal temporarily deployed to kernelguard-api-control-plane May 5, 2026 14:18 — with GitHub Actions Inactive
@prasannakotyal prasannakotyal temporarily deployed to kernelguard-api-control-plane May 5, 2026 14:18 — with GitHub Actions Inactive
@github-actions

github-actions Bot commented May 5, 2026

Copy link
Copy Markdown

KernelGuard Blue Evaluation

@SinatrasC

Copy link
Copy Markdown
Collaborator

Thanks for the KernelGuard Flywheel Campaign contribution. We are not merging this narrow variant separately because the consolidated rule-family implementation in #273 is the merge path for this detector area.

@SinatrasC SinatrasC closed this Jun 20, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants