Skip to content

fix(detector): detect direct self matmul output#255

Closed
prasannakotyal wants to merge 8 commits into
gpu-mode:mainfrom
prasannakotyal:kg-blue-direct-self-matmul-red-283
Closed

fix(detector): detect direct self matmul output#255
prasannakotyal wants to merge 8 commits into
gpu-mode:mainfrom
prasannakotyal:kg-blue-direct-self-matmul-red-283

Conversation

@prasannakotyal

Copy link
Copy Markdown

Summary

  • extend SELF_MATMUL_OUTPUT to direct self-matmul calls through torch.mm/torch.matmul aliases
  • detect data @ data.t() and equivalent mm(data, data.t()) entrypoint returns, including import torch as T aliases
  • preserve the existing narrow functools.partial(helper) self-matmul detection

Target

KernelGuard-Red-Submission: 283

Validation

  • UV_CACHE_DIR=/tmp/uvcache uv run python -m py_compile kernelguard.py
  • import torch as T; return T.mm(data, data.t()): classification=hacked, should_filter=true, pattern SELF_MATMUL_OUTPUT
  • from torch import mm as matmul; return matmul(data, data.t()): classification=hacked, should_filter=true, pattern SELF_MATMUL_OUTPUT
  • UV_CACHE_DIR=/tmp/uvcache uv run python ../../kernelguard_bypasses/eval_blue_patch.py kernelguard.py clean fixtures remain should_filter=False
  • Partial self-matmul detector passed official eval on PR fix(detector): detect partial self matmul output #254 with TP 20/20, FP 20/20, surgicalness 1.0

@prasannakotyal prasannakotyal temporarily deployed to kernelguard-api-control-plane May 5, 2026 14:53 — with GitHub Actions Inactive
@github-actions

github-actions Bot commented May 5, 2026

Copy link
Copy Markdown

KernelGuard Blue Evaluation

@SinatrasC

Copy link
Copy Markdown
Collaborator

Thanks for the KernelGuard Flywheel Campaign contribution. This PR is now superseded by the consolidated rule-family implementation in #273, which folds this detector coverage together with the related passing-eval variants.

@SinatrasC SinatrasC closed this Jun 20, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants