fix: hash request kwargs and headers correctly #255
Conversation
I did a quick local check against this patch and the behavior matches the PR description:

```python
from scrapling.spiders.request import Request

assert Request("https://example.com", timeout=1).update_fingerprint(include_kwargs=True) != Request("https://example.com", timeout=2).update_fingerprint(include_kwargs=True)
assert Request("https://example.com", headers={"X-Test": "A"}).update_fingerprint(include_headers=True) != Request("https://example.com", headers={"X-Test": "a"}).update_fingerprint(include_headers=True)
```

One small suggestion: it would be worth adding these as regression tests.
Hey @samrusani, I went ahead and added tests. Thanks for the suggestions!
```python
kwargs = (key.lower() for key in self._session_kwargs.keys() if key.lower() not in ("data", "json"))
data["kwargs"] = "".join(set(_convert_to_bytes(key).hex() for key in kwargs))
filtered_kwargs = {
    key.lower(): str(value)
```
Using str(value) for kwarg values is fragile if someone passes a non-primitive.
Thanks! Just fixed this.
Good catch as always @yetval
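On the `str(value)` point above: one way to serialize arbitrary kwarg values deterministically is to try `json.dumps` with sorted keys and fall back to `repr()` for non-JSON-serializable objects. This is a standalone sketch of that idea, not the code actually merged in this PR; the function names are illustrative.

```python
import hashlib
import json


def _stable_value(value):
    """Serialize a kwarg value deterministically.

    json.dumps with sort_keys handles nested dicts/lists of primitives;
    default=repr covers anything json cannot serialize directly.
    """
    try:
        return json.dumps(value, sort_keys=True, default=repr)
    except (TypeError, ValueError):
        return repr(value)


def fingerprint_kwargs(kwargs):
    # Hash each key *together with* its value, so timeout=1 and
    # timeout=2 produce different fingerprints. Keys are sorted for
    # order-independence; "data"/"json" stay excluded as in the patch.
    h = hashlib.sha256()
    for key in sorted(kwargs):
        if key.lower() in ("data", "json"):
            continue
        h.update(key.lower().encode())
        h.update(_stable_value(kwargs[key]).encode())
    return h.hexdigest()
```

With this shape, a non-primitive value (say, a custom retry-policy object) still contributes a stable token via its `repr`, instead of raising or collapsing to an uninformative string.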
Summary
Request fingerprint collisions: `Request.update_fingerprint()` previously hashed only kwarg names and lowercased header values. Distinct requests could collapse to the same fingerprint when `fp_include_kwargs` or `fp_include_headers` were enabled, which can silently break scheduler deduplication, cache replay, and checkpoint restore. Fix: hash kwarg names together with their values and preserve header values as-is.

Repro
```python
from scrapling.spiders.request import Request

r1 = Request("https://example.com", timeout=1)
r2 = Request("https://example.com", timeout=2)
assert r1.update_fingerprint(include_kwargs=True) != r2.update_fingerprint(include_kwargs=True)
```
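The header side of the fix ("preserve header values as-is") can be sketched in isolation like this. This is an illustrative standalone function, not the PR's actual implementation: it assumes sha256 and follows the usual HTTP convention that header names are case-insensitive while values may be case-sensitive.

```python
import hashlib


def fingerprint_headers(headers):
    # Header *names* are case-insensitive in HTTP, so normalize them to
    # lowercase. Header *values* can be case-sensitive (tokens, etags),
    # so they are hashed exactly as given rather than lowercased.
    h = hashlib.sha256()
    for name in sorted(headers, key=str.lower):
        h.update(name.lower().encode())
        h.update(str(headers[name]).encode())
    return h.hexdigest()
```

Under this scheme `{"X-Test": "A"}` and `{"X-Test": "a"}` fingerprint differently, while `{"x-test": "A"}` and `{"X-Test": "A"}` do not, which matches the behavior the review comment verified.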
Files changed
scrapling/spiders/request.py — fingerprint kwargs/header handling