Skip to content

[pull] master from apify:master#203

Merged
pull[bot] merged 2 commits into
threatcode:masterfrom
apify:master
May 26, 2026
Merged

[pull] master from apify:master#203
pull[bot] merged 2 commits into
threatcode:masterfrom
apify:master

Conversation

@pull
Copy link
Copy Markdown

@pull pull Bot commented May 26, 2026

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

barjin and others added 2 commits May 26, 2026 17:30
#3673)

`BrowserPool.newPage()` serializes calls through `pLimit(1)`. `p-limit`
resumes queued callbacks in the previous task's AsyncLocalStorage
context, so a queued `newPage` call can inherit a sibling's aborted
`cancelTask` and its first `tryCancel()` throws `InternalTimeoutError` pre-emptively.

This fix explicitly passes the correct `AsyncLocalStorage` context into
the `pLimit` arrow function to ensure task isolation.

Closes #3670
The \`cheerio-enqueue-links-base\` and \`playwright-enqueue-links-base\`
actors crawled \`https://www.jamesallen.com/faq/\`, which was already
flaky and recently changed structure entirely — the cheerio variant has
been failing on every scheduled E2E run, and the playwright one was
already skipped as \`too flaky\`.

Replace the third-party dependency with a self-contained \`node:http\`
server (no extra deps) started inside each actor. The fixture serves a
\`/start\` page with \`<base href="/sub/">\` and a handful of relative +
one absolute link, plus the linked pages. The crawler enqueues with a
glob restricting to \`/sub/**\`, so base-href correctness determines
whether the relative links match and get enqueued — expected dataset
size is 4 (start + 3 sub pages), 1 if base-href handling is broken.
Verified the negative case by temporarily disabling the \`<base>\`
branch in \`extractUrlsFromCheerio\` and confirming the test fails.

Also unskips the playwright variant (its flakiness was the external
site) and drops \`proxyConfiguration\` since localhost shouldn't be
proxied.
@pull pull Bot locked and limited conversation to collaborators May 26, 2026
@pull pull Bot added the ⤵️ pull label May 26, 2026
@pull pull Bot merged commit 3773845 into threatcode:master May 26, 2026
@pull pull Bot had a problem deploying to github-pages May 26, 2026 16:19 Failure
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants