feat: share `SessionPool` instance between crawlers by barjin · Pull Request #3499 · apify/crawlee

barjin · 2026-03-18T10:52:08Z

BasicCrawler and subclasses now accept sessionPool option for passing a pre-initialized SessionPool instance. This is mutually exclusive with sessionPoolOptions.

SessionPool instances now do not share the persistence keys (similar to e.g. Statistics).

Closes #3445

Each SessionPool now gets an auto-incrementing `id` used to derive a unique `persistStateKey` (`SDK_SESSION_POOL_STATE_{id}`), preventing multiple instances from overwriting each other's persisted state. An explicit `id` or `persistStateKey` can still be provided to control persistence across restarts. BasicCrawler forwards its crawler ID to the SessionPool when available.

Adds a `sessionPool` option to BasicCrawlerOptions that accepts an already-initialized SessionPool, allowing multiple crawlers to share the same pool. When a shared pool is injected, the crawler skips creating a new one and does not tear it down or reset its store — the caller owns its lifecycle. Mutually exclusive with `sessionPoolOptions`.

Covers: using a shared pool instance, verifying it's not torn down by the crawler, sharing across sequential crawlers, and the mutual exclusivity guard with sessionPoolOptions.

Copilot

Pull request overview

This PR improves session pool isolation by default (unique persistence keys per pool) and adds support for injecting an existing SessionPool into BasicCrawler to enable session sharing across crawlers.

Changes:

Add SessionPoolOptions.id and change the default persistStateKey format to include the pool id for isolation.
Add BasicCrawlerOptions.sessionPool (mutually exclusive with sessionPoolOptions) and ensure injected pools are not reset/teardown by the crawler.
Update/add tests to cover persistence isolation and injected/shared session pool behavior.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

File	Description
`packages/core/src/session_pool/session_pool.ts`	Introduces `id` and derives a unique default `persistStateKey` per pool instance.
`packages/basic-crawler/src/internals/basic-crawler.ts`	Allows passing an existing `SessionPool` and tracks ownership to avoid tearing down shared pools.
`test/core/session_pool/session_pool.test.ts`	Adjusts persistence tests for the new persist key behavior and adds isolation coverage.
`test/core/crawlers/basic_crawler.test.ts`	Adds tests for accepting an injected `SessionPool`, lifecycle behavior, and sharing across crawlers.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

Fix "it's" → "its" typo in JSDoc, validate `sessionPool` option as `instanceof SessionPool`, add missing teardown in test, and assert actual session reuse in the shared pool test.

janbuchar · 2026-03-18T16:57:08Z

+                    throw new Error(
+                        'Cannot use both `sessionPool` and `sessionPoolOptions` — pass either a pre-built SessionPool instance or options to create one, not both.',
+                    );


Can we just abolish sessionPoolOptions?

Possibly, but then we cannot determine whether the crawler owns the SessionPool instance or not... which means, no automatic teardown() (maybe not as problematic, as mentioned here), but also no automatic .resetStore() on crawler restart (which, maybe that's not as problematic either, who knows?)

Anyway, I tried dropping sessionPoolOptions and keeping only sessionPool in this commit.

Tbf, I didn't mind the sessionPoolOptions that much. Constructing the SessionPool explicitly each time shifts the responsibility to the user (see problems with resetStore and teardown above), which might feel burdensome.

I'm soft-leaning towards keeping both sessionPoolOptions (for BC and low-effort use-cases) and sessionPool (for SessionPool sharing and poweruser scenarios). Wdyt?

Okay, you made me check how the Python version does it. Key findings:

No session pool options, if you want to fine tune it, pass in a configured instance

~~init can be called multiple times, but the first teardown just... tears it down, which is not ideal 😁~~

EDIT init can only be called once, subsequent initialization attempts throw an error. There is also an active property on the session pool (and other components that require async init/teardown, e.g., Statistics). BasicCrawler.run checks this active property.

I think the Python way is the right direction - it takes care of the init/teardown logic for the user, without providing two ways to customize the session pool.

Also, can't we require Node.js 24+ so that we can use using and Symbol.dispose?

it takes care of the init/teardown logic for the user

I'm all for this, but we cannot safely call .teardown() / .resetStore() on a shared pool, potentially used by multiple crawler instances at once (which is what the original issue is asking for).

I made the SessionPool lazy-initialized, i.e., any call to its async methods checks whether it has been initialized - and runs init() if not.

The teardown calls are trickier, though. I'm now tracking whether this.sessionPool is owned by the crawler, or if it has been passed from the outside. .teardown() is then called only if it's owned. Imo this is the best compromise we can have here, but feel free to change my mind :)

Also, can't we require Node.js 24+ so that we can use using and Symbol.dispose?

I suppose we can implement Symbol.dispose to call teardown(), providing a more idiomatic interface for disposal, but I'm failing to see how it should help us with the shared pool instance 🤔

Since we're not using the Symbol.dispose approach anywhere else yet (and the implementation would be just calling this.teardown() anyway), I decided against adding it right now. Imo it would just prompt more questions (why does SessionPool have it and other classes do not?).

That being said, I'd be all for adding support for the using / Symbol.dispose API across the project in some later PR.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

This PR removes the static `SessionPool.open()` method. Until the `new SessionPool()` instance has been interacted with, it doesn't access any storage. Once the user (or another Crawlee component) calls one of the `SessionPool` async methods, the `SessionPool` is automatically initialized. This simplifies usage in e.g. `AdaptivePlaywrightCrawler` (where parts of the context pipeline are called separately before the `.initialize()` call), makes things clearer for users, and does away with potentially confusing temporal coupling (`SessionPool.initialize()` had to be called before any other operation, otherwise SessionPool would throw exceptions). Prerequisite for #3499 --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

janbuchar

Let's do this!

barjin added 3 commits March 18, 2026 11:25

test(basic-crawler): add tests for injected SessionPool

19a0f09

Covers: using a shared pool instance, verifying it's not torn down by the crawler, sharing across sequential crawlers, and the mutual exclusivity guard with sessionPoolOptions.

barjin self-assigned this Mar 18, 2026

barjin requested a review from Copilot March 18, 2026 10:52

Copilot started reviewing on behalf of barjin March 18, 2026 10:52 View session

Copilot AI reviewed Mar 18, 2026

View reviewed changes

Comment thread packages/core/src/session_pool/session_pool.ts Outdated

Comment thread packages/basic-crawler/src/internals/basic-crawler.ts

Comment thread test/core/crawlers/basic_crawler.test.ts

Comment thread test/core/crawlers/basic_crawler.test.ts

barjin added 2 commits March 18, 2026 12:20

fix: address PR review comments

e01e929

Fix "it's" → "its" typo in JSDoc, validate `sessionPool` option as `instanceof SessionPool`, add missing teardown in test, and assert actual session reuse in the shared pool test.

chore: run linter

ae88680

barjin requested review from janbuchar and l2ysho March 18, 2026 11:44

janbuchar reviewed Mar 18, 2026

View reviewed changes

barjin added 7 commits March 19, 2026 14:11

Merge branch 'v4' into feat/pass-session-pool

bb3c94f

chore: drop leading underscore

20d2b0d

feat: lazy-init SessionPool on method calls

33f10ff

Merge branch 'lazy-init-session-pool' into feat/pass-session-pool

cf968ed

feat: drop sessionPoolOptions, only allow sessionPool

778a97d

docs: drop sessionPoolOptions from examples

d05fe98

feat: reset / teardown own sessionPool

54b9598

vladfrangu reviewed Mar 20, 2026

View reviewed changes

Comment thread docs/upgrading/upgrading_v4.md

barjin added 2 commits March 23, 2026 16:33

feat: lazy-initialized SessionPool, drop SessionPool.open()

dff7332

chore: make initialize() protected

6b32a79

barjin mentioned this pull request Mar 23, 2026

feat: lazy-initialize SessionPool, drop SessionPool.open() #3513

Merged

barjin and others added 6 commits March 23, 2026 16:58

docs: add upgrading guide

c7c6b72

Update packages/core/src/session_pool/session_pool.ts

54061bd

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

perf: do not access KVS without persistenceOptions.enable

3a85cab

chore: make KVS access optional

a3c014f

chore: initialize SessionPool in BasicCrawler constructor

8956f19

Merge branch 'feat/lazy-init-session-pool' into feat/pass-session-pool

2bef436

barjin changed the base branch from v4 to feat/lazy-init-session-pool March 24, 2026 16:20

barjin added 2 commits March 24, 2026 17:27

chore: revert changes to fix failing tests

ef63ebe

Merge branch 'feat/lazy-init-session-pool' into feat/pass-session-pool

42a76d7

Base automatically changed from feat/lazy-init-session-pool to v4 March 24, 2026 16:36

Merge branch 'v4' into feat/pass-session-pool

c58abc5

barjin changed the title ~~feat: accept pre-initialized SessionPool instance~~ feat: share SessionPool instance between crawlers Mar 24, 2026

barjin added 2 commits March 25, 2026 13:34

docs: fix SessionPool examples

c583d58

chore: fix tests

013a7e8

barjin mentioned this pull request Mar 26, 2026

feat: allow binding a Session instance to a Request #3518

Merged

Merge branch 'v4' into feat/pass-session-pool

24d94d0

barjin requested review from janbuchar and vladfrangu April 16, 2026 12:10

janbuchar approved these changes Apr 16, 2026

View reviewed changes

barjin merged commit bd7943d into v4 Apr 17, 2026
6 checks passed

barjin deleted the feat/pass-session-pool branch April 17, 2026 13:32

barjin mentioned this pull request Apr 17, 2026

Allow sharing a SessionPool instance across multiple crawlers #3445

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: share `SessionPool` instance between crawlers#3499

feat: share `SessionPool` instance between crawlers#3499
barjin merged 26 commits into
v4from
feat/pass-session-pool

barjin commented Mar 18, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

janbuchar Mar 18, 2026

Uh oh!

barjin Mar 19, 2026

Uh oh!

barjin Mar 19, 2026

Uh oh!

janbuchar Mar 19, 2026 •

edited

Loading

Uh oh!

janbuchar Mar 19, 2026

Uh oh!

barjin Mar 20, 2026

Uh oh!

barjin Apr 16, 2026

Uh oh!

Uh oh!

Uh oh!

janbuchar left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

barjin commented Mar 18, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

janbuchar Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

barjin Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

barjin Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

janbuchar Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

janbuchar Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

barjin Mar 20, 2026

Choose a reason for hiding this comment

Uh oh!

barjin Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

janbuchar left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

janbuchar Mar 19, 2026 •

edited

Loading