
Try simple-minded call expression cache #19505


Merged
merged 6 commits into python:master from call-expr-cache
Jul 28, 2025

Conversation

ilevkivskyi
Member

This gives a modest 1% improvement on self-check (compiled), but almost 40% on mypy -c "import colour". Some comments:

  • I only cache CallExpr, ListExpr, and TupleExpr. This is not very principled; I found it to be the best balance between rare cases like colour and more common cases like self-check.
  • Caching is fragile within lambdas, so I simply disable it there; it rarely matters anyway.
  • I cache both messages and the type map. Surprisingly, the latter only affects a couple of test cases, but I still do it generally for peace of mind.
  • It looks like there are only three things that require cache invalidation: binder, partial types, and deferrals.

In general, this is a bit scary (as this is a major change), but the performance improvements for slow libraries are very tempting.
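The bullets above can be sketched roughly as follows. This is a minimal illustration of the caching idea, not mypy's actual implementation; all names here are hypothetical, and the key/value shapes are simplified (mypy caches the inferred type map and messages, keyed by expression and type context).

```python
from __future__ import annotations


class ExprCache:
    """Sketch: (expression identity, type context) -> (inferred type, messages)."""

    def __init__(self) -> None:
        self._cache: dict[tuple[int, str], tuple[str, list[str]]] = {}

    def invalidate(self) -> None:
        # Per the PR description, binder changes, partial types, and
        # deferrals are the three things that require invalidation.
        self._cache.clear()

    def get(self, expr_id: int, context: str) -> tuple[str, list[str]] | None:
        # A hit is only valid for the exact same type context.
        return self._cache.get((expr_id, context))

    def put(self, expr_id: int, context: str, result: tuple[str, list[str]]) -> None:
        self._cache[(expr_id, context)] = result
```

Keying on the type context is what makes it safe to reuse a result when the same expression is re-checked during repeated generic inference passes.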


@ilevkivskyi
Member Author

Nice! Interpreted colour in mypy_primer got 3x faster.

@ilevkivskyi
Member Author

Hm, both changes in the primer look suspicious, however; I will take a look at them later.

# Deeply nested generic calls can deteriorate performance dramatically.
# Although in most cases caching makes little difference, in worst case
# it avoids exponential complexity.
# TODO: figure out why caching within lambdas is fragile.
Collaborator


I also discovered that in #19408. Accepting a lambda has an explicit type-context dependency (infer_lambda_type_using_context), so generic calls really do require re-accepting the lambda every time; context from an outer generic may have propagated later.
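The hazard described above can be shown with a deliberately naive cache. This is an illustration only (the function and cache names are made up): if the cache key omits the type context, the first inference of a lambda "wins" even when an outer generic call later supplies a different context.

```python
# Deliberately buggy cache: the key ignores the type context.
_lambda_cache: dict[int, str] = {}


def infer_with_bad_cache(expr_id: int, context: str) -> str:
    # First inference is recorded; later calls with a *different* outer
    # context silently get the stale result back, which is why lambdas
    # must be re-accepted rather than cached.
    if expr_id not in _lambda_cache:
        _lambda_cache[expr_id] = f"inferred under {context}"
    return _lambda_cache[expr_id]
```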

Member Author


OK, thanks for the pointer. I will update the comment.

@@ -931,7 +932,8 @@ def prefer_simple_messages(self) -> bool:
         if self.file in self.ignored_files:
             # Errors ignored, so no point generating fancy messages
             return True
-        for _watcher in self._watchers:
+        if self._watchers:
Collaborator


Could you explain this change? Watchers used to be additive and that sounded reasonable to me...

Member Author


Previously, if any of the active watchers was ignoring errors, we could use simpler messages, but in the presence of caching this is no longer valid. For example, we can accept an expression while there is an enclosing ignoring watcher; the caching watcher will then record the simple message, and if next time we happen to accept the same expression in the same type context, but without the ignoring watcher, an incorrect (i.e. far too terse) error message will be pulled from the cache.

Without this change, 6 tests fail because terse/simplistic error messages are used.
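A rough model of that failure mode, with made-up names (not mypy's actual ErrorWatcher machinery): a terse message generated under an error-ignoring watcher gets cached, and a later replay without that watcher returns the terse message where a detailed one was expected.

```python
def make_message(simple: bool) -> str:
    # Stand-in for mypy's fancy vs. simple message generation.
    return "error" if simple else "error: detailed explanation"


def check_expr(active_watchers_ignore: list[bool], cache: dict[str, str], key: str) -> str:
    if key in cache:
        # Replay from cache, ignoring the *current* watcher stack.
        return cache[key]
    # Old logic: any error-ignoring watcher permitted a terse message.
    msg = make_message(simple=any(active_watchers_ignore))
    cache[key] = msg
    return msg
```

This is why the check was changed from "any watcher ignores errors" to "any watchers exist at all": a cached message must be valid in every context it can be replayed in.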

# Although in most cases caching makes little difference, in worst case
# it avoids exponential complexity.
# TODO: figure out why caching within lambdas is fragile.
elif isinstance(node, (CallExpr, ListExpr, TupleExpr)) and not (
Collaborator


Would it be difficult to allow dicts and sets here? Inline dictionaries are relatively common and even heavier than lists, and sets could be added just for consistency.

Also, operator expressions can be really heavy (#14978) and are fundamentally similar to CallExpr; are they worth considering?

Member Author


The problem with dicts/sets is that I see around a 0.3% regression on self-check when I add them (though maybe this is just noise). My reasoning is that most code has a bunch of shallow dictionaries, and for those caching is just busy-work that will never pay off (note that caching is not free, since mypyc is slow at creating local type maps and watchers).

Anyway, I am open to considering more expression kinds to cache, but let's put those in separate PR(s).

@ilevkivskyi
Member Author

OK, it looks like I found the problem with mitmproxy; it was a pre-existing bug in multiassign_from_union. I will push a fix in a minute.

Contributor

Diff from mypy_primer, showing the effect of this PR on open source code:

werkzeug (https://github.com/pallets/werkzeug)
+ src/werkzeug/datastructures/structures.py:193: error: Generator has incompatible item type "tuple[Union[K, Any], Union[list[V], list[Any]]]"; expected "tuple[K, V]"  [misc]

@ilevkivskyi
Member Author

OK, the two previous changes in mypy_primer are fixed, and the new error is correct; it was previously hidden by the bug in multiassign_from_union.

@sterliakov
Collaborator

LG! I'm still a bit scared to approve this now; I will take another look at it tomorrow. :)

@ilevkivskyi
Member Author

@sterliakov Did you have time for another look?

Btw, the comparison with current master went below noise level, probably because of #19515. But I also played with some "micro-benchmarks" involving deeply nested calls, and this PR gives performance improvements on the order of 100x (no exaggeration). If there are no concrete suggestions, I think we should give this a try.
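The exact micro-benchmarks are not in the PR, but the kind of input that triggers the worst case is easy to sketch: deeply nested generic calls, where each level of nesting can force re-inference of the whole subtree without caching. The helper below (illustrative, not from the PR) generates such an expression for feeding to mypy -c.

```python
from typing import TypeVar

T = TypeVar("T")


def wrap(x: T) -> T:
    # A trivial generic identity function; inferring T at every nesting
    # level is what makes deep nesting expensive for the type checker.
    return x


def nested_call_source(depth: int) -> str:
    # Produce `wrap(wrap(...wrap(0)...))`, e.g. for `mypy -c "<source>"`.
    return "wrap(" * depth + "0" + ")" * depth
```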

Collaborator

@sterliakov sterliakov left a comment


Yeah, sorry, I think this is good to go!

comparison with current master went below noise level

That sounds OK: mypy itself isn't generic-heavy, so it shouldn't demonstrate huge improvements. The colour changes speak for themselves. :)

@ilevkivskyi ilevkivskyi merged commit bd94bcb into python:master Jul 28, 2025
20 checks passed
@ilevkivskyi ilevkivskyi deleted the call-expr-cache branch July 28, 2025 17:18