More Sway compiler optimizations #7093

xunilrj · 2025-04-16T18:13:44Z

Description

This PR is a prequel to #7015, trying to optimize the compiler to alleviate the hit of having more items, impl etc...

We have here two optimizations:
1 - We spend a lot of time counting newlines when converting byte offsets to LineCol. Now, we calculate line offsets just once, and use binary search later to find which line a byte offset is at.
2 - QueryEngine::get_programs_cache_entry was cloning the whole TyProgram. That is why did_change_with_caching was always getting worse, as the program was increasing in size. Now all compilation stages are behind Arc, which makes the clone free.

Analysis

Putting a dbg!(...) like the image below, and calling counts (https://github.com/nnethercote/counts).

cargo bench -- traverse 2>&1 | grep "sway-types/src/span.rs:29:9" | counts

I get the following results:

972102 counts
(  1)   156720 (16.1%, 16.1%): [sway-types/src/span.rs:29:9] self.pos = 0
(  2)    15900 ( 1.6%, 17.8%): [sway-types/src/span.rs:29:9] self.pos = 104
(  3)    15840 ( 1.6%, 19.4%): [sway-types/src/span.rs:29:9] self.pos = 107
(  4)     2280 ( 0.2%, 19.6%): [sway-types/src/span.rs:29:9] self.pos = 19281
(  5)     2280 ( 0.2%, 19.9%): [sway-types/src/span.rs:29:9] self.pos = 19285
(  6)     2280 ( 0.2%, 20.1%): [sway-types/src/span.rs:29:9] self.pos = 19287
(  7)     2280 ( 0.2%, 20.3%): [sway-types/src/span.rs:29:9] self.pos = 19292
(  8)     2280 ( 0.2%, 20.6%): [sway-types/src/span.rs:29:9] self.pos = 19323
(  9)     2280 ( 0.2%, 20.8%): [sway-types/src/span.rs:29:9] self.pos = 19327
( 10)     2280 ( 0.2%, 21.0%): [sway-types/src/span.rs:29:9] self.pos = 19329
( 11)     2280 ( 0.2%, 21.3%): [sway-types/src/span.rs:29:9] self.pos = 19334
( 12)      870 ( 0.1%, 21.4%): [sway-types/src/span.rs:29:9] self.pos = 4285
...

This means that line_col is being called 972k times. 16% is for position zero, which should be trivial. The rest will iterate the whole file source code to count number of lines. Making the real complexity of the work here something like O(qty * self.pos). And some values of self.pos are not trivial at all.

Checklist

I have linked to any relevant issues.
I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation where relevant (API docs, the reference, and the Sway book).
- If my change requires substantial documentation changes, I have requested support from the DevRel team
I have added tests that prove my fix is effective or that my feature works.
I have added (or requested a maintainer to add) the necessary Breaking* or New Feature labels where relevant.
I have done my best to ensure that my PR adheres to the Fuel Labs Code Review Standards.
I have requested a review from the relevant team or maintainers.

codspeed-hq · 2025-04-16T18:30:45Z

CodSpeed Performance Report

Merging #7093 will improve performances by ×140

_{Comparing xunilrj/sway-optimizations-2 (8fed389) with master (7cb7809)}

Summary

⚡ 11 improvements
✅ 11 untouched benchmarks

Benchmarks breakdown

	Benchmark	`BASE`	`HEAD`	Change
⚡	`did_change_with_caching`	520.9 ms	3.8 ms	×140
⚡	`traverse`	230.9 ms	48.3 ms	×4.8
⚡	`code_action`	28.3 ms	7.2 ms	×3.9
⚡	`completion`	21.1 ms	5.3 ms	×3.9
⚡	`document_symbol`	19.1 ms	3.1 ms	×6.1
⚡	`find_all_references`	47.4 ms	5.2 ms	×9.1
⚡	`highlight`	44.7 ms	5.8 ms	×7.7
⚡	`inlay_hints`	18.7 ms	2.2 ms	×8.6
⚡	`rename`	47.4 ms	5.2 ms	×9.1
⚡	`parent_decl_at_position`	18.8 ms	3.1 ms	×6.1
⚡	`tokens_at_position`	18.8 ms	3.1 ms	×6.1

JoshuaBatty

This is really neat. Amazing performance increase. Hats off!

tritao · 2025-04-17T14:56:55Z

Crazy speedup, amazing 🚀

Just to note that "compile" and "did_change_with_caching" steps on LSP benchmark seem to have gotten around 7-9% slower though, seems unlikely to have been caused by the changes though given the speedups?

xunilrj · 2025-04-17T20:30:27Z

Crazy speedup, amazing 🚀

Just to note that "compile" and "did_change_with_caching" steps on LSP benchmark seem to have gotten around 7-9% slower though, seems unlikely to have been caused by the changes though given the speedups?

I also managed to optimize "did_change_with_caching".

Voxelot · 2025-04-18T19:47:25Z

wut 🤯

xunilrj had a problem deploying to fuel-sway-bot April 16, 2025 18:14 — with GitHub Actions Error

xunilrj force-pushed the xunilrj/sway-optimizations-2 branch from 9c37556 to 103a146 Compare April 16, 2025 18:16

xunilrj temporarily deployed to fuel-sway-bot April 16, 2025 18:17 — with GitHub Actions Inactive

xunilrj self-assigned this Apr 16, 2025

xunilrj force-pushed the xunilrj/sway-optimizations-2 branch from 70210d0 to 54c0454 Compare April 16, 2025 21:16

xunilrj temporarily deployed to fuel-sway-bot April 16, 2025 21:16 — with GitHub Actions Inactive

xunilrj temporarily deployed to fuel-sway-bot April 16, 2025 21:26 — with GitHub Actions Inactive

xunilrj had a problem deploying to fuel-sway-bot April 16, 2025 23:16 — with GitHub Actions Error

xunilrj temporarily deployed to fuel-sway-bot April 16, 2025 23:19 — with GitHub Actions Inactive

xunilrj temporarily deployed to fuel-sway-bot April 17, 2025 00:23 — with GitHub Actions Inactive

xunilrj temporarily deployed to fuel-sway-bot April 17, 2025 00:46 — with GitHub Actions Inactive

This was referenced Apr 17, 2025

chore: bump to 0.67.1 #7090

Merged

Debug trait and its auto implementation #7015

Merged

JoshuaBatty temporarily deployed to fuel-sway-bot April 17, 2025 07:14 — with GitHub Actions Inactive

JoshuaBatty previously approved these changes Apr 17, 2025

View reviewed changes

xunilrj added 7 commits April 17, 2025 11:00

calculate line_col for Position just once

f75bf1f

span using Source with new lines map

7e66c00

fmt and clippy issues

dbfec35

fix line_col

e39cf58

fmt and clippy issues

ae000aa

fix Source serde post-deserialization

70d4006

better naming to differentiate zero and one based index

781f399

xunilrj added 2 commits April 17, 2025 15:13

fix Span and Source PartialOrd and Ord impls

419b767

better code for calc_line_starts

f33b676

xunilrj dismissed JoshuaBatty’s stale review via f33b676 April 17, 2025 18:25

xunilrj force-pushed the xunilrj/sway-optimizations-2 branch from b0109be to f33b676 Compare April 17, 2025 18:25

xunilrj temporarily deployed to fuel-sway-bot April 17, 2025 18:25 — with GitHub Actions Inactive

removing bytecount from deps

510d2de

xunilrj temporarily deployed to fuel-sway-bot April 17, 2025 18:30 — with GitHub Actions Inactive

eliminate TyProgram::clone when using cache

7b44fad

xunilrj temporarily deployed to fuel-sway-bot April 17, 2025 20:07 — with GitHub Actions Inactive

clippy and fmt issues

1204dac

xunilrj temporarily deployed to fuel-sway-bot April 17, 2025 20:18 — with GitHub Actions Inactive

xunilrj marked this pull request as ready for review April 17, 2025 22:07

xunilrj requested review from a team as code owners April 17, 2025 22:07

xunilrj enabled auto-merge (squash) April 17, 2025 22:08

Merge branch 'master' into xunilrj/sway-optimizations-2

f4c1ed0

JoshuaBatty temporarily deployed to fuel-sway-bot April 18, 2025 22:28 — with GitHub Actions Inactive

IGI-111 approved these changes Apr 21, 2025

View reviewed changes

Merge branch 'master' into xunilrj/sway-optimizations-2

8fed389

IGI-111 temporarily deployed to fuel-sway-bot April 21, 2025 10:49 — with GitHub Actions Inactive

tritao approved these changes Apr 21, 2025

View reviewed changes

sdankel approved these changes Apr 21, 2025

View reviewed changes

xunilrj merged commit f607a67 into master Apr 21, 2025
41 checks passed

xunilrj deleted the xunilrj/sway-optimizations-2 branch April 21, 2025 19:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

More Sway compiler optimizations #7093

More Sway compiler optimizations #7093

Uh oh!

xunilrj commented Apr 16, 2025 •

edited

Loading

Uh oh!

codspeed-hq bot commented Apr 16, 2025 •

edited

Loading

Uh oh!

JoshuaBatty left a comment

Uh oh!

tritao commented Apr 17, 2025 •

edited

Loading

Uh oh!

xunilrj commented Apr 17, 2025

Uh oh!

Voxelot commented Apr 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

More Sway compiler optimizations #7093

More Sway compiler optimizations #7093

Uh oh!

Conversation

xunilrj commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Analysis

Checklist

Uh oh!

codspeed-hq bot commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CodSpeed Performance Report

Merging #7093 will improve performances by ×140

Summary

Benchmarks breakdown

Uh oh!

JoshuaBatty left a comment

Choose a reason for hiding this comment

Uh oh!

tritao commented Apr 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xunilrj commented Apr 17, 2025

Uh oh!

Voxelot commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

xunilrj commented Apr 16, 2025 •

edited

Loading

codspeed-hq bot commented Apr 16, 2025 •

edited

Loading

tritao commented Apr 17, 2025 •

edited

Loading

Voxelot commented Apr 18, 2025 •

edited

Loading