
Benchmark LSP Event Execution Times for Rust Analyzer vs sway-lsp #4901

Closed
JoshuaBatty opened this issue Aug 2, 2023 · 3 comments

@JoshuaBatty (Member)

It would be beneficial to create identical projects in Sway and Rust, and then benchmark the time taken to execute each Language Server Protocol (LSP) event in rust-analyzer and sway-lsp.

This would give us a quantitative basis for understanding the performance differences between the two implementations. Identifying any significant discrepancies could lead to targeted optimization efforts.

@JoshuaBatty JoshuaBatty added the language server LSP server label Aug 2, 2023
@JoshuaBatty JoshuaBatty self-assigned this Aug 2, 2023
@JoshuaBatty (Member Author)

Ok, in order to compare the two language servers, I first used ChatGPT to create a large ~1700-line Rust project with lots of types and variable assignments. I then created a new Sway project and tweaked the file slightly so it would compile.

The rust and sway projects can be found here.

I then triggered each of the below LSP features 10 times in both sway-lsp and rust-analyzer and recorded the time it took to complete. The numbers below are the averaged result for each feature.

| Request Type | rust-analyzer | sway-lsp | Speed Difference | Notes |
|---|---|---|---|---|
| didChange | 305µs | 2270ms | 7443x | Due to sway-compiler |
| semanticTokens/full | 45µs | 2300ms | 51,111x | Blocks in sway-lsp until compilation is finished |
| hover | 115µs | 495ms | 4304x | |
| documentHighlight | 83µs | 1070ms | 12,892x | |
| definition | 128µs | 1050ms | 8203x | |
| prepareRename | 47µs | 3520ms | 74,894x | |
| rename | 54µs | 470ms | 8704x | |
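The measurement above (trigger each feature 10 times, record the elapsed time, average the results) can be sketched as a small timing harness. This is a minimal illustration, not the actual code used; the closure passed in stands in for a hypothetical request handler.

```rust
use std::time::{Duration, Instant};

/// Run `f` a fixed number of times and return the average wall-clock
/// duration. `f` is a stand-in for a request handler (hover, rename, ...).
fn average_duration<F: FnMut()>(runs: u32, mut f: F) -> Duration {
    let mut total = Duration::ZERO;
    for _ in 0..runs {
        let start = Instant::now();
        f();
        total += start.elapsed();
    }
    total / runs
}

fn main() {
    // Simulated "request" doing a small, fixed amount of work.
    let avg = average_duration(10, || {
        let _sum: u64 = (0..10_000).sum();
    });
    println!("average over 10 runs: {:?}", avg);
}
```

Wall-clock averaging like this is sensitive to machine noise, which is why the later comments switch to `criterion` for more statistically robust numbers.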

I was expecting to see such a large difference in didChange and semanticTokens/full because we compile the entire project and all of its dependencies on each keypress event, whereas rust-analyzer uses incremental compilation.

I wasn't expecting our requests to take such a long time though. I plan to look into how we can optimise these next.

@JoshuaBatty (Member Author) commented Aug 9, 2023

Ok, I set up a benchmarking framework using criterion so we can get more accurate results.
Here are the results for the LSP requests.

| Request Type | duration |
|---|---|
| hover | 442ms |
| documentHighlight | 441ms |
| definition | 433ms |
| prepareRename | 433ms |
| rename | 436ms |
| semantic_tokens | 464ms |
| document_symbol | 447ms |
| completion | 893ms |
| inlay_hints | 15ms |
| code_action | 444ms |
| code_lens | 34ms |
| on_enter | 572ms |

They all take roughly the same amount of time when unit testing each function in isolation, without taking the client <=> server interaction into consideration.

I decided to perform benchmarks on the functions defined on our internal TokenMap type as this is the main data structure we use in the server for performing the above requests. Here are the results.

| TokenMap method | duration |
|---|---|
| tokens_for_file | 10ms |
| idents_at_position | 438ms |
| tokens_at_position | 437ms |
| token_at_position | 439ms |
| parent_decl_at_position | 437ms |

It seems that if we can optimise these methods, those wins will bubble up to the LSP requests as well.
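The near-identical timings across the `*_at_position` methods are consistent with each lookup being a scan over the whole token collection. A minimal sketch of that access pattern, using simplified stand-in types (the real `TokenMap`, `Position`, and `Range` definitions differ):

```rust
use std::path::PathBuf;

/// Simplified stand-ins for the position/range types used by the real map.
#[derive(Clone, Copy, PartialEq, Eq, Debug)]
struct Position { line: u32, character: u32 }

#[derive(Clone, Copy, Debug)]
struct Range { start: Position, end: Position }

impl Range {
    fn contains(&self, pos: Position) -> bool {
        (pos.line > self.start.line
            || (pos.line == self.start.line && pos.character >= self.start.character))
            && (pos.line < self.end.line
                || (pos.line == self.end.line && pos.character <= self.end.character))
    }
}

/// Toy token map: every lookup is a linear scan over all tokens, which
/// would explain each *_at_position method costing roughly the same.
struct TokenMap {
    tokens: Vec<(PathBuf, Range, String)>,
}

impl TokenMap {
    fn token_at_position(&self, file: &PathBuf, pos: Position) -> Option<&String> {
        self.tokens
            .iter()
            .find(|(path, range, _)| path == file && range.contains(pos))
            .map(|(_, _, name)| name)
    }
}

fn main() {
    let map = TokenMap {
        tokens: vec![(
            PathBuf::from("src/main.sw"),
            Range {
                start: Position { line: 3, character: 4 },
                end: Position { line: 3, character: 9 },
            },
            "my_fn".to_string(),
        )],
    };
    let file = PathBuf::from("src/main.sw");
    println!("{:?}", map.token_at_position(&file, Position { line: 3, character: 6 }));
}
```

If every method pays the same full-scan cost, optimising the scan (or the per-token work inside it) lifts all the request timings at once, which matches the observation above.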

@xunilrj made a good suggestion about also serializing the responses to JSON so we can assess the "size" of each response.

JoshuaBatty added a commit that referenced this issue Aug 11, 2023
## Description
This PR adds a way to benchmark the LSP requests and `TokenMap` methods
using `criterion`. I've also added a large sway benchmark project that
can be used when running benchmarks now and in the future to measure
performance improvements.

Here is the result of the benchmarks. 

| Request Type | duration |
|---|---|
| hover | 442ms |
| documentHighlight | 441ms |
| definition | 433ms |
| prepareRename | 433ms |
| rename | 436ms |

| TokenMap method | duration |
|---|---|
| tokens_for_file | 230ms |
| idents_at_position | 438ms |
| tokens_at_position | 437ms |
| token_at_position | 439ms |
| parent_decl_at_position | 437ms |

related to #4901
JoshuaBatty added a commit that referenced this issue Aug 21, 2023
## Description
This PR speeds up all features in the language server, at the cost of a
small performance hit during AST traversal.

The first optimisation is the `Position` type in
`sway-types/src/span.rs`. This has been changed to take a reference
instead of using clones. The `line_col()` method was getting called a
lot in order to convert a span's start and end to the
`lsp_types::Range` type used in LSP. As such, we were
calculating this for every token in the language server when iterating
over the `TokenMap` for all of the features. Removing the clones led to
a 40% speed up.
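The shape of that change can be illustrated with a minimal `line_col`-style helper that borrows the source text rather than cloning it per call. This is a hypothetical simplification, not the actual `sway-types` implementation:

```rust
/// Compute a 1-indexed (line, column) pair for a byte offset.
/// Borrowing `src` means repeated calls (one per token) never
/// clone the source text.
fn line_col(src: &str, offset: usize) -> (usize, usize) {
    let mut line = 1;
    let mut col = 1;
    for (i, ch) in src.char_indices() {
        if i >= offset {
            break;
        }
        if ch == '\n' {
            line += 1;
            col = 1;
        } else {
            col += 1;
        }
    }
    (line, col)
}

fn main() {
    // Offset 3 lands on the 'c' of the second line.
    println!("{:?}", line_col("ab\ncd", 3)); // -> (2, 1)
}
```

Even with borrowing, this is still O(offset) per call, which is why the commit goes further and precomputes the range once per token (see `TokenIdent` below).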

The last optimisation introduces a new `TokenIdent` type. This is
similar to `sway_types::Ident` but holds data that is more useful for
LSP, specifically, a `PathBuf` and `Range`. We are now precomputing
these _during_ AST traversal and using the new `TokenIdent` as the key
into the `TokenMap`. This avoids needing to pass the source engine
around to convert `SourceId`'s to paths and also avoids needing to
continually compute the range more than once.
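A rough sketch of the `TokenIdent`-as-key idea follows. The field layout here is illustrative (the real type holds a `PathBuf` and an `lsp_types::Range`; the tuple range below is a simplification so the type can derive `Hash`):

```rust
use std::collections::HashMap;
use std::path::PathBuf;

/// Hypothetical simplified TokenIdent: the path and range are computed
/// once during AST traversal instead of on every request.
#[derive(Clone, PartialEq, Eq, Hash, Debug)]
struct TokenIdent {
    name: String,
    path: PathBuf,
    /// Simplified (start_line, end_line) pair standing in for a Range.
    range: (u32, u32),
}

fn main() {
    let mut token_map: HashMap<TokenIdent, &'static str> = HashMap::new();
    let ident = TokenIdent {
        name: "my_fn".into(),
        path: PathBuf::from("src/main.sw"),
        range: (3, 3),
    };
    // Later requests hash the precomputed ident directly: no SourceId ->
    // path conversion and no recomputing the range on lookup.
    token_map.insert(ident.clone(), "FunctionDeclaration");
    println!("{:?}", token_map.get(&ident));
}
```

The trade-off described in the commit falls out of this: the one-off cost moves into traversal (building each key), while every subsequent lookup becomes a plain hash-map access.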

I think the small performance hit we take moving this computation to the
traversal stage is worth the insane speed increases we get for all other
LSP features. I have a plan in #4959 to parallelize traversing the AST's
using the `rayon` crate in a follow up PR that overcomes this
performance regression.

Here is the result of running `cargo bench` with these optimisations
applied.

| Type | Previous Duration | Current Duration | Speedup | Improvement % |
|---|---|---|---|---|
| on_enter | 572ms | 567ns | 1008818.34x | 100.0% |
| code_lens | 34ms | 16.059µs | 2117.19x | 99.95% |
| idents_at_position | 438ms | 3.281ms | 133.50x | 99.25% |
| prepareRename | 433ms | 6.635ms | 65.26x | 98.47% |
| token_at_position | 439ms | 6.76ms | 64.94x | 98.46% |
| definition | 433ms | 6.726ms | 64.38x | 98.45% |
| hover | 442ms | 7.389ms | 59.82x | 98.33% |
| document_symbol | 447ms | 7.49ms | 59.68x | 98.32% |
| semantic_tokens | 464ms | 24.48ms | 18.95x | 94.72% |
| completion | 893ms | 51.03ms | 17.50x | 94.29% |
| parent_decl_at_position | 437ms | 42.49ms | 10.28x | 90.28% |
| tokens_at_position | 437ms | 42.87ms | 10.19x | 90.19% |
| code_action | 444ms | 56.97ms | 7.79x | 87.17% |
| documentHighlight | 441ms | 115ms | 3.83x | 73.92% |
| rename | 436ms | 115.35ms | 3.78x | 73.54% |
| inlay_hints | 15ms | 9.75ms | 1.54x | 35.0% |
| compile_and_traverse | 1.795sec | 1.9715sec | 0.91x | -9.83% |
| tokens_for_file | 10ms | 11.37ms | 0.88x | -13.7% |

Related to: #4901
@JoshuaBatty (Member Author)

Closing this as we have benchmarking in the server now and the rust-analyzer speeds have been captured in this issue.
