
perf(source): only filter up to 200 entries and dont use the cache #1574

Merged: 2 commits merged into hrsh7th:main on May 16, 2023

Conversation

@folke (Contributor) commented on May 15, 2023

This PR changes the way get_entries works, but semantically it behaves the same as before:

  • don't cache filtered entries; for Tailwindcss and other sources with very large completion lists, caching every filtered result leads to excessive memory usage
  • always calculate filtered items on the fly, but stop once the desired max_entries count is reached
  • only re-calculate filtered entries when the context changes
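
As a rough sketch only (not the actual nvim-cmp code; `source`, `ctx`, `matches`, and `get_all_entries` are assumed, illustrative names), the flow described above looks roughly like this:

local MAX_ENTRIES = 200 -- mirrors the 200-entry cap this PR introduces

local function get_filtered_entries(source, ctx)
  -- only re-calculate when the completion context has changed
  if source.filtered and source.filtered.ctx_id == ctx.id then
    return source.filtered.entries
  end

  local entries = {}
  for _, entry in ipairs(source:get_all_entries()) do
    if matches(entry, ctx.cursor_before_line) then
      entries[#entries + 1] = entry
      if #entries >= MAX_ENTRIES then
        break -- stop filtering once the desired number of entries is reached
      end
    end
  end

  -- keep only the latest result instead of caching one list per keystroke
  source.filtered = { ctx_id = ctx.id, entries = entries }
  return entries
end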

@folke marked this pull request as draft on May 15, 2023 15:06
@folke marked this pull request as ready for review on May 15, 2023 16:15
end
end
end
self.cache:set({ 'get_entries', tostring(self.revision), ctx.cursor_before_line }, entries)
hrsh7th (Owner)

I think the get_entries cache reduces the CPU cost. Why did you remove it?

folke (Contributor, author)

It leads to big memory issues with tailwind and probably other sources as well.

folke (Contributor, author)

I'll see if I can reintroduce the cache in some other way that avoids the memory overhead.

hrsh7th (Owner)

Currently, nvim-cmp saves a cache entry for every user input (f -> fo -> foo).
That helps the <BS> case, but I think we can avoid it (the <BS> use-case can be ignored).
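
As a hedged illustration of that growth pattern (with hypothetical `cache`, `revision`, `all_entries`, and `filter_entries` names, reusing the cache:set shape visible in the reviewed diff above):

-- illustration only: each keystroke stores its own filtered list in the cache
for _, typed in ipairs({ 'f', 'fo', 'foo' }) do
  cache:set({ 'get_entries', tostring(revision), typed }, filter_entries(all_entries, typed))
end
-- for sources with tens of thousands of items (e.g. tailwindcss), every one of
-- these cached lists can be large, so memory grows with each character typed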

folke (Contributor, author)

I just added caching back, but only when the filtered entries list is shorter than max_item_count.
Only then can we safely re-use the cache; otherwise we might miss entries that would have matched but were not included because of the early break.

This also still fixes the memory issue, since we no longer store overly large entry arrays in the cache.
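
A minimal sketch of that conditional cache write, reusing the cache:set call from the reviewed line above (the surrounding names are illustrative, not the exact nvim-cmp code):

-- only cache the filtered list when it is known to be complete, i.e. when the
-- early break at max_item_count did not truncate it; a complete list can safely
-- be re-used and narrowed further, a truncated one might be missing matches
if #entries < max_item_count then
  self.cache:set({ 'get_entries', tostring(self.revision), ctx.cursor_before_line }, entries)
end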

folke (Contributor, author)

I believe this solution is actually optimal. When the entry list is huge, we filter and break as soon as we have 200 matches.

Once the filter narrows the results to fewer than 200 matches, the cache is used and the big entry list no longer needs to be fully processed.

hrsh7th (Owner)

I agree that this fix improves performance.
It also helps that the current nvim-cmp implementation is already incorrect in this respect, so the change introduces no new negative behavior.

However, I have doubts about whether this approach is the right one.

That being said, I'm not planning any drastic improvements to nvim-cmp, so I agree with putting efficiency first.

end)()
if self.filtered.ctx and self.filtered.ctx.id == ctx.id then
  return self.filtered.entries
end
hrsh7th (Owner)

Very good. Thank you!

@yioneko (Contributor) commented on May 15, 2023

Just my small thought: the current implementation of limiting entries without pre-sorting is wrong and leads to suboptimal result candidates, and the early break in filtering seems to conflict with fixing that. In theory, all the entries should go through filtering to yield the best candidates. However, with the early break we gain a huge performance improvement, because the current fuzzy matching implementation is not efficient enough for some extreme cases. This might be a choice between correctness and performance.
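
A toy illustration of that correctness point, with made-up labels and scores (not nvim-cmp code):

local entries = {
  { label = 'foobar',  score = 10 },
  { label = 'foo_bar', score = 20 },
  { label = 'foo',     score = 90 }, -- best candidate, but appears last
}

-- limit-then-sort (what an early break effectively does, with a limit of 2):
local limited = { entries[1], entries[2] }
table.sort(limited, function(a, b) return a.score > b.score end)
-- result: foo_bar, foobar -- the best candidate 'foo' was never considered

-- sort-then-limit would keep 'foo', but it requires scoring every entry first,
-- which is exactly the work the early break tries to avoid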

@folke (Contributor, author) commented on May 15, 2023

@yioneko the current cmp implementation already does pre-filtering and only sorts the resulting entries.

But I do agree that it would be better to include your changes as well, so that we can remove that 200 limit.

Maybe you should also open a PR with your async changes?

@yioneko (Contributor) commented on May 15, 2023

@folke What I meant is that sorting should be done before limiting, to preserve the entries with higher expected relevance, and that actually conflicts with breaking early at max_item_count. The current implementation (not including this PR) is actually wrong.

> Maybe you should also open a PR with your async changes?

My changes are pretty hacky and should not be considered for merging. Also, I don't have much time to work on formalizing them at the moment. (I'd like to when I have time; I also have some other ideas for optimization.)

@hrsh7th (Owner) commented on May 16, 2023

@folke I can merge this for now. Do you have any work left to do?

@folke (Contributor, author) commented on May 16, 2023

Good to go!

I'll work on an async PR after this to further optimize the code.

@hrsh7th merged commit 768548b into hrsh7th:main on May 16, 2023
@hrsh7th (Owner) commented on May 16, 2023

Thank you~!

@bluz71 mentioned this pull request on May 23, 2023
williamboman added a commit to williamboman/nvim-cmp that referenced this pull request on May 26, 2023
…indow

* upstream/main: (31 commits)
  fix entry highlight in complete-menu (hrsh7th#1593)
  Remove max_item_count from source configuration
  feat: cmp async (hrsh7th#1583)
  ghost text inline (hrsh7th#1588)
  Fix demo video in README.md (hrsh7th#1585)
  docs: fix adjecent typo (hrsh7th#1577)
  docs: fix typos, add confirm.behavior example cfg (hrsh7th#1576)
  perf(source): only filter up to 200 entries and dont use the cache (hrsh7th#1574)
  fix(ghost_text): safely apply virtual_text highlight (hrsh7th#1563)
  Improve misc.merge
  Fix hrsh7th#897
  Added a modified=false to documentation buffer, so it can be removed without E89 errors (hrsh7th#1557)
  Fix hrsh7th#1556
  fix 1533, add regression test (hrsh7th#1558)
  Add `buftype=nofile` for entries_win and docs_win - reddit user mention this...
  Fix hrsh7th#1550
  Format with stylua
  Add test for hrsh7th#1552 Close hrsh7th#1552
  Revert hrsh7th#1534 temporaly
  fix typo (hrsh7th#1551)
  ...