Persistent state #9143

intarga · 2023-12-22T21:27:36Z

Resolves part of #401, along with #9154

TODO LIST:

kirawi · 2023-12-22T22:26:53Z

Are you interested in working on buffer state, history, or both? I was going to get back to persistent undo this week, but if you intend to tackle it then I'll leave it to you.

kirawi · 2023-12-22T22:30:29Z

However, if this is tackling persistent undo as well, then I believe @pascalkuthe wanted to avoid using serde.

intarga · 2023-12-22T23:18:14Z

Hey @kirawi! This is not tackling undo, I looked at your PR and I don’t think there’s any overlap. I’m going to make a detailed post for discussion on the issue once I have a working protype, but I’ll write up some of my findings here now in case they’re helpful:

I looked into Neovim’s ShaDa (one single file in MessagePack format), following archseer’s suggestion. It’s used to persist most of what we want except notably undo history, and there seem to be some good reasons for that. For one thing, undo histories get big very quickly, and if we want to save a significant amount of history for a significant number of files, they will bloat the shada file probably more than we want. Neovim instead seems to have one undo file per edited file, which seems like a better approach, though I couldn’t find any info on the details of the format for their undofiles.

From this though, I think it makes sense for persistent undo to be implemented separately from the rest.

pascalkuthe · 2023-12-23T01:14:37Z

So the problem I have with serde is that it doesn't allow incremental or streaming serialization and deserialization.

I think for undofile having that would be nice but not a hard requirement. (N)vim has a pretty simple file format and simply fully writes/reads the undofile whenever it writes/reads the file.

I bieve we could do better by changing the format a bit so we only append whenever we save (and make it easy to evict old revsions). But I am not set on that. If we comeup with a scheme to instead evict in memory, keep the undofile fairly small writing on each save (just like vim) may be ok and then we could use serde/bincode (very fast and mature serialization format).

For undofiles the ultimate question is probably garbage collection. How do we keep the u dofile from growing forever.

For session data/something like shada in vim I do like the idea of doing something similar vht here again append only serialization and streaming deserialization are important.

That wknt work well with serde. The pvject headers are simple e ought to write your own parser (no need for msgpack).

They are fully seldescibing and describe the size of some arbitrary data that follows the header. We can read that data and deserialize with serde (And don't need msgpack I prefer bincode).

intarga · 2023-12-23T15:16:23Z

I have no particular attachment to serde or messagepack, so I'll happily switch to bincode.

For session data/something like shada in vim I do like the idea of doing something similar vht here again append only serialization and streaming deserialization are important.

I'm not sure I agree about "append only". I see the benefit that it would make simple writes faster, but at the cost of flexibility and ease in reading and trimming. The design of shada in nvim seems to at least initially have been motivated by append-only writes, which is why it's a concat of msgpack objects instead of an array, but it seems like they ultimately decided not to follow through with this and instead they merge with the existing file when writing. While merging is presumably more expensive than appending, it has some notable advantages:

Entries from different contemporary sessions can be shown in chronological order, instead of the order their sessions quit.
Entry types where it only makes sense to have a single entry (clipboard contents and split layout, perhaps) can avoid duplication, and can choose the last entry chronologically.
Length limits for different entry types can be enforced separately (i.e. limit oldfiles to 500 entries, but command history to 1000).
Spamming entries of one type does not risk evicting all entries of another (i.e. if I send a ton of commands, I don't risk losing all my oldfiles).
Entries of the same type can stay clustered, which makes reading easier and more flexible.
There's a straight forward answer to when and how to trim the file. If we go append-only, it's not clear what to do. If we're enforcing a limit of X entries, then surely that means we have to trim every time we write. Unless I'm missing something, trimming will have to involve at least partly reading and deserialising the file so we can figure out what to remove. If we're reading the file every time we write, I'm not sure I see a meaningful benefit to append-only over merging.

Some, but not all, of these problems could be solved by having dedicated files for different entry types, but then you have to write a bunch of files instead of just one 🤷‍♀️

pascalkuthe · 2023-12-28T16:41:47Z

I personally always thought that having multiple files would be the way to go. Especially for command history that should work well. That could work more or less the same as zsh history (it's 3xactly the same concept). I don't really think it's a problem having multiple datafiles.

I think a big advantage of appendonly is that you avoid frequently writing large amounts of data to disk. It's not a huge deal but reducing background io is nice.

For other files like registers I agree that it doesn't make sense to have appendonly files. But I would just keep these entirely separate.

I never understood why nvim went with a single file model it seems to make everything (including the merging) much more complicated without much tangiböe benefit.

intarga · 2023-12-28T16:54:51Z

Ok, I'll have a go at the multi-file approach then 👍

gyreas · 2024-01-02T08:56:15Z

If you need an alternative name to ShaDa, can you consider Hexion (Helix session)?

It's tongue-in-check tho

helix-loader/src/session.rs

helix-view/src/register.rs

helix-loader/src/session.rs

intarga · 2024-02-09T18:27:27Z

@the-mikedavis When opening files with their saved position, should the view be aligned to centre?

Although I would personally prefer to not align, I noticed that helix aligns to centre even on buffer switches, so I'm assuming that's the convention, and in that case we only need to persist the selection, not the view. Should I stick with that? Should alignment be configurable?

gabydd · 2024-02-09T18:56:33Z

I think the aligning when switching might actually be a bug, cause it can cause a lot of shifting when doing something like ga (going back and forth to the last accessed buffer)

intarga · 2024-02-09T19:02:15Z

@gabydd I also don't like that behaviour, though I don't think it's a bug. The code for it looks pretty intentional, and vim notably seems to behave the same

gabydd · 2024-02-09T19:03:21Z

Ah okay I'll try to take a better look when I have the time

intarga · 2024-02-09T19:12:27Z

    fn replace_document_in_view(&mut self, current_view: ViewId, doc_id: DocumentId) {
        let view = self.tree.get_mut(current_view);
        view.doc = doc_id;
        view.offset = ViewPosition::default();

        let doc = doc_mut!(self, &doc_id);
        doc.ensure_view_init(view.id);
        view.sync_changes(doc);
        doc.mark_as_focused();

        align_view(doc, view, Align::Center);
    }

I'm not completely sure, but I think this is the relevant function, and that alignment looks pretty intentional.

If I get approval I would very much like to change this behaviour. Though I think to remove the shifting on buffer switched we might have to persist ViewPosition information in Document, since it currently only seems to have information about the selections.

the-mikedavis · 2024-02-10T14:47:51Z

There was a PR about this that I think is not yet finished and could probably be picked up and brought across the finish line if you're interested: #7414

I would prefer that we save the View's offset (ViewPosition) and use it if possible, centering only if the cursor wouldn't be visible.

intarga · 2024-02-10T17:01:23Z

I would gladly take that on! It seems though that the work might already be done. I found this PR #7568 by the same author which seems to address the issues raised in that thread, and is waiting on review. The author mentions an unresolved issue, but looking at the code, I think that was just a misunderstanding.

helix-view/src/editor.rs

useful in the case of bare git repos, where the git dir is not always named .git, and so the previous exclusion wouldn't catch it.

intarga · 2024-12-23T13:45:02Z

@useche Apologies for the late response.

I was having an issue (which I don't remember) with the fact that they were initialized so early. In any case, even now some reasons come to mind: (1) I'm not a fan of global variables and the persistent state location is something that only the persistent code cares about, (2) it's nice to have related code close by, (3) the persistent state location might now change with :reload-config and I didn't feel that comfortable changing the global variables at this point. I see that they should be synchronized, though, so that technically shouldn't be a problem. Let me know your thoughts.

My opposition to this was only that I think we should try to stick to existing conventions where possible. (3) seems like a good reason to break the convention though.

I'm not sure whether we should eventually mix those with this pull request or rather wait until this is checked in and then create a new pull request with those. Let me know if you have an opinion about that.

I think it would be best to have split persistence in a follow on PR to make things easier for reviewers, as this one is already quite big.

Continuing our conversation, I implemented a new way to set the persistence options. I wanted to simplify it based on the conversations we had. Now my configuration looks like:
[editor.persistence]
all.enabled = true
all.max-entries = 1000
all.scope = "per-workspace"
autostart-splits = true
The option all will be the default for all the specific options. Specific options (like "search" or "commands") can be changed for more specific options. The patch is in the top of my github tree useche/helix. Let me know what you think.

I'm not sure I see the point in all.max-entries since the content of the entries is so different. I also don't think all.enabled and all.scope should be separate; I would prefer all = "global"|all = "workspace"|all = "off". And perhaps default would be a more intuitive name than all.

On another note, it would be good to have integration testing of per-workspace persistence

intarga · 2024-12-23T13:46:07Z

@Axlefublr It's rebased on master now

Axlefublr · 2024-12-23T13:52:46Z

@intarga thank you so much! and what wonderful timing, I was just in the process of git magicking my fork to add a new feature while being on an older commit; now I can just rebase on master like normal :D

ThanHenderson · 2025-01-28T20:37:20Z

Thanks a lot for this. Is there a timeline to merge here?

intarga · 2025-01-28T20:43:37Z

Thanks a lot for this. Is there a timeline to merge here?

It's up to the maintainers now to review it. I have no idea where this fits on their priority list, I imagine they have a lot on their plates.

Axlefublr · 2025-02-14T00:47:43Z

can you please rebase on master? there are quite a few merge conflicts, that I couldn't figure out how to handle myself

gabydd · 2025-02-14T01:30:34Z

I have it merged with master here gabydd@eca36b7

Axlefublr · 2025-02-14T01:35:01Z

@gabydd thank you! I ended up figuring things out though. however the more rarely a pr is rebased on master, the more convoluted trying to figure things out is going to be, so that's the main reason I ask

intarga force-pushed the persistent_state branch from d22bfcd to 4987e1a Compare December 28, 2023 16:40

intarga force-pushed the persistent_state branch from c3c5be0 to 9d53156 Compare December 30, 2023 16:28

kirawi mentioned this pull request Jan 2, 2024

Persistent State (session) #401

Open

the-mikedavis reviewed Jan 2, 2024

View reviewed changes

helix-loader/src/session.rs Outdated Show resolved Hide resolved

helix-view/src/register.rs Outdated Show resolved Hide resolved

the-mikedavis added C-enhancement Category: Improvements S-experimental Status: Ongoing experiment that does not require reviewing and won't be merged in its current state. labels Jan 2, 2024

intarga commented Jan 2, 2024

View reviewed changes

helix-loader/src/session.rs Outdated Show resolved Hide resolved

intarga force-pushed the persistent_state branch 3 times, most recently from 7e98d28 to 45b7c16 Compare February 13, 2024 15:03

intarga force-pushed the persistent_state branch from a029c78 to cb48389 Compare February 19, 2024 19:06

intarga mentioned this pull request Apr 22, 2024

Restore view offset when switching buffers with multiple windows #7568

Closed

intarga force-pushed the persistent_state branch 2 times, most recently from 7a5d8ff to 3e13a61 Compare May 1, 2024 19:02

Ordoviz reviewed Jul 19, 2024

View reviewed changes

helix-view/src/editor.rs Outdated Show resolved Hide resolved

intarga added 21 commits December 23, 2024 13:48

fix quirky file persistence behaviour

db766ed

fix integration tests

25778b1

save cloning by passing by ref to persistence functions

393390f

persist clipboard

4efee37

add on/off config options for persistence

599010d

fix bug: writes on untruncated histfiles

bb20c0e

trim persistence files

7bf7624

add config option to exclude files form old_file_locs

0009e15

add trim config options for persistence

286bc85

add command to reload history

091c210

fix rebase breakage

41f61a1

add .*/COMMIT_EDITMSG to persistent file exclusions

1cc2299

useful in the case of bare git repos, where the git dir is not always named .git, and so the previous exclusion wouldn't catch it.

default to <cache dir>/helix/state if state dir is None

f1af509

run docgen

fa63147

split persistence config options into own struct

45cfed3

add documentation for persistent state

7eb4183

only trim persistent state files if persistent state is enabled

24b8029

add integration test for persistent state

c60ad89

avoid repeated loading of config to check persistence config in startup

3933bf5

address hanging TODOs

d407d3d

fix line feed handling in integration test for windows

ea5d2df

intarga force-pushed the persistent_state branch from 37ee3b3 to ea5d2df Compare December 23, 2024 13:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Persistent state #9143

Persistent state #9143

intarga commented Dec 22, 2023 •

edited

Loading

kirawi commented Dec 22, 2023

kirawi commented Dec 22, 2023

intarga commented Dec 22, 2023

pascalkuthe commented Dec 23, 2023

intarga commented Dec 23, 2023

pascalkuthe commented Dec 28, 2023

intarga commented Dec 28, 2023

gyreas commented Jan 2, 2024

intarga commented Feb 9, 2024

gabydd commented Feb 9, 2024

intarga commented Feb 9, 2024

gabydd commented Feb 9, 2024

intarga commented Feb 9, 2024 •

edited

Loading

the-mikedavis commented Feb 10, 2024

intarga commented Feb 10, 2024

intarga commented Dec 23, 2024 •

edited

Loading

intarga commented Dec 23, 2024

Axlefublr commented Dec 23, 2024

ThanHenderson commented Jan 28, 2025

intarga commented Jan 28, 2025

Axlefublr commented Feb 14, 2025

gabydd commented Feb 14, 2025

Axlefublr commented Feb 14, 2025

Persistent state #9143

Are you sure you want to change the base?

Persistent state #9143

Conversation

intarga commented Dec 22, 2023 • edited Loading

kirawi commented Dec 22, 2023

kirawi commented Dec 22, 2023

intarga commented Dec 22, 2023

pascalkuthe commented Dec 23, 2023

intarga commented Dec 23, 2023

pascalkuthe commented Dec 28, 2023

intarga commented Dec 28, 2023

gyreas commented Jan 2, 2024

intarga commented Feb 9, 2024

gabydd commented Feb 9, 2024

intarga commented Feb 9, 2024

gabydd commented Feb 9, 2024

intarga commented Feb 9, 2024 • edited Loading

the-mikedavis commented Feb 10, 2024

intarga commented Feb 10, 2024

intarga commented Dec 23, 2024 • edited Loading

intarga commented Dec 23, 2024

Axlefublr commented Dec 23, 2024

ThanHenderson commented Jan 28, 2025

intarga commented Jan 28, 2025

Axlefublr commented Feb 14, 2025

gabydd commented Feb 14, 2025

Axlefublr commented Feb 14, 2025

intarga commented Dec 22, 2023 •

edited

Loading

intarga commented Feb 9, 2024 •

edited

Loading

intarga commented Dec 23, 2024 •

edited

Loading