Use a spinner for model loading information #161

tehmatt · 2023-04-30T02:32:46Z

Switch from annoyingly long log messages to a spinner during model loading.
This has the downside of always printing, rather than being controlled by RUST_LOG, but also means we only get a single line of output that's updated as loading progresses.

An example of the end result:

tehmatt · 2023-04-30T02:33:40Z

llama-cli/Cargo.toml

 clap = {version = "4.1.8", features = ["derive"]}
 color-eyre = {version = "0.6.2", default-features = false}
 env_logger = "0.10.0"
 log = "0.4"
 num_cpus = "1.15.0"
 once_cell = "1.17.1"
 rustyline = "11.0.0"
-spinners = "4.1.0"
+spinoff = { version = "0.7.0", default-features = false, features = ["dots2"] }


These are pretty similar, but spinoff is much more versatile as it allows updating the message

Switch from annoyingly long `log` messages to a spinner during model loading. This has the downside of always printing, rather than being controlled by `RUST_LOG`, but also means we only get a single line of output that's updated as loading progresses.

tehmatt · 2023-04-30T02:36:50Z

llama-cli/src/cli_args.rs

+                            .success(&format!(
+                                "Loaded {tensor_count} tensors from '{}' ({}) after {}ms",
+                                file.to_string_lossy(),
+                                bytesize::to_string(byte_size, false),


bytesize will handle differently sized models more aesthetically than using a fixed 1MB divisor.

tehmatt · 2023-04-30T02:38:53Z

llama-rs/src/loader_common.rs

@@ -105,7 +105,7 @@ pub enum LoadProgress<'a> {
        /// The path to the model part.
        file: &'a Path,
        /// The number of bytes in the part.
-        byte_size: usize,
+        byte_size: u64,


I'm open to reverting this, but it seems reasonable to match the type of https://doc.rust-lang.org/std/fs/struct.Metadata.html#method.len.

tehmatt · 2023-04-30T03:14:47Z

llama-rs/src/loader2.rs

@@ -190,7 +196,7 @@ pub(crate) fn load(

    (load_progress_callback)(LoadProgress::PartLoaded {
        file: &path,
-        byte_size: 0,
+        byte_size: filesize,


I guess the argument for leaving this 0 is that we don't know the exact size of the tensors since there's other stuff in the file too. It seems reasonable to fall back to the whole file in that case, but the other decent option would be to change the type to Option<u64> and not print tensor size if we get None.

philpax · 2023-04-30T16:00:09Z

Looks good to me! We're going to merge #162 first, but I'll fix up this PR and merge it in afterwards :)

philpax · 2023-05-04T18:32:05Z

Thanks for the PR! Definitely nice to see the spinner, especially for non-mmappable models :)

tehmatt commented Apr 30, 2023

View reviewed changes

Merge branch 'main' into tehmatt-spinner

064ba8c

philpax merged commit 5d61d81 into rustformers:main May 4, 2023

tehmatt deleted the tehmatt-spinner branch May 5, 2023 06:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a spinner for model loading information #161

Use a spinner for model loading information #161

tehmatt commented Apr 30, 2023 •

edited

Loading

tehmatt Apr 30, 2023

tehmatt Apr 30, 2023

tehmatt Apr 30, 2023

tehmatt Apr 30, 2023 •

edited

Loading

philpax commented Apr 30, 2023

philpax commented May 4, 2023

Use a spinner for model loading information #161

Use a spinner for model loading information #161

Conversation

tehmatt commented Apr 30, 2023 • edited Loading

tehmatt Apr 30, 2023

Choose a reason for hiding this comment

tehmatt Apr 30, 2023

Choose a reason for hiding this comment

tehmatt Apr 30, 2023

Choose a reason for hiding this comment

tehmatt Apr 30, 2023 • edited Loading

Choose a reason for hiding this comment

philpax commented Apr 30, 2023

philpax commented May 4, 2023

tehmatt commented Apr 30, 2023 •

edited

Loading

tehmatt Apr 30, 2023 •

edited

Loading