Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improved ANSI passthrough. #1596

Merged
merged 7 commits into from
Dec 8, 2021
Merged

Conversation

eth-p
Copy link
Collaborator

@eth-p eth-p commented Mar 23, 2021

This pull request overhauls how bat interprets and re-emits ANSI escape sequences.
Prior to these changes, bat used a naiive heuristic that re-emits every encountered sequence on a new line. The encountered sequences are only cleared whenever the exact literal \x1B[0m is encountered.

In 60aad68, I added a simple ANSI sequence parser that only tracks the most recently-encountered ANSI SGR (color, style, etc.) sequence for each attribute. I haven't benchmarked the difference in CPU usage, but it significantly increases the performance of the pager (the cause of #1481).

Memory usage: (Top: before, Bottom: after)
image

In 5aa94e8, I fixed ANSI passthrough support when wrapping is disabled. The non-wrapping branch didn't have code to handle ANSI, which led to inconsistencies across lines.

Example: printf '\x1B[33mYellow\nShould be yellow.\x1B[m' | bat --wrap=never
image


I did my best to manually check for any regressions, but I may have missed some. If anyone finds any issues with how ANSI is handled now, please let me know.


Additionally, if/when console-rs/console#95 is merged, we should update it. Without the fix, strings such as \x1B(0lqk will be incorrectly interpreted as [\x1B(0l, qk] instead of [\x1B(0, lqk].

Edit by @Enselic: We included the fix in bat a month ago in #1934 🎉

@keith-hall
Copy link
Collaborator

Nice work! 🎆

I did my best to manually check for any regressions, but I may have missed some.

Do we not have any tests that cover the non-wrapping behavior? Otherwise I'd expect to see some failures due to some differences in our highlighted test files for example.
Also, if we have tests covering input containing ANSI escape codes, that could improve our confidence that all is good ;) (I don't have chance to check atm)

@eth-p
Copy link
Collaborator Author

eth-p commented Mar 23, 2021

I don't think we have any tests specifically covering input with ANSI sequences. I definitely would've broken snapshots tests with this fix, but everything passed without an issue.

@sharkdp
Copy link
Owner

sharkdp commented Mar 27, 2021

Awesome! Excited to get this integrated soon. It might take me a few more days until I find the time to review it though.

It would be great to see actual (before and after) benchmark results. Maybe with some of the "highlighted" files in tests/syntax-tests (which doesn't seem to work currently)?

@eth-p
Copy link
Collaborator Author

eth-p commented Mar 28, 2021

It would be great to see actual (before and after) benchmark results.

I unfortunately don't have a machine that's reliable enough to accurately benchmark bat's wall time. It seems my laptop can introduce noise (thermal throttling? background CPU usage?) that may vary results by up to 40% per run, and I don't want to provide any unreliable benchmarks.

That being said... I can still quantify the difference in how many bytes bat is writing to stdout or the pager. In some cases, it's a small but meaningful 9% difference.

Using the pre-highlighted Rust syntax test:

ColsSyntaxStylePictureOLDNEWDiff%
80textPlain920168380191.07%
120RustPlain990739113191.98%
120RustFull69273263422291.55%

And in more extreme cases such as the output from the command in #1481:

ColsSyntaxStylePictureOLDNEWDiff%
120textFull90856097610828420.1191%

As far as the changes in 5aa94e8 are concerned though, they will have negative impact on unhighlighted text with --wrap=never. Prior to that commit, --wrap=never was faster than it should have been, since ANSI sequences were not actually being interpreted.

Maybe with some of the "highlighted" files in tests/syntax-tests (which doesn't seem to work currently)?

They do work with -l txt. I suspect the reason they don't work is because Syntect is highlighting parts of the ANSI color sequences as their own highlight blocks, and bat doesn't try to look for sequences across blocks.

A smarter implementation would likey be for bat to just disable all highlighting when it encounters its first ANSI escape sequence, but I feel like that would be a breaking change that requires some discussion around it first.

@sharkdp
Copy link
Owner

sharkdp commented Apr 3, 2021

I unfortunately don't have a machine that's reliable enough to accurately benchmark bat's wall time. It seems my laptop can introduce noise (thermal throttling? background CPU usage?) that may vary results by up to 40% per run, and I don't want to provide any unreliable benchmarks.

I wrote hyperfine to be able to deal with effects like these when benchmarking command line programs. I'm not asking you to use it, but if you are interested, you could give it a try 😄

hyperfine [options] './bat-old …' './bat-new'
  • There is a builtin outlier detection that will tell you if background processes completely break your benchmark. Obviously, you should still always try to run benchmarks on a quiet PC. Shuting down things like spotify, dropbox, browsers, … can have a huge impact.
  • Hyperfine performs a series of benchmarks and performs a statistical analysis. You will get an estimate for the noise in your benchmark results. If you are still unsure, you can run a t-test on the results that will tell you if the speedup/slowdown is statistically significant.
  • There are options to use warmup runs before the actual benchmark. This can help with disk caching effects (which are likely to play a role here as well) and possibly even with thermal throttling.

Concerning the last point (thermal throttling): On Linux, there is a way to temporarily disable any frequency scaling, which can help with benchmarking.

@eth-p
Copy link
Collaborator Author

eth-p commented Apr 3, 2021

@sharkdp That worked really well, even on my Mac.

Okay, so... as expected, this pull request does introduce some cost at runtime. In the worst case (e.g. #1481), it takes 27% longer when not printing to a terminal:

image

And in the best case (no ANSI), it doesn't really change anything:

image

Now when it actually prints something to the screen... let's just say that the old one couldn't be benchmarked.

hyperfine -r10 --show-output './bat-old -ltxt --decorations=always --color=always --paging=never lots-of-ansi.txt'

image

The fixed version worked a lot better:

image

@sharkdp sharkdp mentioned this pull request Jul 25, 2021
11 tasks
src/printer.rs Show resolved Hide resolved
@eth-p
Copy link
Collaborator Author

eth-p commented Oct 2, 2021

Updated this PR to be merge-able into master.

@God-damnit-all
Copy link

I was wondering why color was getting stripped when piping to bat, I was considering opening an issue but I found this PR. I really hope this gets merged soon.

@Enselic
Copy link
Collaborator

Enselic commented Nov 19, 2021

@ImportTaste Does passing --color=never --wrap=never preserve colors? See https://github.com/sharkdp/bat#garbled-output (you did not miss that, it was added minutes ago)

@sharkdp sharkdp added this to the v0.19 milestone Nov 22, 2021
@Enselic
Copy link
Collaborator

Enselic commented Dec 8, 2021

I merged the code with origin/master and fixed some lints.

I have confirmed that the added integration test fails on master. In other words, it tests that the fix in this PR is necessary and works. Which is a great test to have.

I do think the code could use some de-duplication, but that was the case even before this PR, so I don't think we need to do that as part of this PR.

Unless I'm mistaken, I think that #1976 allows us to do comprehensive benchmarking of this PR. Here are the results on my low-end desktop:

  • The grep-output-ansi-sequences.txt test is much faster, from 912 ms to 177 ms with --wrap=character.
  • Performance for other files with --wrap=character is unchanged.
  • As already mentioned by Ethan, performance does worsen quite a bit with --wrap=never for grep-output-ansi-sequences.txt, going from 129 ms to 157 ms. But, as explained, that's because we cheated before.
  • For other files with --wrap=never (that does not contain ANSI escape sequences) there is very little change in performance.

IMHO we should go ahead and merge this PR now, because it is an overall improvement. And we can always keep working on the code.

Here is the full benchmark comparison if you want to look. Left is git master e250da8. Right is the latest commit in this PR namely
4c044f5. (I realize now that 9f36470 is not in the benchmark, but that's just lint fixes and does not make a difference) :
master-vs-this-pr

@Enselic Enselic removed the needs-work label Dec 8, 2021
@sharkdp
Copy link
Owner

sharkdp commented Dec 8, 2021

Thank you for looking into this. I haven't had the time to fully review it. From a high-level view, it looks like the vscreen module could use some unit tests, but I am okay with merging this.

@Enselic
Copy link
Collaborator

Enselic commented Dec 8, 2021

I haven't done a full review review either in the sense that I understand how the code works. But it looks like completely reasonable code, and I'm confident enough in the code to believe it's going to be easy to understand how it works when the need arises. I'm confident enough of that to set Approved on the code at least, and be willing to merge it.

All the evidence so far points that it is an improvement over what we have at master right now, without any regressions. And we can always keep working on the code even if we merge it.

Since you too are OK with merging it, I will merge it now.

@Enselic Enselic merged commit 63ad538 into sharkdp:master Dec 8, 2021
@Enselic
Copy link
Collaborator

Enselic commented Dec 8, 2021

Closed #1481

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants