Re-write ANSI color handling #225

mgeier · 2016-02-02T17:58:51Z

Initially, I only wanted to have less ugly colors for Python exceptions shown in LaTeX output.
But then I found out that the whole ANSI-handling code is quite a mess (which I already hinted at in #214).

The code handling HTML and LaTeX was completely separate and showed quite different behavior.

I re-implemented the whole thing; now most of the code is shared between HTML and LaTeX output. I also implemented the 256 indexed colors and 24-bit RGB colors, which are supported by the live notebook. And I fixed several bugs along the way (e.g. #174).
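For reference, the 256 indexed colors are conventionally laid out as in xterm: indices 0-15 are the palette-dependent default colors, 16-231 form a 6x6x6 color cube, and 232-255 are a grayscale ramp. The following is a minimal sketch of that mapping, assuming the xterm channel levels. The function name `indexed_to_rgb` is hypothetical, and this is not necessarily how the PR implements it.

```python
# Channel intensities of the 6x6x6 color cube (xterm convention, assumed here).
_CUBE_LEVELS = (0, 95, 135, 175, 215, 255)


def indexed_to_rgb(idx):
    """Map an extended color index (16-255) to an (r, g, b) tuple.

    Hypothetical helper for illustration only. Indices 0-15 are left out
    because their actual colors depend on the palette in use.
    """
    if not 16 <= idx <= 255:
        raise ValueError('expected an index in range(16, 256), got %r' % (idx,))
    if idx < 232:
        # 6x6x6 cube: the offset encodes red, green, blue in base 6.
        offset = idx - 16
        r, g, b = offset // 36, (offset // 6) % 6, offset % 6
        return (_CUBE_LEVELS[r], _CUBE_LEVELS[g], _CUBE_LEVELS[b])
    # Grayscale ramp: 24 steps from 8 to 238 in increments of 10.
    level = 8 + (idx - 232) * 10
    return (level, level, level)
```

In an escape sequence, such an index follows the parameters `38;5` (foreground) or `48;5` (background), while 24-bit colors are given directly as `38;2;R;G;B`.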

There are a few caveats, though:

The behavior is still different from the live notebook, but I consider my implementation more correct in several points. I'll probably list the bugs in a separate issue there, once I've got some feedback here.
The 8 (resp. 16) default colors are still different between HTML and LaTeX because I didn't dare to change the color definitions. But probably this should be adjusted?
I had to add 2 \newcommands in the LaTeX template because there were errors when I tried to use something like \textcolor[RGB]{255,0,0}{some text} within a Verbatim environment. I think it may have something to do with the commas (I read somewhere that they might be "active", whatever that means ...), but I don't know enough TeX-fu to find, let alone fix the error. Can somebody help with that?

Closes jupyter#214. As a side effect, this also fixes jupyter#174.

minrk · 2016-02-02T18:24:55Z

Since this fixes "several bugs", would you mind adding tests covering the things you found that weren't correct before that are now fixed?

mgeier · 2016-02-02T19:57:20Z

Oh, I forgot: sure, I'll update the tests, but I've never used them. Can you please point me to the documentation on how to run the tests?

Also, I wanted to wait for some feedback ... If you don't like my implementation, there's no point in making tests for it ...

takluyver · 2016-02-02T20:17:27Z

I don't know if it's explicitly documented anywhere, but running nosetests in the root of the repository should do the trick. If you get errors about missing css, run python setup.py css to fetch it.

I'll try to have a look over the implementation if Min doesn't get round to it first.

minrk · 2016-02-02T20:41:08Z

I think the basic principle of the new implementation is sound. ANSI processing is just fiddly enough that any changes make me squeamish without tests (accepts warranted criticism for the poor test coverage in the first place).

minrk · 2016-02-02T20:41:41Z

nbconvert/filters/ansi.py

-        text += '</span>'
-    return text
+    if isinstance(fg, int):
+        classes.append(_FG_HTML[fg])


Are KeyErrors impossible on these lookups?

mgeier · 2016-02-03

Well, "impossible" is a very strong word, but yes, if fg (or bg) is a single int, it is supposed to be within range(8). AFAICT, there is no code path that allows otherwise (assuming no monkey-patching and stuff).

mgeier · 2016-02-03T09:39:53Z

@takluyver Thanks, it would probably be helpful to add those instructions to CONTRIBUTING.md. Also how to make a "develop" installation locally ... and how to build the docs ...

@minrk Thanks for having a look! I strongly agree that this ANSI stuff is quite fiddly (and underspecified), I'll update the tests soon.

mgeier · 2016-02-03T18:47:50Z

While updating the tests, I found a bug in ipython_genutils.text.strip_ansi(). Is there a way to find out if this is used anywhere else?

If not, I guess it would make sense to move it to nbconvert/filters/ansi.py, right?
And fix its bugs, of course.

minrk · 2016-02-03T19:39:11Z

@mgeier yes, if there are bugs in something from genutils, it's appropriate to add an implementation here and stop using the genutils version.

mgeier · 2016-02-03T20:10:28Z

OK, will do. Shall I also file a PR to remove it from genutils or will it just rot there?

Another question:
Which of those regular expressions are more efficient/nicer/cooler/easier to understand...?

\x1b\\[([^@-~]*)([@-~])

\x1b\\[(.*?)([@-~])

They should do the same thing, but I kinda prefer the second one, because it uses fewer characters in total but more different ones.
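A quick way to compare the two patterns is to run both over the same input. This is only a sketch and not part of the PR; the names `NEGATED` and `LAZY` and the sample strings are made up for illustration. On ordinary escape sequences the two agree, but they differ when a newline sits between the `\x1b[` introducer and the final byte, because `.` does not match a newline unless `re.DOTALL` is set.

```python
import re

# The doubled backslash in the patterns quoted above is string-literal
# escaping; as raw strings, both reduce to a single "\[".
NEGATED = re.compile(r'\x1b\[([^@-~]*)([@-~])')
LAZY = re.compile(r'\x1b\[(.*?)([@-~])')

# Ordinary sequences: both patterns find the same (parameters, final byte) pairs.
sample = '\x1b[1;33mhello\x1b[0m \x1b[2Kworld'
negated_hits = NEGATED.findall(sample)
lazy_hits = LAZY.findall(sample)

# A newline inside the parameter part: the negated class still matches,
# the lazy dot does not.
multiline = '\x1b[1\n;33mhello'
negated_multiline = NEGATED.findall(multiline)
lazy_multiline = LAZY.findall(multiline)

print(negated_hits == lazy_hits)
print(negated_multiline, lazy_multiline)
```

Whether the newline case matters in practice depends on how malformed input should be treated, so this is a difference in behavior to be aware of rather than a reason to prefer one pattern.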

mgeier · 2016-02-03T20:27:12Z

OK, I updated the tests, copied strip_ansi() from ipython_genutils (and hopefully fixed it). As expected, Travis CI doesn't complain anymore.
For now, I changed to the second regex; if you want me to change back to the first one (or an entirely different one), please tell me.

To be quite honest, I don't really want to add tests for my 256 color and 24-bit extension, but if you insist, I'll do it. Also, the test cases are quite repetitious, this should probably be fixed, too.
And why on earth are you using this self._try_*() thing, isn't that utterly useless boilerplate?

BTW, why don't you use py.test, wouldn't that be much cooler?

The third point of my initial message is still open, any TeX gurus out there?

minrk · 2016-02-03T20:47:00Z

@mgeier I'd leave it alone. It is my hope to never make another release of genutils, and making a PR removing anything would break things (like current stable nbconvert), so we shouldn't ever do that.

minrk · 2016-02-03T20:51:42Z

BTW, why don't you use py.test, wouldn't that be much cooler?

We use pytest in some new projects, but we've been using nose in IPython for years, so that's what the tests have been written against when added to existing projects. And, having used both pytest and nose, pytest doesn't really offer benefits over nose that justify anyone spending the time to port existing, working test code.

takluyver · 2016-02-03T22:52:41Z

I just grepped through the repos I have cloned for other uses of strip_ansi(), it appears to be used only in nbconvert and in IPython (which already has its own copy). There are a couple of repos I don't have (ipyparallel, ipywidgets), but I think it's unlikely they use it.

mgeier · 2016-02-04T17:11:04Z

I've asked on stackexchange about the LaTeX problem mentioned above, and I promptly got an answer with a great (albeit somewhat verbose) solution. I've pushed a new commit with that.

From my side, this is ready to be merged now, do you want me to make further changes?
If not, shall I squash and rebase my commits?

@minrk As a side note, I want to make a few comments about py.test, but this doesn't really have anything to do with this PR:

I've never really used nosetests, so I'm probably missing something, but from what I've seen, py.test has huge benefits since it allows much more detailed failure reports with much less boilerplate. It's somewhat magical ...
I might have overlooked that in nosetests, but py.test offers very easy means to create parametrized tests, which would be a perfect fit for the ANSI tests. In contrast to the manual loop that's used now, this would make a separate test case for each individual test condition, showing in a much clearer way (in case of a test failure) what the conditions actually were. And it wouldn't give up at the first failure but continue with the other conditions within the test case.
If you want, I can try to rewrite the ANSI tests to show what I mean ...
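As a rough illustration of the parametrized style described above (a sketch, not the PR's test code): `strip_ansi_sketch` is a hypothetical stand-in for the function under test, and the cases are adapted from the test strings in this PR. `pytest.mark.parametrize` turns each tuple into a separate test case, so a failure reports the exact input that triggered it and the remaining cases still run.

```python
import re

import pytest

# Hypothetical stand-in for the function under test, not the PR's code.
_ANSI_RE = re.compile(r'\x1b\[(.*?)([@-~])')


def strip_ansi_sketch(source):
    """Remove ANSI escape sequences from a string."""
    return _ANSI_RE.sub('', source)


# Each tuple is (input, expected output) and becomes its own test case.
CASES = [
    ('\x1b[1;33mhello', 'hello'),
    ('hel\x1b[0;32mlo', 'hello'),
    ('hello', 'hello'),
]


@pytest.mark.parametrize('source, expected', CASES)
def test_strip_ansi(source, expected):
    # A plain assert is enough; pytest reports both sides on failure.
    assert strip_ansi_sketch(source) == expected
```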

In general, py.test has the advantage that one can use simple assert statements instead of the ugly specialized assertSomething functions/methods. The output in case of failure is still great.

... that justify anyone spending the time to port existing, working test code.

And that's the beautiful thing: py.test can run the current nbconvert tests without change! There is nothing to port!

If you change to py.test (which I think would be a good idea), you can gradually change tests to use the cooler assert statements, parametrized tests, ... whenever there is time.

minrk · 2016-02-04T19:36:00Z

Please don't change the tests to use py.test as part of this PR. I'm aware it has some benefits. I do typically choose it for new projects. I think they are significantly overstated.

mgeier · 2016-02-05T08:39:49Z

@minrk Don't worry, I wasn't going to do that. If anyone is interested, I can do it in a separate PR, if not, I won't do anything.

minrk · 2016-02-06T10:56:15Z

Thanks! At some point, we probably will. The fact that the tests run unmodified definitely makes it easier to start.

mgeier · 2016-02-11T15:06:09Z

Is this scheduled for merging or do I have to make some changes?

takluyver · 2016-02-11T17:40:11Z

nbconvert/filters/tests/test_ansi.py

+            '\x1b[1;33mhello': '<span class="ansiyellow ansibold">hello</span>',
+            '\x1b[37mh\x1b[0;037me\x1b[;0037ml\x1b[00;37ml\x1b[;;37mo': '<span class="ansigray">h</span><span class="ansigray">e</span><span class="ansigray">l</span><span class="ansigray">l</span><span class="ansigray">o</span>',
+            'hel\x1b[0;32mlo': 'hel<span class="ansigreen">lo</span>',
+            'hello': 'hello',


Apologies if I'm missing it, but can we get a test case where two different formatting options overlap, e.g.:

# [BOLD]hello[YELLOW]world[RESET]
\x1b[1mhello\x1b[33mworld\x1b[0m

^ Don't trust that I've constructed that example correctly.

mgeier · 2016-02-13

No, I don't think you're missing it. This wasn't possible before, so there were no tests for it.

I added a test case, but first I changed to test generators and combined the test cases for ansi2html() and ansi2latex() to avoid the annoying duplication of test strings.

UPDATE: I reverted the de-duplication (1446e2a) because it seemed to be incompatible with Python 2.
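To make the overlapping case concrete, here is a minimal sketch of a converter that keeps the current bold and foreground state across escape sequences and opens a new span for each run of text. It is an illustration under stated assumptions, not the implementation from this PR: `ansi_to_html_sketch` is a hypothetical name, only a handful of SGR codes are handled, no HTML escaping is done, and the class names other than `ansibold`, `ansiyellow`, `ansigreen` and `ansigray` (which appear in the tests above) are guesses.

```python
import re

# Only "m" (SGR) sequences with numeric parameters are recognized here.
_SGR_RE = re.compile(r'\x1b\[([0-9;]*)m')

# Class suffixes for foreground codes 30-37 (partly assumed, see lead-in).
_FG_NAMES = ['black', 'red', 'green', 'yellow',
             'blue', 'purple', 'cyan', 'gray']


def ansi_to_html_sketch(text):
    """Convert a small subset of ANSI SGR sequences to HTML spans."""
    out = []
    bold = False
    fg = None
    pos = 0

    def emit(chunk):
        # Wrap the chunk in a span reflecting the state at this point.
        if not chunk:
            return
        classes = []
        if fg is not None:
            classes.append('ansi' + _FG_NAMES[fg])
        if bold:
            classes.append('ansibold')
        if classes:
            out.append('<span class="%s">%s</span>'
                       % (' '.join(classes), chunk))
        else:
            out.append(chunk)

    for match in _SGR_RE.finditer(text):
        emit(text[pos:match.start()])
        pos = match.end()
        for param in match.group(1).split(';'):
            code = int(param) if param else 0  # empty parameter means 0
            if code == 0:
                bold, fg = False, None  # reset everything
            elif code == 1:
                bold = True
            elif code == 22:
                bold = False
            elif 30 <= code <= 37:
                fg = code - 30
            elif code == 39:
                fg = None
    emit(text[pos:])
    return ''.join(out)
```

With this sketch, `'\x1b[1mhello\x1b[33mworld\x1b[0m'` yields one bold span for `hello` and a second span for `world` that is both yellow and bold, since the bold state carries over until the reset.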

takluyver · 2016-02-13T18:27:26Z

Thanks! I think the generator tests might be causing the failure on Python 2 on Travis - I vaguely remember having problems with that machinery before and stripping out all uses of it, but I forget the details now.

mgeier · 2016-02-14T09:06:04Z

OK, then I'll drop 1446e2a and just add the new test case (twice).

takluyver · 2016-02-14T10:17:23Z

Thanks. I think this is looking OK; I'll give it a day or two more for other people to have a look, and we can merge if nothing else is brought up.

Carreau · 2016-02-15T01:09:59Z

No objections.

From a quick look, I think most of these might be available in Pygments, if we already depend on it.

takluyver · 2016-02-15T11:40:08Z

nbconvert does already depend on pygments for highlighting code in HTML and Latex output. However, I don't immediately see an API in pygments for parsing ANSI escapes. If that machinery is there but not publicly exposed, I'd suggest we merge this for now as it looks like a concrete improvement, and look separately at collaborating with pygments or some other package to reduce code duplication.

minrk · 2016-02-15T14:09:28Z

👍 Thanks, @mgeier!

Carreau · 2016-02-15T18:18:56Z

I'd suggest we merge this for now as it looks like a concrete improvement, and look separately at collaborating with pygments or some other package to reduce code duplication.

Seems like a plan. I'm in the Pygments codebase these days. I'll see if I can have a look.

takluyver · 2016-02-15T19:22:33Z

In ur codebase, killin ur bugz? ;-)

mgeier · 2016-02-16T08:44:01Z

Thanks for merging!

I had a quick look at the Pygments code and I only found ANSI sequences as output. But if there is also code for input, it sure would make sense to re-use this instead of re-implementing.

Re-write ANSI color handling

6be45f6

Closes jupyter#214. As a side effect, this also fixes jupyter#174.

minrk reviewed Feb 2, 2016

mgeier added 2 commits February 3, 2016 21:17

Move strip_ansi() from ipython_genutils and change regex

5e61dbb

Fix ANSI tests

409398f

Avoid \newcommand for ANSI RGB colors

825474b

minrk mentioned this pull request Feb 4, 2016

PDF conversion choke on some bytes (ansi sequences.??) #228

Closed

minrk added this to the 4.2 milestone Feb 4, 2016

takluyver reviewed Feb 11, 2016

Remove apparently useless boilerplate from ANSI tests

f468965

ANSI tests: add test case for overlapping formatting

cbd76bd

mgeier force-pushed the ansi branch from 24e6c62 to cbd76bd February 14, 2016 09:05

mgeier mentioned this pull request Feb 14, 2016

16 ANSI colors? #245

Closed

takluyver added a commit that referenced this pull request Feb 15, 2016

Merge pull request #225 from mgeier/ansi

5646f0b

Re-write ANSI color handling

takluyver merged commit 5646f0b into jupyter:master Feb 15, 2016

mgeier deleted the ansi branch February 16, 2016 08:41

mgeier mentioned this pull request Feb 24, 2016

Fix ANSI code for "bold" jupyter/notebook#988

Closed

mgeier mentioned this pull request Mar 18, 2016

Re-factor ANSI color handling jupyter/notebook#1230

Merged

mgeier mentioned this pull request Jan 20, 2022

Fix memory leak with notebooks and ANSI colors in Firefox (#9431) jupyterlab/jupyterlab#11273

Draft
