Rendering images in terminal #384

FedeClaudi · 2020-10-14T20:00:06Z

FedeClaudi
Oct 14, 2020

Hi Will,

I was wondering if you had any plans to add support for images in rich?
Or something like ASCII drawings (e.g. see braille)?

Cheers,
Fede

willmcgugan · 2020-10-15T08:57:15Z

willmcgugan
Oct 15, 2020
Maintainer

That would be a nice feature, but I probably wouldn't implement it in the main library. I need to draw a line under the feature-set to keep it maintainable.

I'm hoping that there will eventually be an ecosystem of third-party libraries using the Console protocol for these things.

Maybe it's something you would like to tackle yourself?

1 reply

FedeClaudi Oct 15, 2020
Author

That makes a lot of sense, of course.

I've used the protocol in pyinspect and it really makes it easy to build new rich-compatible objects, such a nice feature!

rahuldave · 2021-01-13T16:08:21Z

rahuldave
Jan 13, 2021

iterm2 has an image protocol: https://iterm2.com/documentation-images.html . My particular use case is converting Jupyter notebooks to markdown and then showing them on terminal by using rich to show the markdown, with images.

This repo on GitHub shows how to use the rich console protocol to render cells: https://github.com/sandal-tan/nbcat/blob/master/nbcat/notebook.py but how ought one to hook into the markdown to render images using iTerm2's image protocol?

Suspect I need to change the following methods in ImageItem:

def on_enter(self, context: "MarkdownContext") -> None:
    self.link = context.current_style.link
    self.text = Text(justify="left")
    super().on_enter(context)

def __rich_console__(
    self, console: Console, options: ConsoleOptions
) -> RenderResult:
    link_style = Style(link=self.link or self.destination or None)
    title = self.text or Text(self.destination.strip("/").rsplit("/", 1)[-1])
    if self.hyperlinks:
        title.stylize(link_style)
    yield Text.assemble("🌆 ", title, " ", end="")

but should the text yielded just be the escape sequence? Or ought I be using some other machinery within rich?


willmcgugan · 2021-01-13T16:18:24Z

willmcgugan
Jan 13, 2021
Maintainer

You'll need to wrap the escape sequence in a Control instance to prevent Rich from word wrapping etc.

I'm not sure if the iTerm protocol could be integrated more deeply in to Rich. i.e. could you put it in a Panel? I suspect not.

4 replies

rahuldave Jan 13, 2021

Thanks! I suspect that this is not a main library thing to put into Rich. I teach ML and Stats stuff and I have a million notebooks so being able to page the notebooks at command line is the major gain for me. iterm2's imgls and imgcat certainly help from a productivity perspective, but quickly understanding what notebook has what teaching material is super useful.

paw-lu Jul 7, 2021

You'll need to wrap the escape sequence in a Control instance to prevent Rich from word wrapping etc.

Trying to implement this @willmcgugan, but it's not obvious to me how to properly use Control to prevent wrapping. Is there an example or docs on this?

willmcgugan Jul 7, 2021
Maintainer

@paw-lu There is no obvious way to do this atm. I think control.py would have to be modified to allow for an arbitrary string to be rendered.

Even so, it doesn't look like this extension could possibly work in a Rich renderable, i.e. you couldn't put it in a panel, because it is independent from the regular grid of characters. But it may work with just plain Markdown.

There is another image extension supported by more terminals called "sixel". That does work with the character grid, but it's going to be more work to implement.

paw-lu Jul 10, 2021

@paw-lu There is no obvious way to do this atm. I think control.py would have to be modified to allow for an arbitrary string to be rendered.

@willmcgugan ok got it. Thanks for the guidance.

There is another image extension supported by more terminals called "sixel". That does work with the character grid, but it's going to be more work to implement.

Yeah I was trying something similar—where you draw pictures using unicode characters. The problem is that I'm not sure how to get rich to play nicely with position-dependent characters.

Just starting with the basics, I've found the correct combination to get rich.console.Console.print to play nicely, but things get more complicated when you try to place it in a table:

from rich import table
from rich.console import Console

console = Console()
# image_string holds the pre-rendered image text (built elsewhere)

# Good
print("Python's print")
print(image_string)

# Good, but all arguments mandatory
print("Rich's print (with some args)")
console.print(image_string, highlight=False, crop=False, overflow="ignore")

# Mess
print("Rich's print in a table")
tab = table.Table()
tab.add_column("a")
tab.add_column("b", overflow="ignore")
tab.add_row("1", image_string)
console.print(tab, highlight=False, crop=False, overflow="ignore")

Is there a good way to tell rich to leave the content alone and to lock in its position?

lllama · 2021-09-20T20:06:16Z

lllama
Sep 20, 2021

In looking into this, it seems that there are a couple of terminal protocols that allow for images in terminals. (sixel and ReGIS in particular.) These seem like they could be better things to look into using, rather than the proprietary protocol that iTerm uses. (I believe iTerm supports sixel anyway.)


MrBeeMovie · 2022-03-19T16:38:37Z

MrBeeMovie
Mar 19, 2022

Is anyone still interested in this? I really wanted this for a personal project I'm working on. Here is some code I wrote that does an ok job writing to string in console markup. Not the best but ok. I may keep working on this by itself as a separate project at some point and publish to pypi.

from PIL import Image


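def to_string(img: Image.Image, dest_width: int, unicode: bool = True) -> str:
    img_width, img_height = img.size
    scale = img_width / dest_width
    dest_height = int(img_height / scale)
    # Keep the height even: each text row consumes two pixel rows
    dest_height = dest_height + 1 if dest_height % 2 != 0 else dest_height
    # Convert to RGB so getpixel() always returns an (r, g, b) triple
    img = img.convert("RGB").resize((dest_width, dest_height))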
    output = ""

    for y in range(0, dest_height, 2):
        for x in range(dest_width):
            if unicode:
                r1, g1, b1 = img.getpixel((x, y))
                r2, g2, b2 = img.getpixel((x, y + 1))
                output = output + f"[rgb({r1},{g1},{b1}) on rgb({r2},{g2},{b2})]▀[/]"
            else:
                r, g, b = img.getpixel((x, y))
                output = output + f"[on rgb({r},{g},{b})] [/]"

        output = output + "\n"

    return output
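
As a note on why the unicode branch works, here is a minimal self-contained sketch without Pillow. The function name rows_to_markup and the nested-list pixel format are made up for illustration. The character ▀ fills only the top half of a cell, so the foreground color paints the upper pixel and the "on ..." background color paints the lower one, which gives two image rows per line of text.

```python
def rows_to_markup(pixels):
    """Turn rows of (r, g, b) tuples into Rich console markup, two pixel rows per text line."""
    if len(pixels) % 2:
        # Pad to an even number of rows by repeating the last one.
        pixels = pixels + [pixels[-1]]
    lines = []
    for y in range(0, len(pixels), 2):
        top, bottom = pixels[y], pixels[y + 1]
        cells = []
        for (r1, g1, b1), (r2, g2, b2) in zip(top, bottom):
            # Foreground colors the upper half block, background the lower half.
            cells.append(f"[rgb({r1},{g1},{b1}) on rgb({r2},{g2},{b2})]▀[/]")
        lines.append("".join(cells))
    return "\n".join(lines)


red, blue = (255, 0, 0), (0, 0, 255)
markup = rows_to_markup([[red, red], [blue, blue]])
# With Rich installed, rich.print(markup) should draw a two-cell bar, red over blue.
```

Nothing overlaps between lines here: each printed line simply carries two pixel rows at once.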

1 reply

carecavoador Sep 20, 2022

This code is actually really nice. I tried it and could read the pixel values from an image and display it on the terminal.

However, I don't fully understand this f-string, f"[rgb({r1},{g1},{b1}) on rgb({r2},{g2},{b2})]▀[/]". Would you mind explaining it, @MrBeeMovie? What is the on doing, and why are you drawing the character ▀ only half of the time? How can you overlap the ▀ character on the subsequent lines?

This does exactly what I was trying to do. I just don't understand how.

jerch · 2022-06-10T09:16:27Z

jerch
Jun 10, 2022

@willmcgugan
Since most mainline terminal emulators have adopted the SIXEL protocol in recent years, would this qualify to rethink first-class graphics support in rich? It would help a lot with very common tasks like data plotting with higher output quality/resolution.

Ofc there is a big BUT:
The tricky part for such a component would be the sixel encoding itself, which needs image quantization for pixel data exceeding a certain color palette size (typically 256 colors on newer TEs). While this is normally not an issue for data plots (can almost always be reworked to contain less than 256 colors), it still would need the quantization step for hi-res picture or gradient output (might pull in native dependencies, pretty sure that quantization cannot be done in pure python code in a timely fashion).
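
To make the encoding step concrete, here is a minimal pure-Python sketch of a sixel encoder for an image that has already been reduced to a small palette, so no quantization is shown. The name encode_sixel and its input format (rows of palette indices) are made up for illustration; a real encoder would also add run-length compression and the quantization step discussed above.

```python
def encode_sixel(pixels, palette):
    """Encode an indexed image as a sixel string.

    pixels:  list of rows, each a list of palette indices
    palette: list of (r, g, b) tuples with components in 0..255
    """
    height = len(pixels)
    width = len(pixels[0]) if height else 0
    out = ["\x1bPq"]  # DCS introducer that starts sixel mode
    for index, (r, g, b) in enumerate(palette):
        # Color registers use RGB percentages (0..100), selected by the "2".
        out.append(f"#{index};2;{r * 100 // 255};{g * 100 // 255};{b * 100 // 255}")
    for top in range(0, height, 6):
        band = pixels[top:top + 6]  # one sixel row covers six pixel rows
        used = sorted({index for row in band for index in row})
        for index in used:
            chars = []
            for x in range(width):
                bits = 0
                for dy, row in enumerate(band):
                    if row[x] == index:
                        bits |= 1 << dy  # lowest bit is the top pixel
                chars.append(chr(63 + bits))
            # "$" returns to the start of the band so the next color overlays it.
            out.append(f"#{index}" + "".join(chars) + "$")
        out.append("-")  # advance to the next band
    out.append("\x1b\\")  # string terminator
    return "".join(out)


# A 1x6 column of pure red becomes a single "~" (all six bits set).
column = encode_sixel([[0]] * 6, [(255, 0, 0)])
```

Each band is emitted once per color that appears in it, which is why output size and encoding time grow with the palette.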

So if you are interested in such a component, I might be able to help with the impl to some extent.

NB:
This is not yet true for vector graphics capabilities in TEs. Unlike SIXEL, ReGIS has not seen widespread adoption, and I doubt that it ever will (the format spec is quite outdated/limited compared to more modern vector markup formats). Maybe we get an SVG-terminal subspec running somewhere in the future...

1 reply

willmcgugan Jun 10, 2022
Maintainer

It would be a nice to have. For Textual as well.

FedeClaudi · 2022-06-10T09:25:46Z

FedeClaudi
Jun 10, 2022
Author

We've been thinking about integrating sixel in Term.jl, a Julia library inspired by rich: FedeClaudi/Term.jl#94
As you say it's not trivial to do it in a way that plays well with all the other things that rich can do, but it would be quite powerful. In particular while rendering markdown to terminal you could display images too, not too bad!

It's probably not trivial, but if anyone can do it it's Will :P

1 reply

jerch Jun 10, 2022

If you don't mind, I'm gonna comment over there (to not pollute the discussion here...)

ronf · 2022-08-27T17:21:14Z

ronf
Aug 27, 2022

I'm interested in adding support for iTerm2 inline images in my project, which is currently based on Rich for most of its other rich text formatting. At least initially, I don't need images to be a full-blown Rich renderable (to put in tables, etc.), but I would like to output a mix of rich text and these image escape sequences to a Rich console. It looks like I need to use a Control() instance for this, as mentioned above, but I'm not quite seeing how to add a custom Control() sequence for this. It looks like the options for ControlType and the mapping from those to ANSI escape sequences are pre-defined. Without modifying/patching the Rich library, is there a way I can add a custom escape sequence that doesn't impact formatting of other rich output?

On a related note, is there a way to extend the list of allowed markup tokens? I was thinking of trying to add a new [img] tag. The markup parser does seem to keep unknown tags in some cases, but later this information gets stripped out on a call to render(), leading to some rather ugly hacks to try and preserve this information long enough to know where images need to be inserted. Is there a hook of some kind to add new markup tokens?

Thanks for any tips you can provide, and for all your work on Rich! It's a really excellent library, and I'm loving what it is capable of!


ronf · 2022-09-08T18:40:43Z

ronf
Sep 8, 2022

Following up on this, I eventually figured out that passing "control=True" does work in a call to Segment(), and there's no need to use a Control() instance or extend that to support new escape sequences.

My first cut at implementing a renderable for iTerm2 inline images looked something like the following:

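import base64

from rich.segment import Segment


class InlineImage:
    def __init__(self, data, **kwargs):
        def _b64(value):
            return base64.b64encode(value).decode('ascii')

        # iTerm2 only displays the file inline when inline=1 is set
        kwargs.update(inline=1)

        # The optional file name must itself be base64-encoded
        if 'name' in kwargs:
            kwargs['name'] = _b64(kwargs['name'].encode('utf-8'))

        args = ';'.join(f'{k}={v}' for k, v in kwargs.items())

        self._ctrl = f'\x1b]1337;File={args}:{_b64(data)}\a'

    def __rich_console__(self, console, options):
        # control=True marks the segment as a control code rather than text
        yield Segment(self._ctrl, control=True)
        yield Segment('\n')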

This works as long as you only try to render images on lines by themselves, and doing so will always leave the cursor at the beginning of the line following the bottom of the image rendered.

The InlineImage() class takes arguments described at https://iterm2.com/documentation-images.html to scale the image to different sizes (using width, height, and preserveAspectRatio). The "inline" argument will always automatically be set, and it would be easy to do that for the size argument as well, but it looks like that's not a required field. You can also optionally set a "name" for the image.
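
For reference, here is a standalone sketch of how those arguments end up in the escape sequence, using only the standard library. The helper name iterm2_inline_image is hypothetical, and the size forms shown (bare cell counts, "px", "%", "auto") are the ones listed in the iTerm2 documentation linked above; the payload is a placeholder, not a real image.

```python
import base64


def iterm2_inline_image(data, **params):
    """Build an iTerm2 inline-image escape sequence from raw image bytes."""
    params = dict(params, inline=1)  # inline=1 asks iTerm2 to display the file
    if "name" in params:
        # The file name is sent base64-encoded.
        params["name"] = base64.b64encode(params["name"].encode("utf-8")).decode("ascii")
    args = ";".join(f"{key}={value}" for key, value in params.items())
    payload = base64.b64encode(data).decode("ascii")
    return f"\x1b]1337;File={args}:{payload}\a"


fake_image = b"not really an image"  # placeholder bytes for illustration

in_cells = iterm2_inline_image(fake_image, width=20, height=10)
in_pixels = iterm2_inline_image(fake_image, width="200px", preserveAspectRatio=1)
in_percent = iterm2_inline_image(fake_image, width="50%", height="auto", name="plot.png")
```

Only the unitless form maps directly onto terminal rows and columns; the other forms need the terminal's cell size before the image's footprint in lines is known.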

I had an additional requirement that I wanted to be able to print a prefix to the left of the image on all the lines it took up. I had previously asked about how to do this for other renderables in #867 and the WithPrefix function worked well for that (thanks, Will!). However, to get WithPrefix to work here required a bit more work. Basically, this renderable needs to be told how many lines it will take up, and then it needs to render that number of blank lines without the control=True so that Rich can add the right number of prefixes (one per line). I do this by moving the cursor up to the top of the image after drawing it, and then letting Rich fill in the other characters on each line. As long as they don't overlap with the portions of the screen where the image is, this works great. The changes required were not that extensive:

import base64

from rich.segment import Segment


class InlineImage:
    def __init__(self, data, lines=1, **kwargs):
        def _b64(value):
            return base64.b64encode(value).decode('ascii')

        kwargs.update(inline=1)

        if 'name' in kwargs:
            kwargs['name'] = _b64(kwargs['name'].encode('utf-8'))

        args = ';'.join(f'{k}={v}' for k, v in kwargs.items())
        # After drawing, move the cursor back up to the image's first line
        move = f'\x1b[{lines-1}A' if lines > 1 else ''

        self._ctrl = f'\x1b]1337;File={args}:{_b64(data)}\a{move}'
        # Plain newlines (no control=True) so Rich can add a prefix per line
        self._text = lines * '\n'

    def __rich_console__(self, console, options):
        yield Segment(self._ctrl, control=True)
        yield Segment(self._text)

However, this requires that the caller pass in the lines argument specifying the height of the image in lines. If you are scaling using the version of the height parameter which has no units (meaning it is measured in cells), you can just set lines and height to the same value. However, if you want to use one of the other variations of width & height, it gets a bit more complicated. You'll end up needing to send other escape sequences to iTerm2 to have it report its character cell width, height, and scaling (and update that if the terminal font changes), and you'll need something like the Pillow image library to get the original image size and aspect ratio, so that you can do the calculations needed to determine what the height of the image will be after scaling (accounting for aspect ratio if you preserve that), and then divide that by iTerm2's character cell height. I did get that working here, but since it requires interactive I/O to query terminal settings, it's not easy to extract that as a standalone example. Still, I hope others might find this info useful.
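
The arithmetic part of that calculation can be sketched on its own, assuming the cell size and image size are already known. The function name and the example numbers (a 960x1280 image on 7x12 pixel cells) are illustrative only, and querying the terminal is left out entirely.

```python
import math


def image_height_in_lines(img_width_px, img_height_px,
                          cell_width_px, cell_height_px, columns):
    """Lines of text covered by an image scaled to `columns` cells wide,
    with the aspect ratio preserved."""
    scaled_width_px = columns * cell_width_px
    scaled_height_px = img_height_px * scaled_width_px / img_width_px
    # Round up: a partially covered line still has to be reserved.
    return math.ceil(scaled_height_px / cell_height_px)


lines = image_height_in_lines(960, 1280, 7, 12, columns=35)
```

The result is the value that would be passed as lines, with width set to the same number of columns.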

Once lines is passed in, the WithPrefix function in #867 worked great, but it's still not possible to get these images to render properly in something like a table or grid. From what I can see, Rich doesn't really have a way to tell it that some output is more than one line tall. Also, if you do move the cursor backward to let it put data on subsequent lines, it's difficult to get it to not overwrite the image in the process.

It might be possible to draw a table completely FIRST and then move the cursor back up to the appropriate positions for each of the images you want to draw, moving the cursor back down at the end. The tricky part here would be determining what exact row and column each image should be drawn at. In theory, some kind of placeholders could be put into the table that could then be analyzed later by using console.render_lines() (similar to how WithPrefix works), but this isn't something I have attempted since I don't yet have a need for it.

1 reply

ronf Sep 11, 2022

I took some time yesterday to explore the idea of drawing placeholders for images embedded in a container (like a Table) and then later going back and adding in the images by moving the cursor back to the location of each image. While there are some limitations on this, such as the entire renderable needing to be less than the height of the terminal, it generally seems to work and the code is not all that complicated. Here's a first cut at it:

import base64
import math

from rich.console import Console
from rich.control import Control
from rich.measure import Measurement
from rich.segment import Segment
from rich.style import Style

class InlineImage:
    def __init__(self, data, width, height, **kwargs):
        def _b64(value):
            return base64.b64encode(value).decode('ascii')

        if 'name' in kwargs:
            kwargs['name'] = _b64(kwargs['name'].encode('utf-8'))

        self._data = _b64(data)
        self._width = width
        self._height = height
        self._kwargs = kwargs

    def __rich_console__(self, console, options):
        width = self._width or options.max_width
        height = self._height or options.max_height
        aspect_ratio = width / height

        if width > options.max_width:
            width = options.max_width
            height = math.ceil(width / aspect_ratio)

        if height > options.max_height:
            height = options.max_height
            width = math.ceil(height * aspect_ratio)

        kwargs = dict(self._kwargs, inline=1, width=width, height=height)

        args = ';'.join(f'{k}={v}' for k, v in kwargs.items())
        ctrl = f'\x1b]1337;File={args}:{self._data}\a'
        style = Style.from_meta(dict(img=(ctrl, width, height)))

        yield Segment('\xa0' + height * '\n', style=style)

    def __rich_measure__(self, console, options):
        width = self._width or options.max_width

        return Measurement(width, width)

class WithInlineImages:
    def __init__(self, renderable):
        self._renderable = renderable

    def __rich_console__(self, console, options):
        images = []
        lines = console.render_lines(self._renderable, options,
                                     pad=False, new_lines=True)

        row = len(lines)

        for line in lines:
            column = 0

            for segment in line:
                try:
                    ctrl, width, height = segment.style.meta['img']
                    images.append((column, row, ctrl, width, height))
                except (AttributeError, KeyError):
                    pass

                column += segment.cell_length

            yield from line
            row -= 1

        row = 0
        column = 0

        for x, y, ctrl, width, height in images:
            yield Control.move(x - column, row - y)
            yield Segment(ctrl, control=True)

            column = x + width
            row = y - height + 1

        yield Control.move(-column, row)

Note that this uses the Control class in rich, which doesn't appear to be part of the public API. However, if that's an issue it's easy enough to add in a local version:

def _move(x, y):
    result = ''

    if y:
        result += f'\x1b[{y}B' if y > 0 else f'\x1b[{-y}A'

    if x:
        result += f'\x1b[{x}C' if x > 0 else f'\x1b[{-x}D'

    return result

To use this, you'd first create one or more InlineImage objects, passing in the raw image data and a width & height measured in character cells. This size is the default size to render the image, but more importantly it provides the aspect ratio of the image, which is needed for other resizing to work properly. Once you have your image objects, you can include them in other renderables such as a Table, Grid, or Layout, and then when you want to print the result, you wrap the containing renderable with WithInlineImages. For example:

from rich.table import Table

img1 = InlineImage(open('img1.jpg', 'rb').read(), 960/7, 1280/12)
img2 = InlineImage(open('img2.jpg', 'rb').read(), 600/7, 800/12)

table = Table('Column 1', 'Column 2', show_lines=True, width=35)
table.add_row(img1, img2)
table.add_row(img2, img1)
table.add_row(img1, img2)
table.add_row(img2, img1)

console = Console()
console.print(WithInlineImages(table))

In this example, the terminal had character cells which were 7 pixels wide and 12 pixels high, with img1 being 960x1280 resolution and img2 being 600x800 resolution.

For this to work, your entire renderable must fit on the screen. Otherwise, attempts to move the cursor up to where the image is inserted won't work. When done, the cursor is moved back to the bottom of the renderable. Right now, this code assumes the renderable is printed with the default end value of '\n', meaning the cursor ends up at the left column of the first line after the value being printed.

Note the use of '\xa0' here. This is a Unicode non-breaking space, and that's what ends up becoming the anchor for where the image will finally be printed. It is followed by a series of newlines for whatever height was computed for the image, to make sure the container provides enough space for the image to be drawn later. The width is guaranteed by returning the computed width via __rich_measure__ as both the min & max width.

Also, this code takes advantage of the Style.from_meta() function to carry image metadata from InlineImage through to WithInlineImages via a Style object. This is also how WithInlineImages distinguishes between inline images and other renderables.

The initial image size calculations would be best done using a library like Pillow to query the image size, along with what I mentioned above about querying iTerm2 for the character cell size. To keep this example focused on just the Rich-related parts, I have not included those pieces here.

brisvag · 2023-08-23T18:19:35Z

brisvag
Aug 23, 2023

I've played around a bit with rich and the kitty graphics protocol. By using the simple example they have there, almost unchanged, I made a small renderable class that works, but somehow only in ipython after running pretty.install() O.o

To test the following, you will need the kitty terminal, pillow, requests, rich and textual.

test_kitty_img.py

#!/usr/bin/env python3

import io
from base64 import standard_b64encode

import requests
from PIL import Image

from textual.app import App
from textual.widgets import Static
from rich import print
from rich.segment import Segment

url = 'https://github.com/textualize/rich/raw/master/imgs/features.png'


class KittyImage:
    def __init__(self, url):
        # download the image, resize and convert to png
        img_response = requests.get(url, stream=True)
        img = Image.open(io.BytesIO(img_response.content))
        self.png = io.BytesIO()
        img.resize(size=(500, 500)).save(self.png, format='png')
        # fill up the buffer using the function from the example
        self.buf = io.BytesIO()
        self.write_chunked(a='T', f=100)
        self.buf.seek(0)
        # generate a Segment for rich to display
        self.segment = Segment(self.buf.read().decode())

    # the following two methods are essentially unchanged from the example in
    # https://sw.kovidgoyal.net/kitty/graphics-protocol/#a-minimal-example
    @staticmethod
    def serialize_gr_command(**cmd):
        payload = cmd.pop('payload', None)
        cmd = ','.join(f'{k}={v}' for k, v in cmd.items())
        ans = []
        w = ans.append
        w(b'\033_G'), w(cmd.encode('ascii'))
        if payload:
            w(b';')
            w(payload)
        w(b'\033\\')
        return b''.join(ans)

    def write_chunked(self, **cmd):
        self.png.seek(0)
        data = standard_b64encode(self.png.read())
        while data:
            chunk, data = data[:4096], data[4096:]
            m = 1 if data else 0
            self.buf.write(self.serialize_gr_command(payload=chunk, m=m, **cmd))
            self.buf.flush()
            cmd.clear()

    def __rich_console__(self, console, options):
        yield self.segment


# small app example
class Img(Static):
    def get_content_width(self, container, viewport):
        return 50

class ImageApp(App):
    def compose(self):
        yield Img(KittyImage(url))


if __name__ == "__main__":
    app = ImageApp()
    app.run()
    img = KittyImage(url)
    print(repr(img))
    print(img)

When I put the code above inside test_kitty_img.py and run the following from ipython:

from test_kitty_img import url, KittyImage
KittyImage(url)

I see this:

Awesome! However, running rich.print(KittyImage(url)) hangs forever, even though I expected exactly the same output.

Similarly, if you simply run the code above as a script (which will open a textual UI and immediately after print the image with rich.print), you'll get nothing.

It feels so close to working, but I'm missing something to actually be able to use it in a textual app (or at least a command line tool). Any suggestions?

9 replies

brisvag Aug 24, 2023

Found the point where things are breaking: the display hooks from the pretty module are using crop=False.

However, print and textual use crop=True by default, which breaks the graphics payload in the wrong way, and results in hanging/nothing being printed. So I can use console.print(KittyImage(url), crop=False) and it works. Now I just need to find a way to have a renderable say "don't crop!" to the print function.

ihabunek Jan 13, 2024

@brisvag Did you manage to find a way to accomplish this? I'm stuck at pretty much the same point.

brisvag Jan 16, 2024

Unfortunately, no. I didn't really spend more time on this though, so maybe there's a way ^^' Please let us know if you do find out!

danschwarz Jan 18, 2024

Do you think it would be possible to monkeypatch Textual to set crop=False? I'm not saying it'd be wise to do so, but perhaps this is a way to make the image rendering work... 🤔

brisvag Jan 19, 2024

That was my thinking as well, but I got stuck multiple times in trying to do so... It's been a while unfortunately, so I don't remember the details :/

Probably the cleaner way to do it would be to allow a Renderable to say "if you see me, set crop to false before rendering".

odell0111 · 2023-10-13T02:43:07Z

odell0111
Oct 13, 2023

The Python package image-in-terminal performs well by replacing every two pixels of an image with the character ▀ (Upper Half Block). This makes it most suitable for displaying low-resolution images. While images with higher resolutions can also be displayed in the terminal, performance may decrease and the terminal's contents will need to be zoomed out to view the images. This package served my small project well, and it might also be useful for others.


lnqs · 2024-05-01T15:34:39Z

lnqs
May 1, 2024

Seems like this thread has been open for a while now, but as I played around with Kitty's Terminal Graphics Protocol and Textual recently, I'll leave my results here. Maybe someone finds them interesting. I got a working solution, even though it's a bit hacky:

#!/usr/bin/env python

import io
import sys
from base64 import b64encode
from PIL import Image
from textual.widgets import Label
from rich.segment import Segment
from textual.widget import Widget
from textual.app import App, ComposeResult
from textual.geometry import Size, NULL_SIZE
from textual.containers import Center, Middle
from rich.console import (
    Console,
    ConsoleOptions,
    ConsoleRenderable,
    RenderResult,
    RichCast,
)
from rich.style import Style


PLACEHOLDER = 0x10EEEE

# fmt: off
NUMBER_TO_DIACRITIC = [
     0x00305, 0x0030d, 0x0030e, 0x00310, 0x00312, 0x0033d, 0x0033e, 0x0033f, 0x00346, 0x0034a, 0x0034b, 0x0034c, 0x00350, 0x00351, 0x00352, 0x00357,
     0x0035b, 0x00363, 0x00364, 0x00365, 0x00366, 0x00367, 0x00368, 0x00369, 0x0036a, 0x0036b, 0x0036c, 0x0036d, 0x0036e, 0x0036f, 0x00483, 0x00484,
     0x00485, 0x00486, 0x00487, 0x00592, 0x00593, 0x00594, 0x00595, 0x00597, 0x00598, 0x00599, 0x0059c, 0x0059d, 0x0059e, 0x0059f, 0x005a0, 0x005a1,
     0x005a8, 0x005a9, 0x005ab, 0x005ac, 0x005af, 0x005c4, 0x00610, 0x00611, 0x00612, 0x00613, 0x00614, 0x00615, 0x00616, 0x00617, 0x00657, 0x00658,
     0x00659, 0x0065a, 0x0065b, 0x0065d, 0x0065e, 0x006d6, 0x006d7, 0x006d8, 0x006d9, 0x006da, 0x006db, 0x006dc, 0x006df, 0x006e0, 0x006e1, 0x006e2,
     0x006e4, 0x006e7, 0x006e8, 0x006eb, 0x006ec, 0x00730, 0x00732, 0x00733, 0x00735, 0x00736, 0x0073a, 0x0073d, 0x0073f, 0x00740, 0x00741, 0x00743,
     0x00745, 0x00747, 0x00749, 0x0074a, 0x007eb, 0x007ec, 0x007ed, 0x007ee, 0x007ef, 0x007f0, 0x007f1, 0x007f3, 0x00816, 0x00817, 0x00818, 0x00819,
     0x0081b, 0x0081c, 0x0081d, 0x0081e, 0x0081f, 0x00820, 0x00821, 0x00822, 0x00823, 0x00825, 0x00826, 0x00827, 0x00829, 0x0082a, 0x0082b, 0x0082c,
     0x0082d, 0x00951, 0x00953, 0x00954, 0x00f82, 0x00f83, 0x00f86, 0x00f87, 0x0135d, 0x0135e, 0x0135f, 0x017dd, 0x0193a, 0x01a17, 0x01a75, 0x01a76,
     0x01a77, 0x01a78, 0x01a79, 0x01a7a, 0x01a7b, 0x01a7c, 0x01b6b, 0x01b6d, 0x01b6e, 0x01b6f, 0x01b70, 0x01b71, 0x01b72, 0x01b73, 0x01cd0, 0x01cd1,
     0x01cd2, 0x01cda, 0x01cdb, 0x01ce0, 0x01dc0, 0x01dc1, 0x01dc3, 0x01dc4, 0x01dc5, 0x01dc6, 0x01dc7, 0x01dc8, 0x01dc9, 0x01dcb, 0x01dcc, 0x01dd1,
     0x01dd2, 0x01dd3, 0x01dd4, 0x01dd5, 0x01dd6, 0x01dd7, 0x01dd8, 0x01dd9, 0x01dda, 0x01ddb, 0x01ddc, 0x01ddd, 0x01dde, 0x01ddf, 0x01de0, 0x01de1,
     0x01de2, 0x01de3, 0x01de4, 0x01de5, 0x01de6, 0x01dfe, 0x020d0, 0x020d1, 0x020d4, 0x020d5, 0x020d6, 0x020d7, 0x020db, 0x020dc, 0x020e1, 0x020e7,
     0x020e9, 0x020f0, 0x02cef, 0x02cf0, 0x02cf1, 0x02de0, 0x02de1, 0x02de2, 0x02de3, 0x02de4, 0x02de5, 0x02de6, 0x02de7, 0x02de8, 0x02de9, 0x02dea,
     0x02deb, 0x02dec, 0x02ded, 0x02dee, 0x02def, 0x02df0, 0x02df1, 0x02df2, 0x02df3, 0x02df4, 0x02df5, 0x02df6, 0x02df7, 0x02df8, 0x02df9, 0x02dfa,
     0x02dfb, 0x02dfc, 0x02dfd, 0x02dfe, 0x02dff, 0x0a66f, 0x0a67c, 0x0a67d, 0x0a6f0, 0x0a6f1, 0x0a8e0, 0x0a8e1, 0x0a8e2, 0x0a8e3, 0x0a8e4, 0x0a8e5,
     0x0a8e6, 0x0a8e7, 0x0a8e8, 0x0a8e9, 0x0a8ea, 0x0a8eb, 0x0a8ec, 0x0a8ed, 0x0a8ee, 0x0a8ef, 0x0a8f0, 0x0a8f1, 0x0aab0, 0x0aab2, 0x0aab3, 0x0aab7,
     0x0aab8, 0x0aabe, 0x0aabf, 0x0aac1, 0x0fe20, 0x0fe21, 0x0fe22, 0x0fe23, 0x0fe24, 0x0fe25, 0x0fe26, 0x10a0f, 0x10a38, 0x1d185, 0x1d186, 0x1d187,
     0x1d188, 0x1d189, 0x1d1aa, 0x1d1ab, 0x1d1ac, 0x1d1ad, 0x1d242, 0x1d243, 0x1d244
]
# fmt: on


class KittyImage(Widget):
    _next_image_id = 1

    class _Renderable:
        def __init__(self, image_id: int, size: Size) -> None:
            self._image_id = image_id
            self._size = size

        def __rich_console__(
            self, _console: Console, _options: ConsoleOptions
        ) -> RenderResult:
            style = Style(color=f"rgb({(self._image_id >> 16) & 255}, {(self._image_id >> 8) & 255}, {self._image_id & 255})")
            id_char = NUMBER_TO_DIACRITIC[(self._image_id >> 24) & 255]
            for r in range(self._size.height):
                line = ""
                for c in range(self._size.width):
                    line += f"{chr(PLACEHOLDER)}{chr(NUMBER_TO_DIACRITIC[r])}{chr(NUMBER_TO_DIACRITIC[c])}{chr(id_char)}"
                line += "\n"
                yield Segment(line, style=style)

    def __init__(
        self,
        image: Image,
        *,
        name: str | None = None,
        id: str | None = None,
        classes: str | None = None,
        disabled: bool = False,
    ) -> None:
        super().__init__(name=name, id=id, classes=classes, disabled=disabled)
        image_buffer = io.BytesIO()
        image.save(image_buffer, format="png")
        self._image_data = image_buffer.getvalue()

        self._image_id = KittyImage._next_image_id
        KittyImage._next_image_id += 1

        self._placement_size = NULL_SIZE

        self._send_image_to_terminal()

    def _send_image_to_terminal(self) -> None:
        data = b64encode(self._image_data)
        while data:
            chunk, data = data[:4096], data[4096:]

            ans = [
                f"\033_Gi={self._image_id},m={1 if data else 0},f=100,q=2".encode(
                    "ascii"
                )
            ]
            if chunk:
                ans.append(b";")
                ans.append(chunk)
            ans.append(b"\033\\")

            # Dangerous: this could interfere with the writer thread, but we can't use
            # textual's functions to write to the terminal.
            # It buffers output. There's no way around that (Driver.flush() is a no-op).
            # This buffering re-chunks the data, which leads to a failed transmission.
            sys.__stdout__.buffer.write(b"".join(ans))
            sys.__stdout__.buffer.flush()

    def _create_virtual_placement(self, size: Size) -> None:
        # Same issue as above, even though the size of the data probably would still work with the
        # buffering. But we have this hack in place anyway, so it shouldn't matter anymore.
        sys.__stdout__.buffer.write(
            f"\033_Ga=p,U=1,i={self._image_id},c={size.width},r={size.height},q=2\033\\".encode(
                "ascii"
            )
        )
        sys.__stdout__.buffer.flush()

    def render(self) -> ConsoleRenderable | RichCast:
        if self._placement_size != self.content_size:
            self._create_virtual_placement(self.content_size)
            self._placement_size = self.content_size
        return KittyImage._Renderable(self._image_id, self.content_size)


class ImageApp(App[None]):
    def compose(self) -> ComposeResult:
        with Center():
            with Middle():
                yield KittyImage(Image.open("image.png"))

    def on_mount(self) -> None:
        self.query_one(KittyImage).styles.width = 20
        self.query_one(KittyImage).styles.height = 20


if __name__ == "__main__":
    ImageApp().run()

Disclaimer: This isn't tested very well and could probably be improved.

The approach is to send the image to Kitty and use the Unicode Placeholders for the actual display. While it basically works, it comes with a few issues:

Writing to the terminal
Textual does a lot of magic with stdout. The driver starts a writer thread that maintains an output buffer, which interferes with Kitty's chunking requirements, and the driver's flush() method is a no-op. To work around this, the code above writes to sys.__stdout__ directly. It works, but I'm a bit worried about what happens if the writer thread and this code write at the same time, even though the chances of that seem low.
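As a rough illustration of the chunking mentioned above, the framing can be separated from the actual write so it can be checked without a terminal. This is only a sketch that mirrors the escape sequences used in the code above; build_kitty_chunks is a hypothetical helper name, not part of Textual or Kitty.

```python
from __future__ import annotations

from base64 import b64encode

# Maximum base64 payload per escape sequence, as used in the code above.
# 4096 is a multiple of 4, so every chunk except the last stays aligned.
CHUNK_SIZE = 4096


def build_kitty_chunks(png_data: bytes, image_id: int) -> list[bytes]:
    """Frame PNG data as a list of Kitty graphics escape sequences.

    Hypothetical helper: it only builds the bytes, it does not write them.
    """
    payload = b64encode(png_data)
    sequences = []
    while payload:
        chunk, payload = payload[:CHUNK_SIZE], payload[CHUNK_SIZE:]
        more = 1 if payload else 0  # m=1 means "more chunks follow"
        header = f"\033_Gi={image_id},m={more},f=100,q=2;".encode("ascii")
        sequences.append(header + chunk + b"\033\\")
    return sequences
```

Each element would then have to reach the terminal intact, which is exactly what the output buffering gets in the way of.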

Reading terminal responses
A clean implementation would let the terminal assign an image ID and read it back. Reading terminal responses would also allow proper error handling. However, Textual also starts a thread that reads from the terminal. If we started another read, getting the response we're hoping for would be pure luck, and even then other input would be lost. I didn't find a sensible way to hook into Textual's reading logic. The code therefore assumes that no images exist other than the ones it manages, doesn't handle errors, and can't clean up after itself.
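For reference, if the responses could be read, parsing them would be the easy part. Kitty answers graphics commands with an escape sequence of the form ESC _ G keys ; message ESC \, where the message is OK or an error string. A minimal sketch, assuming the raw bytes have already been obtained somehow (parse_kitty_response is a hypothetical name):

```python
from __future__ import annotations

import re

# Kitty's reply format: ESC _ G <key=value,...> ; <message> ESC \
_RESPONSE = re.compile(rb"\x1b_G([^;\x1b]*);([^\x1b]*)\x1b\\")


def parse_kitty_response(raw: bytes) -> tuple[dict[str, str], str] | None:
    """Return (keys, message) for the first graphics response in raw, or None.

    Hypothetical helper: it does not read from the terminal itself.
    """
    match = _RESPONSE.search(raw)
    if match is None:
        return None
    keys = {}
    for item in match.group(1).decode("ascii").split(","):
        key, _, value = item.partition("=")
        if key:
            keys[key] = value
    return keys, match.group(2).decode("ascii")
```

Note that the code above sends q=2, which tells Kitty to suppress responses, so that key would have to be dropped before there is anything to parse.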

Aligning the image in the container
Aligning the image inside the container while using the Unicode Placeholders would require us to know how much to pad the image. While we do know the dimensions of the image, we don't know the cell size. Therefore we have no idea how many cells to pad and can't align it.
Kitty is able to report the cell size, but we can't get this information because we can't query the terminal, for the reasons above.

Edit: The last statement seems to be wrong. The terminal size in both pixels and cells can be obtained with an ioctl, so calculating the padding should be straightforward. It just needs to be implemented.
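A minimal sketch of that idea, assuming a Unix system where the TIOCGWINSZ ioctl is available. The function names are hypothetical, and the padding logic assumes the image is shown at its native pixel size and centered, which may not match how Kitty scales a placement:

```python
from __future__ import annotations

import math
import struct


def query_terminal_size(fd: int) -> tuple[int, int, int, int]:
    """Return (rows, columns, width_px, height_px) via TIOCGWINSZ."""
    import fcntl  # Unix-only modules, imported lazily
    import termios

    packed = fcntl.ioctl(fd, termios.TIOCGWINSZ, b"\0" * 8)
    rows, columns, width_px, height_px = struct.unpack("HHHH", packed)
    return rows, columns, width_px, height_px


def cell_size_px(
    rows: int, columns: int, width_px: int, height_px: int
) -> tuple[float, float] | None:
    """Pixel size of one cell, or None if the terminal reports no pixel size."""
    if not (rows and columns and width_px and height_px):
        return None
    return width_px / columns, height_px / rows


def image_size_in_cells(
    image_px: tuple[int, int], cell_px: tuple[float, float]
) -> tuple[int, int]:
    """Cells needed to show the image at native resolution, rounded up."""
    return math.ceil(image_px[0] / cell_px[0]), math.ceil(image_px[1] / cell_px[1])


def centering_padding(
    image_cells: tuple[int, int], container_cells: tuple[int, int]
) -> tuple[int, int]:
    """Left and top padding, in cells, that centers the image (never negative)."""
    pad_x = max(0, (container_cells[0] - image_cells[0]) // 2)
    pad_y = max(0, (container_cells[1] - image_cells[1]) // 2)
    return pad_x, pad_y
```

In the widget, query_terminal_size(sys.__stdout__.fileno()) would supply the first four values. Some terminals report zero for the pixel fields, hence the None case.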

0 replies

AM-I-Human · 2024-06-19T19:02:52Z

AM-I-Human
Jun 19, 2024

Hello, I opened another issue on Textual and then found this.
I think it would be great to use something like this: https://github.com/AnonymouX47/term-image
It supports other terminals using block mode: https://github.com/AnonymouX47/termvisage
It could be added as an optional dependency, as black does with Jupyter, for example pip install black[jupyter]
What do you think?

1 reply

AM-I-Human Jun 19, 2024

https://github.com/wookayin/python-imgcat works on WezTerm too. Both seem to have issues with Windows paths. On WezTerm, imgcat seems to have some graphical glitches when you press the up arrow.

lnqs · 2024-09-15T19:04:04Z

lnqs
Sep 15, 2024

I continued to play around with Kitty's Terminal Graphics Protocol in Textual and Rich, and it seems to work quite well now (at least in my tests). I turned the results into a package: https://github.com/lnqs/textual-kitty
Maybe it's of interest to someone.

2 replies

xbais Oct 13, 2024

Amazing job! You saved me a lot of time!

basnijholt Dec 26, 2024

Thank you so much, this is exactly what I was looking for, for my https://github.com/basnijholt/tuitorial project to make terminal-based code tutorials/presentations!
