Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use skopeo to pull container images #948

Closed
rnjudge opened this issue Apr 13, 2021 · 4 comments · Fixed by #1090
Closed

Use skopeo to pull container images #948

rnjudge opened this issue Apr 13, 2021 · 4 comments · Fixed by #1090
Assignees
Labels
proposal Propose a change to the project refactor Requires significant code refactoring super Super issue - other issues are linked to this one tools work on tools to understand and contribute to the project

Comments

@rnjudge
Copy link
Contributor

rnjudge commented Apr 13, 2021

Background
Reference: https://kubernetes.io/blog/2020/12/02/dont-panic-kubernetes-and-docker/

TL;DR - Kubernetes currently uses Dockershim to access containerd container runtime. Dockershim is being removed from Kubelet which removes support for Docker as a container runtime in favor of other runtimes that use Container Runtime Interface created for K8s.

We would like Tern to run in Kubernetes. This means that we cannot use Docker commands to pull images from the registry with the-i command line option if we want Tern to run in Kubernetes after v1.2.0.

Proposal
Let's try to use subprocess + ORAS instead of the Docker python library to pull container images from a registry when running with the -i option. This will mean that we need to include the go module as a dependency of Tern.

  • Create an OCI image class and implement the load_image() method for it in accordance with how ORAS pulls images.
  • We may need an additional tool on top of ORAS. ORAS will send all the bytes of the downloaded image to stdout and we need to figure out how we store these results to analyze.

Use Cases

  • Deploying Tern as a k8s cluster on a CRD
@rnjudge rnjudge added tools work on tools to understand and contribute to the project proposal Propose a change to the project super Super issue - other issues are linked to this one refactor Requires significant code refactoring labels Apr 13, 2021
@rnjudge rnjudge self-assigned this Apr 13, 2021
@rnjudge rnjudge changed the title Use ORAS to pull container images Use skopeo to pull container images Apr 28, 2021
@rnjudge
Copy link
Contributor Author

rnjudge commented Apr 28, 2021

It does not appear that ORAS supports anonymous pulling of container images from Dockerhub at this time. Skopeo does support this behavior and therefore, we should explore using skopeo instead. A previous PR (#712) started to work on this and we may want to revive parts of this.

@falcorocks
Copy link

+1 on skopeo, I have used it to successfully achieve something similar to what you are trying to do here.
Abandoning docker in favour of something like skopeo would also make it much better for CI, where docker in docker is still a nogo for security reasons!

@rnjudge
Copy link
Contributor Author

rnjudge commented Jul 8, 2021

@mukultaneja is working on this. @falcorocks if you have any tips or pointers, feel free to drop them here :)

@mukultaneja
Copy link
Contributor

@rnjudge @nishakm, how are we going to make skopeo as a requirement to tern moving onwards?

nishakm pushed a commit to nishakm/tern that referenced this issue Dec 9, 2021
This is work towards tern-tools#948

OCI images on disk may be represented in different ways. As such
the layer tarballs may exist at the root of the working directory
rather than in their own path. This change allows alternative
directory paths to be used based on the image layout.

- Added a new property: image_layout to the ImageLayer class. The
  image_layout property will contain either "docker" or "oci" with
  the default being "oci"
- Moved get_untar_dir from rootfs.py to be a method in the ImageLayer
  class. get_untar_dir will now return a directory path to the
  expected path where the layer's contents are untarred based on
  the image layout. Removed an unnecessary function in rootfs.py
  as a result.
- Modified the mount_diff_layers to accept untarred directory
  paths instead of tar files.
- Modified the function mount_overlay_fs in multi_layer.py to collect
  the directory paths instead of the tar file to pass to the
  mount_diff_layers function.
- Added a new get_untar_dir test to the ImageLayer test suite.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 9, 2021
This is work towards tern-tools#948

This change adds a new class - OCIImage which represents the
OCI image schema version 2 available on disk when downloaded by
skopeo. This change also adds a skopeo.py script to the list
of loading methods. Skopeo.py adds functions to check if the skopeo
binary exists on disk and a function to pull images using skopeo.
A test suite for the OCIImage class is also added. This test suit
is similar to the test_class_docker_image.py test suit.

Additionally, we have changed the DockerImage class and associated tests
to set the layer's image_layout property to "docker".

We've switched the test for changes to the oci class to use a container
image and the changes to the docker class to use the dockerfile, as
we need the docker daemon to test Dockerfile builds.

Signed-off-by: Mukul Taneja <mtaneja@vmware.com>
Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 9, 2021
This is work towards tern-tools#948

This commit replaces all instances of the old rootfs get_untar_dir
function with the ImageLayer method get_untar_dir. We also remove
an unused function in passthrough.py that uses layer tar files.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 9, 2021
This is work towards tern-tools#948

This commit makes changes to hook up the CLI option for container
image to use the OCIImage class rather than the DockerImage class.
The argument for the -i option is now --image.

- If the raw_image option is given, we assume that this is the
  output of "docker save" rather than some other tool.
- In the load_full_image function, we provide an "image_type"
  string which should either be "oci" or "docker", "oci" being the
  default. Based on this string, we either return an instance of
  OCIImage or DockerImage.
- For the Dockerfile or Lock features, we explicitly create
  DockerImage objects.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 9, 2021
This is work towards tern-tools#948

Now that skopeo has become the de-facto tool to download container
images, we will switch our instructions to install skopeo and use
it in our build and development environments.

At this time, skopeo is available on Ubuntu 20.04 through the
Kubic project, hence there may need to be some additional changes
in the future as the upstream package creators improve.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 9, 2021
This is work towards tern-tools#948

The html formatting of lists allows for an expansion UI under
the title of the layer object. The title is the layer tarball's
digest. In the Docker image layout, this can be directly taken
from the "tar_file" key. But in the OCI layout, this value is taken
from the "digest" key. This commit allows for that difference and
defaults to the layer's index if neither one of those keys exist.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

OCI images on disk may be represented in different ways. As such
the layer tarballs may exist at the root of the working directory
rather than in their own path. This change allows alternative
directory paths to be used based on the image layout.

- Added a new property: image_layout to the ImageLayer class. The
  image_layout property will contain either "docker" or "oci" with
  the default being "oci"
- Moved get_untar_dir from rootfs.py to be a method in the ImageLayer
  class. get_untar_dir will now return a directory path to the
  expected path where the layer's contents are untarred based on
  the image layout. Removed an unnecessary function in rootfs.py
  as a result.
- Modified the mount_diff_layers to accept untarred directory
  paths instead of tar files.
- Modified the function mount_overlay_fs in multi_layer.py to collect
  the directory paths instead of the tar file to pass to the
  mount_diff_layers function.
- Added a new get_untar_dir test to the ImageLayer test suite.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

This change adds a new class - OCIImage which represents the
OCI image schema version 2 available on disk when downloaded by
skopeo. This change also adds a skopeo.py script to the list
of loading methods. Skopeo.py adds functions to check if the skopeo
binary exists on disk and a function to pull images using skopeo.
A test suite for the OCIImage class is also added. This test suit
is similar to the test_class_docker_image.py test suit.

Additionally, we have changed the DockerImage class and associated tests
to set the layer's image_layout property to "docker".

We've switched the test for changes to the oci class to use a container
image and the changes to the docker class to use the dockerfile, as
we need the docker daemon to test Dockerfile builds.

Signed-off-by: Mukul Taneja <mtaneja@vmware.com>
Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

This commit replaces all instances of the old rootfs get_untar_dir
function with the ImageLayer method get_untar_dir. We also remove
an unused function in passthrough.py that uses layer tar files.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

This commit makes changes to hook up the CLI option for container
image to use the OCIImage class rather than the DockerImage class.
The argument for the -i option is now --image.

- If the raw_image option is given, we assume that this is the
  output of "docker save" rather than some other tool.
- In the load_full_image function, we provide an "image_type"
  string which should either be "oci" or "docker", "oci" being the
  default. Based on this string, we either return an instance of
  OCIImage or DockerImage.
- For the Dockerfile or Lock features, we explicitly create
  DockerImage objects.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

Now that skopeo has become the de-facto tool to download container
images, we will switch our instructions to install skopeo and use
it in our build and development environments.

At this time, skopeo is available on Ubuntu 20.04 through the
Kubic project, hence there may need to be some additional changes
in the future as the upstream package creators improve.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

The html formatting of lists allows for an expansion UI under
the title of the layer object. The title is the layer tarball's
digest. In the Docker image layout, this can be directly taken
from the "tar_file" key. But in the OCI layout, this value is taken
from the "digest" key. This commit allows for that difference and
defaults to the layer's index if neither one of those keys exist.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

This change adds a new class - OCIImage which represents the
OCI image schema version 2 available on disk when downloaded by
skopeo. This change also adds a skopeo.py script to the list
of loading methods. Skopeo.py adds functions to check if the skopeo
binary exists on disk and a function to pull images using skopeo.
A test suite for the OCIImage class is also added. This test suit
is similar to the test_class_docker_image.py test suit.

Additionally, we have changed the DockerImage class and associated tests
to set the layer's image_layout property to "docker".

We've switched the test for changes to the oci class to use a container
image and the changes to the docker class to use the dockerfile, as
we need the docker daemon to test Dockerfile builds.

Signed-off-by: Mukul Taneja <mtaneja@vmware.com>
Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

This commit replaces all instances of the old rootfs get_untar_dir
function with the ImageLayer method get_untar_dir. We also remove
an unused function in passthrough.py that uses layer tar files.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

This commit makes changes to hook up the CLI option for container
image to use the OCIImage class rather than the DockerImage class.
The argument for the -i option is now --image.

- If the raw_image option is given, we assume that this is the
  output of "docker save" rather than some other tool.
- In the load_full_image function, we provide an "image_type"
  string which should either be "oci" or "docker", "oci" being the
  default. Based on this string, we either return an instance of
  OCIImage or DockerImage.
- For the Dockerfile or Lock features, we explicitly create
  DockerImage objects.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

Now that skopeo has become the de-facto tool to download container
images, we will switch our instructions to install skopeo and use
it in our build and development environments.

At this time, skopeo is available on Ubuntu 20.04 through the
Kubic project, hence there may need to be some additional changes
in the future as the upstream package creators improve.

Signed-off-by: Nisha K <nishak@vmware.com>
nishakm pushed a commit to nishakm/tern that referenced this issue Dec 15, 2021
This is work towards tern-tools#948

The html formatting of lists allows for an expansion UI under
the title of the layer object. The title is the layer tarball's
digest. In the Docker image layout, this can be directly taken
from the "tar_file" key. But in the OCI layout, this value is taken
from the "digest" key. This commit allows for that difference and
defaults to the layer's index if neither one of those keys exist.

Signed-off-by: Nisha K <nishak@vmware.com>
rnjudge added a commit that referenced this issue Dec 15, 2021
This commit enables the use of Skopeo to pull container images.
Skopeo uses the OCI schema version 2 to fetch container images.
It lays out the images on disk differently than Docker after a pull.
In order to enable analysis of containers pulled in this way, this PR
introduces the OCIImage class which reflects the expected layout.

In order to deal with the different expected directory structures, we
move a commonly used function in rootfs.py, get_untar_dir(), to the
ImageLayer class, and creates a new property called 'image_layout'.
In this way, container image layouts on disk can be dealt with based
on derived Image classes.

We also introduce the OCIImage class and changes to the
DockerImage class which makes use of the new ImageLayer property
and method. We replace all instances of get_untar_dir() with the
ImageLayer instance's get_untar_dir() method. We connect all
the pieces from the command line option to the image extraction method.
Finally, we add Skopeo to the list of requirements for Tern in the
documentation, Dockerfiles and the development environments.

Lastly, we deal with the different image dictionary layouts
based on the image layout in the html report specifically.

Fixes #948

Signed-off-by: Nisha K <nishak@vmware.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
proposal Propose a change to the project refactor Requires significant code refactoring super Super issue - other issues are linked to this one tools work on tools to understand and contribute to the project
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants