Overview

ClearML Agent is a virtual environment and execution manager for DL / ML solutions on GPU machines. It integrates with the ClearML Python Package and ClearML Server to provide a full AI cluster solution.
It focuses on:

Reproducing experiments, including their complete environments.
Scaling workflows on multiple target machines.

ClearML Agent executes an experiment or other workflow by reproducing the state of the code from the original machine to a remote machine.

The preceding diagram demonstrates a typical flow where an agent executes a task:

1. Enqueue a task for execution on the queue.
2. The agent pulls the task from the queue.
3. The agent launches a docker container in which to run the task's code.
4. The task's execution environment is set up:
   1. Execute any custom setup script configured.
   2. Install any required system packages.
   3. Clone the code from a git repository.
   4. Apply any uncommitted changes recorded.
   5. Set up the python environment and required packages.
5. The task's script/code is executed.

:::note Python Version
ClearML Agent uses the Python version available in the environment or docker in which it executes the code. It does not install Python, so make sure to use a docker or environment with the version you need.
:::

While the agent is running, it continuously reports system metrics to the ClearML Server (these can be monitored in the Orchestration page).

Once ClearML Agent is running on a target machine, you can keep using it to reproduce experiments and execute automated workflows in one (or both) of the following ways:

Programmatically (using Task.enqueue() or Task.execute_remotely())
Through the ClearML Web UI (without working directly with code), by cloning experiments and enqueuing them to the queue that a ClearML Agent is servicing.
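
The two programmatic routes can be sketched as follows. This is a minimal illustration, not a definitive recipe: the project name, task name, and the queue name `default` are placeholders, and the `clearml` import is deferred into each function so the module can be loaded on a machine where the package is not installed.

```python
def run_on_agent(queue_name="default"):
    """Start a script locally, then hand its execution over to an agent."""
    from clearml import Task  # requires `pip install clearml`

    # Placeholder project/task names; replace with your own.
    task = Task.init(project_name="examples", task_name="remote run")

    # Enqueues this task and, with exit_process=True, ends the local process.
    # When an agent later runs the task, this call is skipped and execution
    # continues with whatever follows it.
    task.execute_remotely(queue_name=queue_name, exit_process=True)
    return task


def clone_and_enqueue(source_task_id, queue_name="default"):
    """Clone an existing task and enqueue the clone for an agent to run."""
    from clearml import Task

    source = Task.get_task(task_id=source_task_id)
    cloned = Task.clone(source_task=source, name="clone of " + source.name)
    Task.enqueue(cloned, queue_name=queue_name)
    return cloned.id
```

`run_on_agent()` fits scripts that should be launched from a development machine but executed elsewhere, while `clone_and_enqueue()` mirrors in code what the Web UI route does by hand. Both assume an agent is already servicing the named queue.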

The agent lets you override a task's execution details through the UI, without modifying code. When you modify the configuration of a cloned task, the ClearML agent that executes the clone uses the modified values in place of the original ones:

Modified package requirements cause the experiment script to run with the updated packages.
Modified command line arguments are injected by the ClearML agent in place of the recorded values.
Modified hyperparameters override code-level configuration instrumented with Task.connect().
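
The hyperparameter case can be sketched in code. In this hedged example the project and task names, the default values, and the queue name are all placeholders, and the `General/` section prefix is an assumption about where a dictionary connected without an explicit name is stored. The first function shows the instrumented script side; the second shows the same override done programmatically rather than in the UI.

```python
def load_hyperparameters():
    """Return hyperparameters, using values stored on the task under an agent."""
    from clearml import Task  # requires `pip install clearml`

    # Placeholder project/task names.
    task = Task.init(project_name="examples", task_name="connect demo")

    # Defaults as written in code; a local run uses and records these.
    params = {"learning_rate": 0.01, "batch_size": 32}

    # Under an agent, connect() fills the dictionary with the values stored
    # on the task, so edits made to the clone take effect here.
    params = task.connect(params)
    return params


def clone_with_overrides(source_task_id, overrides, queue_name="default"):
    """Clone a task, change selected hyperparameters, and enqueue the clone."""
    from clearml import Task

    cloned = Task.clone(source_task=source_task_id)
    for name, value in overrides.items():
        # Names take the form "<section>/<key>"; "General" is assumed here as
        # the section used for a dictionary connected without a name.
        cloned.set_parameter(name, value)
    Task.enqueue(cloned, queue_name=queue_name)
    return cloned.id
```

For instance, `clone_with_overrides(task_id, {"General/learning_rate": 0.001})` would enqueue a clone whose script, on reaching `task.connect(params)`, sees the new learning rate while the code itself stays unchanged.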

ClearML Agent can be deployed in various setups to suit different workflows and infrastructure needs:

Bare Metal
Kubernetes
Slurm
Google Colab

References

For more information, see the following:

ClearML Agent CLI, a reference for clearml-agent's CLI commands.
ClearML Agent Environment Variables, a list of environment variables for configuring ClearML Agent.
Agent Section, a list of options for configuring ClearML Agent in clearml.conf.
