Skip to content

Dask-Jobqueue wrapper for the CERN HTCondor cluster

License

Notifications You must be signed in to change notification settings

cernops/dask-lxplus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

dask-lxplus

Builds on top of Dask-Jobqueue to enable jobs to run on the CERN HTCondor cluster via LXPLUS.

Summary

from distributed import Client 
from dask_lxplus import CernCluster
import socket

cluster = CernCluster(
    cores = 1,
    memory = '2000MB',
    disk = '10GB',
    death_timeout = '60',
    lcg = True,
    nanny = False,
    container_runtime = 'none',
    log_directory = '/eos/user/b/ben/condor/log',
    scheduler_options = {
        'port': 8786,
        'host': socket.gethostname(),
    },
    job_extra = {
        'MY.JobFlavour': '"longlunch"',
    },
)

CERN extras

There are a few changes in the wrapper to address some of the particular features of the CERN HTCondor cluster, but there are also a few changes to detail here.

Options

lcg: If set to True this will validate and use the LCG python environment per the managed LCG releases. It will send the environment of the submitting scheduler to the batch worker node. DASK normally requires that both the scheduler and the worker is the same python versions and libraries. At CERN this would mean that you should, assuming say the default of CentOS7 worker nodes, that the scheduler is run on something likelxplus.cern.chalso running CentOS7`. An example use would be to do the following before running dask:

$ . /cvmfs/sft.cern.ch/lcg/views/LCG_104/x86_64-el9-gcc11-opt/setup.sh

container_runtime: Can be set to "singularity" or docker or "none". If a runtime is needed for the worker, this selects which will be used for the HTCondor job the worker runs. In principle it should not be necessary when using lcg and should therefore be set to "none". Default though is "singularity".

worker_image: The image that will be used if container_runtime is defined to use one. The default is defined in jobqueue-cern.yaml.

batch_name: Optionally set a string that will identify the jobs in HTCondor. The default is "dask-worker"

About

Dask-Jobqueue wrapper for the CERN HTCondor cluster

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages