Skip to content

Commit

Permalink
Revert "Master (#2)"
Browse files Browse the repository at this point in the history
This reverts commit 1d7495b.
  • Loading branch information
lucpeterson committed Aug 19, 2020
1 parent 1d7495b commit 103fb8b
Show file tree
Hide file tree
Showing 65 changed files with 191 additions and 684 deletions.
19 changes: 0 additions & 19 deletions .gitlab-ci.yml

This file was deleted.

41 changes: 0 additions & 41 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,47 +4,6 @@ All notable changes to Merlin will be documented in this file.
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## [1.7.3]

### Fixed
- Completed 1.7.2 fix for `merlin run-workers`.

## [1.7.2]

### Fixed
- Fatal bug triggered by a spec missing the `env` or `global.parameters` sections.

## [1.7.1]

### Added
- When using the `--samplesfile` flag, the samples file is now copied to `merlin_info` for
provenance.

### Fixed
- Exceptions in connection check sub-process will now be caught.

## [1.7.0]

### Added
- The ability to override any value of the celery configuration thru `app.yaml` in `celery.override`.
- Support and faq entry for `pgen` with `merlin run --pgen` and optional `--parg`.
- Documentation on `level_max_dirs`.
- Easier-to-read provenance specs.
- Documentation on the new 3 types of provenance spec.

### Fixed
- Flux test example data collection for new versions of flux.
- Fixed Docker ubuntu version.
- Removed expansion of env variables in shell sections (`cmd` and `restart`) of provenance
specs. This allows the shell command itself to expand environment variables, and gives
users greater flexibility.
- Allowed environment variables to be properly expanded in study `description.name`.
- Tilde (~) now properly expands as part of a path in non-shell sections.
- The rediss cert_reqs keyword was changed to ssl_cert_reqs.

### Changed
- Updated tutorial redis version to 6.0.5.

## [1.6.2]

### Added
Expand Down
2 changes: 1 addition & 1 deletion Dockerfile
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
FROM ubuntu:18.04
FROM ubuntu
LABEL maintainer="Joe Koning koning1@llnl.gov"

ARG USER=merlinu
Expand Down
2 changes: 1 addition & 1 deletion Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
#
# LLNL-CODE-797170
# All rights reserved.
# This file is part of Merlin, Version: 1.7.3.
# This file is part of Merlin, Version: 1.6.2.
#
# For details, see https://github.com/LLNL/merlin.
#
Expand Down
18 changes: 0 additions & 18 deletions docs/source/faq.rst
Original file line number Diff line number Diff line change
Expand Up @@ -377,24 +377,6 @@ num resource set: Number of resource sets
launch_distribution : The distribution of resources (default: plane:{procs/nodes})
lsf: Verbatim flags only for the lsf parallel launch (jsrun ... <lsf>)

What is level_max_dirs?
~~~~~~~~~~~~~~~~~~~~~~~
``level_max_dirs`` is an optional field that goes under the ``merlin.samples`` section
of a yaml spec. It caps the number of sample directories that can be generated
at a single level of a study's sample hierarchy. This is useful for getting around
filesystem constraints when working with massive amounts of data.

Defaults to 25.

What is pgen?
~~~~~~~~~~~~~
``pgen`` stands for "parameter generator". It's a way to override the parameters in the
``global.parameters`` spec section, instead generating them programatically with a python script.
Merlin offers the same pgen functionality as Maestro.

See `this guide <https://maestrowf.readthedocs.io/en/latest/parameters.html#parameter-generator-pgen>`_ for details on using ``pgen``.
It's a Maestro doc, but the exact same flags can be used in conjunction with ``merlin run``.

Where can I learn more about merlin?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Check out `our paper <https://arxiv.org/abs/1912.02892>`_ on arXiv.
1 change: 0 additions & 1 deletion docs/source/merlin_specification.rst
Original file line number Diff line number Diff line change
Expand Up @@ -290,4 +290,3 @@ see :doc:`./merlin_variables`.
generate:
cmd: |
python $(SPECROOT)/make_samples.py -dims 2 -n 10 -outfile=$(INPUT_PATH)/samples.npy "[(1.3, 1.3, 'linear'), (3.3, 3.3, 'linear')]"
level_max_dirs: 25
61 changes: 4 additions & 57 deletions docs/source/merlin_variables.rst
Original file line number Diff line number Diff line change
Expand Up @@ -21,10 +21,8 @@ The directory structure of merlin output looks like this:
OUTPUT_PATH
MERLIN_WORKSPACE
MERLIN_INFO
<name>.orig.yaml
<name>.partial.yaml
<name>.expanded.yaml
<step_name>.workspace
<provenance_spec.yaml>
<other_step_name>.workspace
WORKSPACE
Expand Down Expand Up @@ -54,7 +52,7 @@ Reserved variables
- The workspace directory for a single step.
- ``$(OUTPUT_PATH)/ensemble_name_$(MERLIN_TIMESTAMP)/step_name/``
* - ``$(MERLIN_INFO)``
- Directory within ``MERLIN_WORKSPACE`` that holds the provenance specs and sample generation results.
- Directory within ``MERLIN_WORKSPACE`` that holds a provenance spec.
Commonly used to hold ``samples.npy``.
- ``$(MERLIN_WORKSPACE)/merlin_info/``
* - ``$(MERLIN_SAMPLE_ID)``
Expand Down Expand Up @@ -93,68 +91,17 @@ Variables defined by a specification file in the ``env`` section, as in this exa
env:
variables:
ID: 42
EXAMPLE_VAR: hello
As long as they're defined in order, you can nest user variables like this:
.. code-block:: yaml
env:
variables:
EXAMPLE_VAR: hello
WORKER_NAME: $(EXAMPLE_VAR)_worker
Like all other Merlin variables, user variables may be used anywhere (as a yaml key or value) within a specification as below:
Like all other Merlin variables, these may be used within specification steps as below:

.. code-block:: yaml
cmd: echo "$(EXAMPLE_VAR), world!"
...
$(WORKER_NAME):
args: ...
If you want to programmatically define the study name, you can include variables
in the ``description.name`` field as long as it makes a valid filename:

.. code-block:: yaml
description:
name: my_$(EXAMPLE_VAR)_study_$(ID)
description: example of programmatic study name
The above would produce a study called ``my_hello_study_42``.
Environment variables
---------------------
Merlin expands Unix environment variables for you. The values of the user variables below would be expanded:
.. code-block:: yaml
env:
variables:
MY_HOME: ~/
MY_PATH: $PATH
USERNAME: ${USER}
However, Merlin leaves environment variables found in shell scripts (think ``cmd`` and ``restart``) alone.
So this step:

.. code-block:: yaml
- name: step1
description: an example
run:
cmd: echo $PATH ; echo $(MY_PATH)
...would be expanded as:

.. code-block:: yaml
- name: step1
description: an example
run:
cmd: echo $PATH ; echo /an/example/:/path/string/
Step return variables
-----------------------------------
Expand Down
2 changes: 1 addition & 1 deletion docs/source/modules/hello_world/hello_world.rst
Original file line number Diff line number Diff line change
Expand Up @@ -188,7 +188,7 @@ The whole file tree looks like this:

A lot of stuff, right? Here's what it means:

* The 3 yaml files inside ``merlin_info/`` are called the provenance specs. They are copies of the original spec that was run, some showing under-the-hood variable expansions.
* The yaml file inside ``merlin_info/`` is called the provenance spec. It's a copy of the original spec that was run.

* ``MERLIN_FINISHED`` files indicate that the step ran successfully.

Expand Down
6 changes: 3 additions & 3 deletions docs/source/modules/installation/installation.rst
Original file line number Diff line number Diff line change
Expand Up @@ -94,7 +94,7 @@ can also be used for the celery broker. This method will be called local-redis.
.. code-block:: bash
# Download redis
wget http://download.redis.io/releases/redis-6.0.5.tar.gz
wget http://download.redis.io/releases/redis-5.0.7.tar.gz
# Untar
tar xvf redis*.tar.gz
Expand Down Expand Up @@ -361,8 +361,8 @@ server and python.
mkdir -p ${HOME}/merlinu/cert_redis
cp result/* ${HOME}/merlinu/cert_redis
The configuration below does not use client
The ``redis:6.0-rc2`` docker service is exchanged for the previous
``redis:latest`` service. The configuration below does not use client
verification ``--tls-auth-clients no`` so the ssl files do not need to
be defined as shown in the :ref:`broker_redis_ssl` section.

Expand Down
4 changes: 2 additions & 2 deletions merlin/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
#
# LLNL-CODE-797170
# All rights reserved.
# This file is part of Merlin, Version: 1.7.3.
# This file is part of Merlin, Version: 1.6.2.
#
# For details, see https://github.com/LLNL/merlin.
#
Expand Down Expand Up @@ -38,7 +38,7 @@
import sys


__version__ = "1.7.3"
__version__ = "1.6.2"
VERSION = __version__
PATH_TO_PROJ = os.path.join(os.path.dirname(__file__), "")

Expand Down
2 changes: 1 addition & 1 deletion merlin/ascii_art.py
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
#
# LLNL-CODE-797170
# All rights reserved.
# This file is part of Merlin, Version: 1.7.3.
# This file is part of Merlin, Version: 1.6.2.
#
# For details, see https://github.com/LLNL/merlin.
#
Expand Down
76 changes: 41 additions & 35 deletions merlin/celery.py
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
#
# LLNL-CODE-797170
# All rights reserved.
# This file is part of Merlin, Version: 1.7.3.
# This file is part of Merlin, Version: 1.6.2.
#
# For details, see https://github.com/LLNL/merlin.
#
Expand Down Expand Up @@ -40,10 +40,10 @@
from celery.signals import worker_process_init

import merlin.common.security.encrypt_backend_traffic
from merlin.config import broker, celeryconfig, results_backend
from merlin.config import broker, results_backend
from merlin.config.configfile import CONFIG
from merlin.log_formatter import FORMATS
from merlin.router import route_for_task
from merlin.utils import nested_namespace_to_dicts


LOG = logging.getLogger(__name__)
Expand All @@ -54,59 +54,65 @@
results_ssl = False
try:
BROKER_URI = broker.get_connection_string()
LOG.debug(f"broker: {broker.get_connection_string(include_password=False)}")
LOG.info(f"broker: {broker.get_connection_string(include_password=False)}")
broker_ssl = broker.get_ssl_config()
LOG.debug(f"broker_ssl = {broker_ssl}")
LOG.info(f"broker_ssl = {broker_ssl}")
RESULTS_BACKEND_URI = results_backend.get_connection_string()
results_ssl = results_backend.get_ssl_config(celery_check=True)
LOG.debug(
LOG.info(
f"results: {results_backend.get_connection_string(include_password=False)}"
)
LOG.debug(f"results: redis_backed_use_ssl = {results_ssl}")
LOG.info(f"results: redis_backed_use_ssl = {results_ssl}")
except ValueError:
# These variables won't be set if running with '--local'.
BROKER_URI = None
RESULTS_BACKEND_URI = None

# initialize app with essential properties

app = Celery(
"merlin",
broker=BROKER_URI,
backend=RESULTS_BACKEND_URI,
broker_use_ssl=broker_ssl,
redis_backend_use_ssl=results_ssl,
task_routes=(route_for_task,),
)

# load merlin config defaults
app.conf.update(**celeryconfig.DICT)

# load config overrides from app.yaml
if (
(not hasattr(CONFIG.celery, "override"))
or (CONFIG.celery.override is None)
or (len(nested_namespace_to_dicts(CONFIG.celery.override)) == 0)
):
LOG.debug("Skipping celery config override; 'celery.override' field is empty.")
else:
override_dict = nested_namespace_to_dicts(CONFIG.celery.override)
override_str = ""
i = 0
for k, v in override_dict.items():
if k not in str(app.conf.__dict__):
raise ValueError(f"'{k}' is not a celery configuration.")
override_str += f"\t{k}:\t{v}"
if i != len(override_dict) - 1:
override_str += "\n"
i += 1
LOG.info(
f"Overriding default celery config with 'celery.override' in 'app.yaml':\n{override_str}"
)
app.conf.update(**override_dict)

# auto-discover tasks
app.conf.update(
task_serializer="pickle", accept_content=["pickle"], result_serializer="pickle"
)

app.autodiscover_tasks(["merlin.common"])

app.conf.update(
task_acks_late=True,
task_reject_on_worker_lost=True,
task_publish_retry_policy={
"interval_start": 10,
"interval_step": 10,
"interval_max": 60,
},
redis_max_connections=100000,
)

# Set a timeout to acknowledge a task before it's available to grab
# again (default 24 hours).
app.conf.broker_transport_options = {
"visibility_timeout_seconds": CONFIG.celery.visibility_timeout_seconds,
"max_connections": 100,
}

app.conf.update(broker_pool_limit=0)

# Task routing: call our default queue merlin
app.conf.task_routes = (route_for_task,)
app.conf.task_default_queue = "merlin"

# Log formatting
app.conf.worker_log_color = True
app.conf.worker_log_format = FORMATS["DEFAULT"]
app.conf.worker_task_log_format = FORMATS["WORKER"]


@worker_process_init.connect()
def setup(**kwargs):
Expand Down
2 changes: 1 addition & 1 deletion merlin/common/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
#
# LLNL-CODE-797170
# All rights reserved.
# This file is part of Merlin, Version: 1.7.3.
# This file is part of Merlin, Version: 1.6.2.
#
# For details, see https://github.com/LLNL/merlin.
#
Expand Down
2 changes: 1 addition & 1 deletion merlin/common/abstracts/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
#
# LLNL-CODE-797170
# All rights reserved.
# This file is part of Merlin, Version: 1.7.3.
# This file is part of Merlin, Version: 1.6.2.
#
# For details, see https://github.com/LLNL/merlin.
#
Expand Down
Loading

0 comments on commit 103fb8b

Please sign in to comment.