Skip to content

Gremlin is an evolutionary algorithm that discovers weaknesses in machine learning models.

License

Notifications You must be signed in to change notification settings

leap-ec/gremlin

Repository files navigation

Gremlin, an adversarial evolutionary algorithm that discovers biases or weaknesses in machine learners

Gremlin learns where a given machine learner (ML) model performs poorly via an adversarial evolutionary algorithm (EA). The EA will find the worst performing feature sets such that a practitioner can then, say, tune the training data to include more examples of those feature sets. Then the ML model can be trained again with the updated training set in the hopes that the additional examples will be sufficient for the ML to train models that perform better for those sets.

2022 R&D 100 Award Winner Gremlin is a 2022 R&D 100 Award Winner!

Requires

Installation

  1. Activate your conda or virtual environment
  2. cd into top-level gremlin directory
  3. pip install .

Configuration

Gremlin is a thin convenience wrapper around [LEAP] (https://github.com/AureumChaos/LEAP). Instead of writing a script in LEAP, one would instead point the gremlin executable at a YAML file that describes what LEAP classes, subclasses, and functions to use, as well as other salient run-time characteristics. gremlin will parse the YAML file and generate a CSV file containing the individuals from the run. This CSV file should contain information that can be exploited to tune training data.

Example Gremlin configuration YAML:
pop_size: 25
algorithm: async # or bgen
async: # parameters for asynchronous steady-state EA
  init_pop_size: ${pop_size}
  max_births: 2000
  ind_file: inds.csv # optional file for writing individuals as they are evaluated
  ind_file_probe: probe.log_ind # optional functor or function for writing ind_file

pop_file: pop.csv # where we will write out each generation in CSV format
problem: problem.QLearnerBalanceProblem("${oc.env:GREMLIN_QLEARNER_CARTPOLE_MODEL_FPATH}")
representation: representation.BalanceRepresentation()
preamble: |
  import probe # need to import our probe.py so that LEAP sees our probe pipeline operator
pipeline: # isotropic means we mutate all genes with the given stds
  - ops.random_selection
  - ops.clone
  - mutate_gaussian(expected_num_mutations='isotropic', std=[0.1, 0.001, 0.01, 0.001], hard_bounds=representation.BalanceRepresentation.genome_bounds)
  - ops.pool(size=1)
Essentially, you will have to define the following
  • A LEAP Representation subclass, as denoted in the above config file
    • This, in turn, will mean defining a LEAP Decoder and optionally a bespoke LEAP Individual subclass.
      • The latter is mostly to override a __str__() member function for pretty printing.
    • We also suggest using a python named tuple to define the genes in the phenotype as a convenience. (See examples/MNIST/representation.py)
  • A LEAP Problem subclass.
    • This is the meat of defining your problem for Gremlin to tackle.
    • This subclass may be responsible for loading your models and then applying them as denoted by genes values per individual.
    • E.g., the examples/MNIST/representation.py dictates that each individual has a single gene that represents a given digit, and the corresponding problem.py defines an evaluate() function that decodes a given individuals digit as denoted in its gene, and then checks to see how well the MNIST model predicts for that digit across a test dataset.
    • Similarly you will need to define a LEAP Problem subclass that is responsible for loading datasets and models and then predicting how effective they are for a given feature represented by a single individual.

Examples

Example code and configuration for a real problem can be found in examples/MNIST. This problem involves Gremlin discovering that one of the digits for the MNIST training data is poorly represented.

This can be run simply by (must be in examples/MNIST directory):

$ gremlin config.yml

Documentation

The wiki has more detailed documentation, particularly on how the YAML config files can be set up for Gremlin runs.

Versions

Note that more detailed explanations for version changes can be found in the CHANGELOG.

  • v0.6, in progress on develop
  • v0.5, 2/3/23
    • Main installed executable now gremlin and not gremlin.py. Added optional async.with_client config section. Improvements made to setup. py.
  • v0.4, 9/30/22
    • Added config variable async.with_client that allows for interacting with Dask before the EA runs; e.g., client.wait_for_workers() or client.upload_file()
    • Replaced imports with preamble in YAML config files thus giving more flexibility for importing dependencies, but also allows for defining functions and variables that may be referred to in, say, the pipeline.
  • v0.3, 3/9/22
    • Add support for config variable algorithm that denotes if using a traditional by-generation EA or an asynchronous steady-state EA
  • v0.2dev, 2/17/22
    • revamped config system and heavily refactored/simplified code
  • v0.1dev, 10/14/21
    • initial raw release

Sub-directories

  • gremlin/ -- main gremlin code
  • examples/ -- examples for using gremlin; currently only has MNIST example

Main web site

The gremlin github repository is https://github.com/markcoletti/gremlin.
main is the release branch and active work occurs on the develop branch.

About

Gremlin is an evolutionary algorithm that discovers weaknesses in machine learning models.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages