Automated Threat Models

Warning

The main functionality relating to automated threat models in continuous assurance will be deprecated here in favour of the Rust implementation run as an Azure function in the CA project

Threat modelling can become time consuming, and doing a thorough threat model for every single service is not scalable when taking into account security resources and the number of services in operation at DfE.

This repo attempts to produce a working example of how one could partly automate the threat modelling process for some of our lower risk services.

As part of secure by design principles we should be:

running threat models for every service
producing the results and reporting them

This example is based on a real service called service-security-posture-hardening (run as an internal tool by CISD).

The main tool used is called threagile, and it is run as a docker container.

Using GitHub Actions, Docker, and Splunk, we can continuously watch the yaml file that holds the data for your threat model, and when changes are made, they produce new reports that can be ingested into Splunk.

This method:

removes the need for a security resource and long sessions
ensures the data can be kept up to date when mitigations are implemented, allowing it to be tracked effectively
gives a basic understanding of potential threats a system might face by producing output based on information given in a yaml file
produces data flow diagrams that may be useful not just in threat modelling, but also architectural decisions and audit requirements

Getting started with an initial threat model YAML

Pre-requisites (local)

Before getting started, you will need to:

install docker
pull the container image from GitHub Packages by:
- pulling the image: docker pull ghcr.io/dfe-digital/automated-threat-models:main
data from Splunk regarding your Azure resources including: name, kind, type (s1XXX-functionX, functionapp, microsoft.web/sites) note that this is to be automated in the next iteration

Automated threat models (DfE)

This project's main purpose is to enable DfE to run continuous automated threat models, which data can be ingested by the continuous assurance platform.

The automated threat models will be kept in the continuous assurance private repo, which will query this project.

The automation is based on dfe-threagile being able to read data regarding Azure resources in Splunk.

Basic usage:

Run DfE automation

$ docker run --rm -it -v "$(pwd)":/app/work --entrypoint python dfe-digital/automated-threat-models dfe_threagile.py

Run with continuous assurance file for data asset retrieval

$ docker run --rm -it -v  "$(pwd)":/app/work --entrypoint python dfe-digital/automated-threat-models dfe_threagile.py --ssphp-yaml "path-to-yaml"

Run threagile against output

$ docker run --rm -it -v "$(pwd)":/app/work dfe-digital/automated-threat-models -verbose -model "path to model" -output /app/work

GitHub Actions

on:
  push:
    paths:
      - 'threagile.yaml' # useful to filter this job to execute only when the threat model changes

jobs:

  threagile_job:
    runs-on: ubuntu-latest
    name: Threat Model Analysis
    steps:
      
      # Checkout the repo
      - name: Checkout Workspace
        uses: actions/checkout@v4
     
      # Run Threagile
      - name: Run Threagile
        id: threagile
        uses: pritchyspritch/run-threagile-action@v2
        with:
          model-file: 'threagile.yaml' # for threagile only
          output-dir: 'put/files/here' # default: threagile/output - for threagile only
          optional-args: '-create-example-model' # optional args for whichever command you wish to run
          dfe_threagile: true/false # set whether to run the dfe python code for automated tm or to run threagile
     
      # Archive resulting files as artifacts
      - name: Archive Results
        uses: actions/upload-artifact@v4
        with:
          name: threagile-report
          path: threagile/output
     
      # Optional step to link from repo's README.md if you want. This can also be committed to a separate branch if desired.
      - name: Commit & Push Report and DFD Diagram
        run: |
          git config --local user.email "threagile@example.com" # customize as desired
          git config --local user.name "Threagile" # customize as desired
          git add threagile/output/report.pdf
          git add threagile/output/data-flow-diagram.png
          git commit -m "Update threat model report and data-flow diagram by Threagile" # customize as desired
          git push

Create a stub model and example model (manual)

It's advisable to create a stub and example model from threagile to give you a framework YAML file to work with and a thorough example with hints you can copy.

Create a stub

$ docker run --rm -it -v "$(pwd)":/app/work threagile/threagile -create-stub-model -output /app/work

Create an example

$ docker run --rm -it -v "$(pwd)":/app/work threagile/threagile -create-example-model -output /app/work

Use a schema or template in your IDE

$ docker run --rm -it -v "$(pwd)":/app/work threagile/threagile -create-editing-support -output /app/work

Using hints, comments and macros

When writing your YAML file, there will be fields you need to fill that must have a value from a limited list, or they should link to another object you've created (linked by the id field).

In order to assist you, there are a few things that can help you, such as:

comments on many of the lines in the example model
type values list for each field that you can check by running $ docker run --rm -it threagile/threagile -list-types
macros that you can run, that will take you through a wizard to produce a objects in your YAML based on your answers

The macros available are for:

----------------------
Built-in model macros:
----------------------
add-build-pipeline --> Add Build Pipeline
add-vault --> Add Vault
pretty-print --> Pretty Print
remove-unused-tags --> Remove Unused Tags
seed-risk-tracking --> Seed Risk Tracking
seed-tags --> Seed Tags

The "Add Build Pipeline" is especially useful, as it will allow you to add all your assets relating to GitHub, Azure Devops, Kubernetes, Docker Hub, packages etc.

To run a macro you will need to run:

$ docker run --rm -it -v "$(pwd)":/app/work threagile/threagile -model /app/work/threagile.yaml -output /app/work -execute-model-macro <NAME>

Filling in the YAML with your system data

Start by filling in the metadata at the top with information about the service and yourself. Ensure you add useful tags, and that you keep tags consistent throughout the doc, so if you put azure-function-app in your main list of tags, when creating a technical asset that's a function app, use the same tag.

Then work your way down the YAML. The YAML consists of:

initial metadata
data assets (e.g. actual data you hold: keys, school records, usernames)
technical assets (e.g. function app, github repo, CI/CD, K8s)
- technical assets are large objects, so ensure you refer to the comments in your example model
- there are communication links attached that are used to build the data flows
trust boundaries (e.g. network boundaries, different cloud hosts - github, azure, DfE network)
shared runtimes (e.g. k8s)
individual risk categories (this could be a risk that you think can't be linked to assets you've listed, but something you know you're concerned about)
risk tracking (this is where you track your risks and mitigations)

Make sure to refer to your example, as well as the model produced in this repository (which is a real world DfE app). This will be crucial in understanding how to build your own

Producing your initial threat model output

Once you've produced your YAMl model, you will need to run the command shown below to produce your outputs:

$ docker pull ghcr.io/dfe-digital/automated-threat-models:nightly
$ docker images ls # get the image ID
$ docker run --rm -it -v "$(pwd)":/app/work <image-id> -verbose -model /app/work/YOUR_YAML.yaml -output /app/work

TODO / Roadmap

example GitHub Action to watch file
use params for python
improve readme
automate more of the yaml building beyond what threagile can do
automate the Azure data ingestion from Splunk

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
.github		.github
example-models		example-models
ide-templates		ide-templates
output		output
tests		tests
yaml-templates		yaml-templates
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
__init__.py		__init__.py
build_data_assets.py		build_data_assets.py
build_tech_assets.py		build_tech_assets.py
dfe_threagile.py		dfe_threagile.py
produce_risk_tracker.py		produce_risk_tracker.py
requirements.txt		requirements.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automated Threat Models

Getting started with an initial threat model YAML

Pre-requisites (local)

Automated threat models (DfE)

Run DfE automation

Run with continuous assurance file for data asset retrieval

Run threagile against output

GitHub Actions

Create a stub model and example model (manual)

Create a stub

Create an example

Use a schema or template in your IDE

Using hints, comments and macros

Filling in the YAML with your system data

Producing your initial threat model output

TODO / Roadmap

About

Releases

Packages

Languages

DFE-Digital/automated-threat-models

Folders and files

Latest commit

History

Repository files navigation

Automated Threat Models

Getting started with an initial threat model YAML

Pre-requisites (local)

Automated threat models (DfE)

Run DfE automation

Run with continuous assurance file for data asset retrieval

Run threagile against output

GitHub Actions

Create a stub model and example model (manual)

Create a stub

Create an example

Use a schema or template in your IDE

Using hints, comments and macros

Filling in the YAML with your system data

Producing your initial threat model output

TODO / Roadmap

About

Resources

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages