Skip to content

The Deidentify app uses MedCAT to automatically redact or replace sensitive information in documents. Customisable rules and batch processing make it easy to handle large datasets.

License

Notifications You must be signed in to change notification settings

antsh3k/deidentify

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AnonCAT logo

Deidentify app

The Deidentify app is a tool for redacting and replacing sensitive information in a document. It uses MedCAT, an advanced natural language processing tool, to identify and classify sensitive information, such as names, addresses, and medical terms.

Example

AnonCAT Screenshot

Features

  • Redact sensitive information: The Deidentify app can automatically redact sensitive information from a document, replacing it with a placeholder value, such as "[REDACTED]".
  • Replace sensitive information: Alternatively, the app can replace sensitive information with a different value, such as a random name or address, to maintain the structure and context of the original document.
  • Add customizable rules: The app allows users to create custom rules for identifying and classifying sensitive information based on their specific needs and use cases.
  • Batch processing: The app can process multiple documents at once, making it easy to redact or replace sensitive information in large datasets.

DeID Model

For out of the box models please contact: contact@cogstack.org

  • Step 1: Upload your model to the models dir.

  • Step 2: In the file .env change the MODEL_NAME parameter to the file name of your model. Save the changes you made to the .env file.

MODEL_NAME = '<NAME OF MODEL HERE.zip>'

Build your own model

To build your own models please follow the tutorials outlined in MedCATtutorials

Note: This is currently under development

Starting the demo service

Start the Docker services by using docker-compose. This will build the necessary Docker images and start the services.

docker-compose up

Once the services are up and running, you can access the Deidentify app in your web browser by navigating to http://localhost:8002.

Setup from source

  • Install dependencies:
pip install -r requirements.txt
  • Set up the database:
python manage.py migrate

Frontend Dependencies

This project uses frontend dependencies managed by npm. To install them, follow these steps:

  1. Make sure you have Node.js and npm installed on your system.

  2. Navigate to the project's app directory.

  3. Run the following command to install the frontend dependencies:

    npm install
    npm run build

Run the app:

python manage.py runserver 8002

About

The Deidentify app uses MedCAT to automatically redact or replace sensitive information in documents. Customisable rules and batch processing make it easy to handle large datasets.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published