Gen3 Data Model

Repo to keep information about the Gen3 data model design.

Installation

Use poetry to install dependencies:

poetry install

Jupyter + Graphviz

It's helpful to examine the relationships between nodes visually. One way to do this is to run a Jupyter notebook with a Python kernel. Combined with Graphviz's SVG support, this lets you view a graphical representation of a subgraph directly in a REPL. To do so, install the dependencies listed in pyproject.toml. An example Jupyter notebook is available at examples/jupyter_example.ipynb (replicated in examples/jupyter_example.py for clarity).

poetry install
PG_USER=* PG_HOST=* PG_DATABASE=* PG_PASSWORD=* poetry run jupyter notebook examples/jupyter_example.ipynb
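The command above passes the database settings through the PG_USER, PG_HOST, PG_DATABASE and PG_PASSWORD environment variables. How the example notebook consumes them is not shown here, so the following is only a minimal sketch of one way a notebook cell might turn them into a PostgreSQL connection URL. The helper name `pg_url_from_env` and the URL layout are assumptions made for illustration, not part of gen3datamodel.

```python
import os
from urllib.parse import quote


def pg_url_from_env(env=None):
    """Build a PostgreSQL URL from the PG_* variables (hypothetical helper)."""
    env = os.environ if env is None else env
    names = ("PG_USER", "PG_PASSWORD", "PG_HOST", "PG_DATABASE")
    # Fail early with a clear message if any variable is unset or empty.
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise KeyError("missing environment variables: " + ", ".join(missing))
    # Percent-encode the credentials so characters like '@' or '/' are safe.
    user = quote(env["PG_USER"], safe="")
    password = quote(env["PG_PASSWORD"], safe="")
    return "postgresql://{}:{}@{}/{}".format(
        user, password, env["PG_HOST"], env["PG_DATABASE"]
    )
```

Passing a plain dict instead of relying on `os.environ` makes the helper easy to try out without exporting real credentials.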

Documentation

Visual representation

For instructions on how to build the Graphviz representation of the data model, see the docs README.
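To give a rough idea of what a Graphviz representation looks like as text, the sketch below renders a list of edges as DOT source using only the standard library. It is not the procedure described in the docs README. The function name `to_dot` and the example node and edge names are hypothetical and chosen only for illustration.

```python
def to_dot(edges):
    """Render (source, label, destination) triples as Graphviz DOT source."""
    lines = ["digraph datamodel {"]
    for src, label, dst in edges:
        # One labeled, directed edge per triple.
        lines.append('  "{}" -> "{}" [label="{}"];'.format(src, dst, label))
    lines.append("}")
    return "\n".join(lines)


# Hypothetical edges; the real node and edge names come from the data model.
example_edges = [
    ("file", "derived_from", "aliquot"),
    ("aliquot", "derived_from", "participant"),
]
print(to_dot(example_edges))
```

The resulting text can be saved to a .dot file and rendered with the Graphviz `dot` command, for example `dot -Tsvg graph.dot -o graph.svg`.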

Dependencies

Before continuing you must have the following programs installed:

Python 3.9

The gen3datamodel library requires the following pip dependencies

Project Dependencies

Project dependencies are managed using Poetry.

Example validation usage

import json

from avro.io import validate
from gen3datamodel import node_avsc_object
from gen3datamodel.mappings import get_participant_es_mapping, get_file_es_mapping

# Load an example node and validate it against the Avro schema object.
with open('examples/nodes/aliquot_valid.json', 'r') as f:
    node = json.load(f)
print(validate(node_avsc_object, node))  # if valid, prints True

print(get_participant_es_mapping())  # Prints participant Elasticsearch mapping
print(get_file_es_mapping())         # Prints file Elasticsearch mapping
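The example above validates a single file. When several node files need checking, the same idea can be wrapped in a loop. The sketch below is one possible way to do that. `validate_directory` is a hypothetical helper, not part of gen3datamodel, and it takes the validation function as an argument so it does not depend on any particular library.

```python
import json
from pathlib import Path


def validate_directory(directory, is_valid):
    """Return {file name: bool} for every *.json file in `directory`.

    `is_valid` is any callable that takes the parsed node and returns
    a truthy value when the node is valid.
    """
    results = {}
    for path in sorted(Path(directory).glob("*.json")):
        with path.open("r") as f:
            node = json.load(f)
        results[path.name] = bool(is_valid(node))
    return results
```

With the package installed, a call along the lines of `validate_directory("examples/nodes", lambda node: validate(node_avsc_object, node))` would reuse the `validate` and `node_avsc_object` names from the example above.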

Example Elasticsearch mapping usage

from gen3datamodel import mappings
print(mappings.get_file_es_mapping())
print(mappings.get_participant_es_mapping())
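Printing a mapping is handy for inspection, but it is often more convenient to keep it in a file. The sketch below writes a mapping to disk as indented JSON. It assumes the mapping functions return a JSON-serializable dict, which is not confirmed here, and `dump_mapping` is a hypothetical helper rather than part of gen3datamodel.

```python
import json


def dump_mapping(mapping, path):
    """Write a mapping dict to `path` as stable, indented JSON."""
    with open(path, "w") as f:
        # sort_keys keeps the output stable between runs, which helps diffs.
        json.dump(mapping, f, indent=2, sort_keys=True)
        f.write("\n")
```

Under that assumption, `dump_mapping(mappings.get_file_es_mapping(), "file_mapping.json")` would save the file mapping next to the current working directory.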

Tests

bash test/ci_commands_script.sh

Contributing

Read how to contribute here

