Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data Overview, Schema, and Dictionary Development #158

Open
eekevall opened this issue Sep 29, 2023 · 1 comment
Open

Data Overview, Schema, and Dictionary Development #158

eekevall opened this issue Sep 29, 2023 · 1 comment
Assignees

Comments

@eekevall
Copy link

Challenge Brief:

There is a need to create a comprehensive data overview, schema, and dictionary for the Greenstand project. This will streamline the data management process, ensure consistency across the project, and improve the overall quality of the data; ultimately leading to better outcomes for the project.

The goal of this ticket is to establish a clear understanding of the data Greenstand generates, its structure, relationships, and formats, as well as to prepare the data for use in various projects, including the Northeastern University partnership.

The scope of this task includes the following:

  • Developing a data overview that summarizes the key characteristics of our data, including the number of records, fields, data types, and any relevant metadata.
  • Creating a schema that defines the structure of our data, including the relationships between tables, fields, and data types.
  • Building a dictionary that defines the meaning and format of each field in our data, along with any validation rules or constraints.
  • Preparing the data for use in various projects by cleaning, transforming, and formatting it according to project requirements.

Deliverables:

  • A comprehensive data overview document that summarizes the key characteristics of our data.
  • A schema diagram that illustrates the structure of our data and relationships between tables, fields, and data types.
  • A dictionary that defines the meaning and format of each field in our data, along with any validation rules or constraints.
  • Data preparation scripts or tools that can be used to clean, transform, and format the data for use in various projects.
@Davidezrajay
Copy link
Contributor

Davidezrajay commented Oct 4, 2023

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants