Restructure code, Data Package 2.0, Documentation Site, etc. #84

dtemkin-volpe · 2024-07-17T22:08:14Z

Sorry for the big pull request! This request has a few changes, including:

Align with the Data Package 2.0 specification
- The main changes are adding the fieldsMatch entry to allow for end-users to have additional columns (eg. for relevant notes), clarify that every resource is a table, and add $schema (which replaces profile), and changes the enum constraints to use the new categories field (this makes the documentation easier to read, but changes the DB's schema from using VARCHAR to TEXT, so feedback on whether I should change this back would be appreciated)
- I'm hoping the frictionless-py package is updated soon to take advantage of the new features, but for right now, everything is (mostly) backwards compatible.
Reformat the scripts code to ensure that code is not run multiple times and to make more efficient (instead of just having disparate functions, put everything in a class and import that between files)
Add an empty string to the missingValues field for every table, mainly for CSV processors to recognize that an empty column is a null value.
Fix Specification Clarification for node_id and zone_id in node.csv #83 by adding an id_type entry in the config table, that must either be "string" or "integer". This would tell any software using GMNS how to interpret the id fields in other tables.
Fix errors in documentation files
Create a MkDocs site that uses the automatically generated documentation that will be hosted at the repository's GitHub Pages site
Add Dependabot version updates, for Python packages and GitHub Actions

Let me know if there are any issues with this or if anyone wants any changes.

…develop

Test changing the id type

ssmith55 · 2024-07-18T11:08:04Z

From what I can see, it looks good. As a test, I'd like to run the workflows. Something like:
Test #1: change the ID from text to integer. See what happens to the markdown and generated sql. What field type for ID appears in the markdown (ID to match the json, or would it be integer or test?)
Test #2: change something else. Make sure we have not broken what we have done before
Finally, in our Cambridge Intersection example, double check that the field_type in the config file is consistent with the ID field types in the actual data.

…develop

dtemkin-volpe · 2024-07-18T14:09:02Z

I think the idea was that people could define what types the id fields would be in their config.csv file, rather than in the json spec files (so the markdown documentation or generated SQL wouldn't change). Because the config.csv file isn't really required, the id fields are defined as an any type in the spec, and downstream software could then enforce whatever is set in the config file. I enabled the workflow on my end, let me try changing something and make sure everything works still.

ssmith55 · 2024-07-18T14:41:09Z

Yes, that makes sense. Thank you. Maybe we can create two versions of the template SQLite database, one with INT keys and the other with TEXT keys.

Update Spec Documentation

Fix free_speed description

Update Spec Documentation

ssmith55 · 2024-07-19T13:00:05Z

I did a little test, and all seems to be well. Diego, I'm ok with merging if you feel it is ready. Thanks.

dtemkin-volpe added 10 commits July 2, 2024 12:50

restructure to avoid running code multiple times

5e86d6a

Merge branch 'develop' of https://github.com/dtemkin-volpe/GMNS into …

a1a0b41

…develop

Merge branch 'zephyr-data-specs:develop' into develop

0b858af

could theoretically be used by others?

38e38b1

add example validation

c8dfa57

align with data package 2.0

e223047

add id_type field

1409643

only generate db whenever schema changes

ad6fdae

add mkdocs site

32b9e3a

add dependabot

ea00982

dtemkin-volpe requested review from ssmith55 and ianberg-volpe July 17, 2024 22:08

dtemkin-volpe self-assigned this Jul 17, 2024

dtemkin-volpe requested a review from a team as a code owner July 17, 2024 22:08

Update config.schema.json

c946cd8

Test changing the id type

dtemkin-volpe added 2 commits July 18, 2024 09:32

remove commented out code from mkdocs config

bd93cc4

Merge branch 'develop' of https://github.com/dtemkin-volpe/GMNS into …

79c5441

…develop

dtemkin-volpe added 4 commits July 18, 2024 10:11

set id_type spec to string type

e4b0196

making sure everything works...

dc267b7

fix...

398cce2

add more details to id_type description

a2d4c96

dtemkin-volpe and others added 6 commits July 18, 2024 11:00

lets try this one more time...

ac8ba70

Update documentation

fcdd109

Merge pull request #16 from dtemkin-volpe/autogenerated-docs

474b330

Update Spec Documentation

Update segment_tod.schema.json

33764f7

Fix free_speed description

Update documentation

760f2f7

Merge pull request #17 from dtemkin-volpe/autogenerated-docs

06b94ef

Update Spec Documentation

dtemkin-volpe merged commit f3bd462 into zephyr-data-specs:develop Jul 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restructure code, Data Package 2.0, Documentation Site, etc. #84

Restructure code, Data Package 2.0, Documentation Site, etc. #84

dtemkin-volpe commented Jul 17, 2024

ssmith55 commented Jul 18, 2024

dtemkin-volpe commented Jul 18, 2024

ssmith55 commented Jul 18, 2024

ssmith55 commented Jul 19, 2024

Restructure code, Data Package 2.0, Documentation Site, etc. #84

Restructure code, Data Package 2.0, Documentation Site, etc. #84

Conversation

dtemkin-volpe commented Jul 17, 2024

ssmith55 commented Jul 18, 2024

dtemkin-volpe commented Jul 18, 2024

ssmith55 commented Jul 18, 2024

ssmith55 commented Jul 19, 2024