create definitive source of various datasets #1

Open
jennybc opened this issue Oct 9, 2014 · 2 comments
@jennybc
Owner

jennybc commented Oct 9, 2014

As I go through the figures, it's becoming clear that we need a single definitive source for several datasets.

Example: the museum visit length data from Fig 4.10 + others.

It needs to exist in a single location, probably in both short and tall form.

This underscores how nice it would be to create a companion nbr or cmeg data package, even if it's data we have simulated that behaves like the original data we see in these figures.

Then, instead of copying the data around to multiple figure directories, we could just load it with a call to library(cmeg).

Side bonus: we could set factor levels sensibly.
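
A minimal sketch of what "short and tall form" plus sensible factor levels could look like, using base R only. Everything here is an assumption for illustration: the object names `visits_short` and `visits_tall`, the column names, and the numbers are hypothetical stand-ins, not the actual museum visit data.

```r
# Hypothetical stand-in data in short (wide) form: one row per visitor,
# one column of visit length in minutes per exhibit size.
visits_short <- data.frame(
  visitor = 1:4,
  small   = c(12, 8, 15, 10),
  medium  = c(25, 30, 22, 28),
  large   = c(48, 75, 55, 90)
)

# Tall (long) form: one row per visitor-by-exhibit measurement.
visits_tall <- reshape(
  visits_short,
  direction = "long",
  varying   = c("small", "medium", "large"),
  v.names   = "minutes",
  timevar   = "exhibit",
  times     = c("small", "medium", "large"),
  idvar     = "visitor"
)
rownames(visits_tall) <- NULL

# Set factor levels explicitly; the default alphabetical order would be
# large, medium, small, which is rarely what a figure wants.
visits_tall$exhibit <- factor(
  visits_tall$exhibit,
  levels = c("small", "medium", "large")
)

str(visits_tall)
```

If both objects were stored once in the data package, each figure script could load them from there rather than reshaping and releveling its own copy.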

@jzhaoo
Collaborator

jzhaoo commented Oct 10, 2014

So would all datasets be put into a cmeg data package or only the ones used more than once?

@jennybc
Owner Author

jennybc commented Oct 10, 2014

I suggest you extend your existing "data availability" table and report to cover this. Right now we know which figures rely on which datasets.

Yes, how many figures use the data is important. So is the amount of reshaping, factor fussing, etc.

The default setting should be "yes, include in the data package".
