Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a utility that extracts the licenses based on a code repo URL #10

Closed
npch opened this issue Apr 1, 2021 · 4 comments · Fixed by #24
Closed

Create a utility that extracts the licenses based on a code repo URL #10

npch opened this issue Apr 1, 2021 · 4 comments · Fixed by #24
Assignees

Comments

@npch
Copy link
Member

npch commented Apr 1, 2021

Should be able to get license just from the GitHub API

@ha0ye ha0ye self-assigned this Apr 1, 2021
@ha0ye
Copy link
Collaborator

ha0ye commented Apr 1, 2021

I can handle this, from past experience interfacing with the github api in R.

Now in branch "license-from-github-url"

Specific implementation:

  1. read in data/output/CORD19_sampled_with_repos.csv
  2. look up github license (bonus: other API info, e.g. contributors list, references)
  3. write out data/output/CORD19_sampled_with_repos_with_github-metadatada.csv

@ha0ye
Copy link
Collaborator

ha0ye commented Apr 1, 2021

Basic functionality is set now, and the output data is updated.

There are some minor squiggles to resolve, e.g. in one case, the organization github was linked, and not a specific repo.

@sdruskat
Copy link
Collaborator

sdruskat commented Apr 7, 2021

Great, I think in order to close this issue, we should just merge the license info back into a version of the dataset file, e.g., based on CORD19_software_popularity_sampled_QA_DOI.csv?

@ha0ye
Copy link
Collaborator

ha0ye commented Apr 8, 2021

@sdruskat I'll make a new PR to run the new file through my script.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants