Get more links for selected nodes #41

Open
naught101 opened this issue Dec 31, 2019 · 2 comments

Comments

@naught101

The current script takes its links from the first paragraph only, which is sometimes not very useful. For example, Dog only returns "Carl Linnaeus" (this might be a bug though, because the first paragraph of https://en.wikipedia.org/wiki/Dog has more links than that).

It would be good to be able to (optionally) pull links from more paragraphs, so that nodes with weak first paragraphs can still be expanded.

Also, I wonder whether it would be better to use the first 3 paragraphs by default. I have a local copy that gets the first three, and it seems to capture a much more representative set of links.
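For illustration, here is a rough sketch of what a configurable paragraph count could look like. This is not the project's actual code or the local copy mentioned above: the names `ParagraphLinkParser`, `links_from_paragraphs` and `max_paragraphs` are made up for this example, and it assumes the article HTML uses explicitly closed `<p>` tags with internal links of the form `/wiki/Title`. It uses only Python's standard-library `html.parser`, and skips paragraphs that contain no text so that an empty leading `<p>` does not use up the quota.

```python
from html.parser import HTMLParser


class ParagraphLinkParser(HTMLParser):
    """Collect article links from the first `max_paragraphs` non-empty <p> elements.

    Hypothetical helper for illustration; assumes <p> tags are explicitly closed.
    """

    def __init__(self, max_paragraphs=3):
        super().__init__()
        self.max_paragraphs = max_paragraphs
        self.in_paragraph = False
        self.has_text = False  # whether the current paragraph contains any text
        self.seen = 0          # non-empty paragraphs fully read so far
        self.links = []

    def handle_starttag(self, tag, attrs):
        if self.seen >= self.max_paragraphs:
            return  # quota reached, ignore the rest of the page
        if tag == "p":
            self.in_paragraph = True
            self.has_text = False
        elif tag == "a" and self.in_paragraph:
            href = dict(attrs).get("href") or ""
            # Keep article links; skip namespaced pages such as File: or Help:
            if href.startswith("/wiki/") and ":" not in href:
                title = href[len("/wiki/"):].split("#")[0]
                if title and title not in self.links:
                    self.links.append(title)

    def handle_data(self, data):
        if self.in_paragraph and data.strip():
            self.has_text = True

    def handle_endtag(self, tag):
        if tag == "p" and self.in_paragraph:
            self.in_paragraph = False
            if self.has_text:
                self.seen += 1  # empty paragraphs do not count


def links_from_paragraphs(html, max_paragraphs=3):
    """Return de-duplicated link titles, in order of first appearance."""
    parser = ParagraphLinkParser(max_paragraphs)
    parser.feed(html)
    parser.close()
    return parser.links
```

Calling `links_from_paragraphs(page_html, 1)` would mimic the current first-paragraph behaviour, while the default of 3 matches the suggestion above; exposing the count as a parameter keeps the choice optional.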

@naught101
Author

Even better, obviously, would be a way of ranking links by importance, but I'm not sure that Wikipedia's data structure allows that kind of analysis.

@controversial
Owner

controversial commented Dec 31, 2019 via email


2 participants