Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SemanticScholar search optimization #94

Open
r-wrobel opened this issue Apr 29, 2024 · 2 comments
Open

SemanticScholar search optimization #94

r-wrobel opened this issue Apr 29, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@r-wrobel
Copy link

r-wrobel commented Apr 29, 2024

Hi there,
if someone perform a search_semanticscholar(query) litstudy executes a request for each search result.
Therefore I suggest using
https://api.semanticscholar.org/graph/v1/paper/search?query=searchString&limit=100&fields=title,authors,year,venue,abstract,citations,references,externalIds to request the complete data for the DocumentSets.
The API Limit is limit+offset<=1,000 results. A disadvantage is the missing numCitedBy field. With len(citations) <= 1,000 the amount derived from this will be limited to 1,000.

If someone needs more results, they can't use the current implementation anyway. The /paper/search/bulk can return 10,000,000 total results with up to 1,000 per request. This bulk search does not contain any information on citation and reference and therefore requires the request_paper() step as before.

@isazi
Copy link
Member

isazi commented May 2, 2024

Interesting, would you like to submit a pull request for this?

@isazi isazi added the enhancement New feature or request label May 2, 2024
@r-wrobel
Copy link
Author

r-wrobel commented May 3, 2024

Sure, but I was only looking for quick solutions to my problems. In addition, I don't yet know all the connections within litstudy. The code quality will be accordingly.
Feel free to cherry-pick the good features from #96

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants