Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve sample caller #1148

Open
flooie opened this issue Aug 30, 2024 · 2 comments
Open

Improve sample caller #1148

flooie opened this issue Aug 30, 2024 · 2 comments
Assignees

Comments

@flooie
Copy link
Contributor

flooie commented Aug 30, 2024

We should update the sample caller to include the ability to use the extract from text functions that are rapidly becoming more common in our scrapers

grossir added a commit to grossir/juriscraper that referenced this issue Sep 4, 2024
Solves freelawproject#1148

Extracts document's content mimicking Courtlistener's workflow, for easier and more complete testing of Juriscraper's scrapers

- Adds optional arguments: `--extract-content` and `--doctor-host`
- Executes Site.extract_from_text and prints extracted metadata
- Saves to /tmp/ the extracted content for visual debugging
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

No branches or pull requests

3 participants
@flooie @grossir and others