Add independent expenditure scraper #30

Merged: 1 commit, Sep 23, 2024

antidipyramid
Contributor

Overview

See title.

Closes #27

Testing instructions

Run python -m scrapers.financial_disclosure.scrape_independent_expenditures



class IndependentExpenditureScraper(scrapelib.Scraper):
    election_years = ("2021", "2022", "2023", "2024")
Contributor Author

I used the same election years as in scrape_offices.py.
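
One way to keep the two scrapers' election years in sync is a single shared constant. This is only a sketch: the name ELECTION_YEARS and both class names below are hypothetical, and the PR does not say whether the values are actually shared or simply repeated in each file.

```python
# Hypothetical shared constant. The PR only states that the values match
# those in scrape_offices.py, not how (or whether) they are shared.
ELECTION_YEARS = ("2021", "2022", "2023", "2024")


class OfficeScraperSketch:
    """Stand-in for the scraper in scrape_offices.py (name is assumed)."""

    election_years = ELECTION_YEARS


class IndependentExpenditureScraperSketch:
    """Stand-in for the scraper added in this PR (scrapelib base omitted)."""

    election_years = ELECTION_YEARS
```

With this layout, adding a new election year would mean editing one tuple instead of two files.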

@antidipyramid antidipyramid marked this pull request as ready for review September 12, 2024 18:54
Member

@hancush hancush left a comment

This looks legit to me, @antidipyramid! How many expenditure records does it create, and how does that compare to the number of expenditures in the portal? If it's close, can you go ahead and send the data file to Marjorie and ask her for feedback on the format, and ask whether she notices any records or important data points missing?
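
The count comparison asked for above could be sketched roughly as follows. This assumes the scraper's output is a CSV file with a header row, which the thread does not confirm, and that the portal total is read off the portal by hand; the function names and the sample rows are made up for illustration.

```python
import csv
import io


def count_records(csv_file):
    """Count data rows in an open CSV file, excluding the header row."""
    reader = csv.DictReader(csv_file)
    return sum(1 for _ in reader)


def coverage(scraped_count, portal_count):
    """Return scraped records as a fraction of the portal's reported total."""
    if portal_count <= 0:
        raise ValueError("portal_count must be positive")
    return scraped_count / portal_count


# Made-up sample data standing in for the scraper's real output file.
sample = io.StringIO("filer,amount\nPAC A,100.00\nPAC B,250.50\n")
scraped = count_records(sample)

# The portal total (2 here) is a placeholder, not a real figure.
print(f"scraped={scraped}, coverage={coverage(scraped, 2):.0%}")
```

A coverage well below 100% would suggest missing records worth investigating before the data file is sent out for feedback.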

@antidipyramid
Contributor Author

@hancush We didn't get any comments or questions, so it seems like Marjorie is OK with the format.

@hancush
Member

hancush commented Sep 23, 2024

@antidipyramid I'm good for this to come in. You can address any data issues Marjorie reports separately.

@antidipyramid antidipyramid merged commit 089c730 into main Sep 23, 2024
Development

Successfully merging this pull request may close these issues.

Scrape independent expenditure data
2 participants