Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Things that are not getting parsed but could be if desirable #10

Open
LindsayYoung opened this issue Feb 3, 2014 · 2 comments
Open

Comments

@LindsayYoung
Copy link
Member

The following could, theoretically, be added to the Congressional Record parser:

  • Foreign travel expenditure reports
  • Congressional authority statements

Any feedback on whether these would be worth doing? Would there be a better source for this information? Are there any other additional documents in the Congressional Record that would be useful to parse?

@konklone
Copy link
Member

konklone commented Feb 5, 2014

Amendment text might be an excellent get - see the conversation at unitedstates/congress#52.

Constitutional authority statements also show up on Congress.gov, e.g. go to HR 5 and click the "Constitutional Authority" link. (Though they are clearly doing this by yanking it out of the CR, there's a CR page number on the popup.)

@nclarkjudd
Copy link
Contributor

Current version of the parser is pretty good at identifying speeches but can't pick up on much else. There's a probabilistic test in the test suite that fails and should continue to fail until all text in each record page is correctly parsed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants