-
-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Don't set court='scotus' for South Carolina citations #84
Comments
You will never see SC in the year field of a Supreme Court decision (if the person who wrote it is citing things properly). Bluebook R. 10.4(b) State courts.
|
Thanks @devlux76. This looks like a great first bug. Any interest in trying to tackle it with a test and a fix? |
Sure! Do we know where this citation came from? Bowen v. Massachusetts, 487 U.S. 879, 108 S. Ct. 2722, 101 L. Ed. 2d 749 (1988) Bd. of Pub. Instruction of Palm Beach Cty., Fla. v. Cohen, 413 F.2d 1201 (5th Cir. 1969) Fort Sumter Tours, Inc. v. Andrus, 440 F. Supp. 914 (D.S.C.), aff'd, 564 F.2d 1119 (4th Cir. 1977)
But I don't see any of them with the above citation just doing a fulltext search. |
That'd be a question for @jcushman, but I suspect he wouldn't know anymore. At this point, it's worth just running with the example he gave. I'd make a test using it, make sure the test fails, then write the code to fix it. |
The one I encountered it in was the Bowen case. Not sure if it got my version from Courtlistener or Lexis, but in the Courtlistener one you link you'll see it if you search |
Lets review this for this sprint to see if this is still occurring and issue |
It seems that it doesn't just fail there, the plaintiff is incorrect, it returns If you try to parse something like: 'Foo 12334 v. Bar, 1 U.S. 1' the plaintiff only returns the number, the defendant is correct |
I think this happens because the current approach to get the plaintiff names is to get the two words before v., and the problem is that in those two words we count the spaces. For example, if we pass this string
The current algorithm looks for the stopword v. (It means there is a plaintiff), we use the stopword index from the word list to get the two previous elements in the list, in this case: `['Smith', ' '] which is correct. Here:
But it fails when the plaintiff has more than two words, for example: The algorithm will return this words list:
and the two words before v. are: I'm guessing this was set to two elements before v. because is common for plaintiffs to have short names. That's why I'm thinking about how we can adjust this. |
I'll move this to a new issue so we can close this one, now we can relate SC to South Carolina instead of scotus. A test case has been added. |
Eyecite thinks that South Carolina citations are SCOTUS citations:
The
SC
in the year could be ambiguous, but theF.Supp.
reporter should automatically rule SCOTUS out as a possibility for the court here.The text was updated successfully, but these errors were encountered: