Debates around the legitimacy of the Supreme Court and the uproar over Roe v. Wade towards the later half of 2022 influenced us greatly as we brainstormed ideas for our introductory machine learning project. As we pored through our options and studied the uses of AI, we arrived upon this project: can the predictive power of machine learning be used to predict the outcomes of Supreme Court cases?
Image from The Supreme Court Database Homepage
After some searching, we selected Washington University's Modern Dataset of past Supreme Court Cases for its breadth of data values and relevance to our current times.
Of the available dataset variations, we chose Case-Centered Data, which focused on the outcomes of specific cases.
The first challenge we encountered was figuring out our dataset. Because the outcome of a SCOTUS case is determined by so many different variables, there didn't appear to be many correlations between them at first.
We also had to do quite a bit of processing on the values in the dataset, since there were many null and non-Integer values.
Because none of us had have a strong foundation in law, much of the legal terminology also required research on our ends, so we could understand just what the dataset conveyed. After we overcame the first hurdle, we began building our models. We encountered the most errors when optimizing our model through K-Fold Cross Validation, which led to us doing many tasks manually. Lastly, as this was a first brush with AI and ACM Projects for many of us, we definitely had to work hard to manage our time and polish our skills!
Image from Deepnote's Homepage
All of our work was done on Deepnote, an virtual platform with processing power that allows for real-time collaboration. We created several notebooks in this environment: the first hosts our initial data loading, processing, and feature selection attempts (including our heatmap). The deep-learning notebook includes our neural network, and the pipeline notebook includes our exploratory data analysis, the sklearn pipeline we built for data processing, and our supervised learning models as well as their results.
View it and see the results here:
Check out our full process description and list of models right here!
Ada He | LinkedIn | Github
Nitya Pillar | LinkedIn | Github
Jenny Mar | LinkedIn | Github
Han Hoang | LinkedIn ! Github
Serena Cheng | LinkedIn | Github
Rushil Chandrupatla | LinkedIn | Github