Distant Reading the Law

Law as Data pp. 3–19
DOI: 10.37911/9781947864085.01

  1. Distant Reading the Law 

    Authors: Michael A. Livermore, University of Virginia; and Daniel N. Rockmore, Dartmouth College

 

Excerpt

The law is a textual enterprise—it is through the written word that legislatures, executive bodies, and judges make the law. The study of the law requires interacting with that text by reading statutes, regulations, judicial opinions, or other legal sources. For hundreds of years, the process of reading the law began by picking up a book. With the digitization of massive corpora of legal documents, advances in computer hardware, machine learning, and natural language processing techniques have made legal documents a form of “big data.” As a consequence, a new form of “reading”—grounded in quantitative analysis and mathematics—has taken hold. This new form of reading is poised to change how the law is studied and understood by providing fresh perspectives on old questions and spurring entirely new research agendas.

Distant Reading the Law

It is now common for cultural artifacts—be they texts, images, objects, or even cities—to be represented in digital form (Jones 2013). Some artifacts are generated natively as digital, such as online literary magazines, newspapers, or videos. Others are transformed from the physical to the digital, as occurs when a paper text is scanned, and still others exist as representations of an underlying, perhaps lost, physical artifact in the world, such as a virtual reality reconstruction of portions of Ancient Rome. Regardless, when realized in digital form, cultural objects become a form of data that can be subject to analysis through the tools and techniques of computer science, statistics, and mathematics.1 This change in form allows for quantitative analysis of phenomena that have, to date, largely been the exclusive province of qualitative methods. The analysis of large bodies of text is one such realm.

Document organization and retrieval is perhaps among the most familiar of the success stories of such work (Salton, Wong, and Yang 1975; Berry 2004; Berry and Castellanos 2008). Less familiar has been the use of related ideas to understand broad trends in textual corpora where large-scale temporal patterns in a collection of documents are of interest. Literary scholar Franco Moretti (2013) invented the term “distant reading” to characterize the growing use of quantitative tools to study literary texts.

Bibliography

Abraham, K. S., and G. E. White. 2013. “Prosser and His Influence.” Journal of Tort Law 6 (1): 27–74.

Barzun, C. L. 2015. “Inside-Out: Beyond the Internal/External Distinction in Legal Scholarship.” Virginia Law Review 101 (5): 1203–1288.

Baude, W., A. S. Chilton, and A. Malani. 2017. “Making Doctrinal Work More Rigorous: Lessons from Systematic Reviews.” Chicago Law Review 84 (1): 37–58.

Berry, M. W., ed. 2004. Survey of Text Mining. New York, NY: Springer.

Berry, M. W., and M. Castellanos, eds. 2008. Survey of Text Mining II. New York, NY: Springer.

Black, R. C., R. J. Owens, J. Wedeking, and P. C. Wohlfarth. 2016. US Supreme Court Opinions and Their Audiences. Cambridge, UK: Cambridge University Press.

Black, Ryan C., and James F. Spriggs II. 2008. “An Empirical Analysis of the Length of US Supreme Court Opinions.” Houston Law Review 45 (3): 622–682.

Blei, D. M. 2012. “Probabilistic Topic Models.” Communications of the ACM 55 (4): 77–84.

Blei, D. M., and J. Lafferty. 2007. “A Correlated Topic Model of Science.” Annals of Applied Statistics 1 (1): 17–35.

Blei, D. M., A. Y. Ng, and M. I. Jordan. 2003. “Latent Dirichlet Allocation.” Journal of Machine Learning Research 3:993–1022.

Calabresi, G., and A. D. Melamed. 1972. “Property Rules, Liability Rules, and Inalienability: One View of the Cathedral.” Harvard Law Review 85 (6): 1089–1128.

Chalfin, A., A. M. Haviland, and S. Raphael. 2013. “What do Panel Studies Tell Us About a Deterrent Effect of Capital Punishment? A Critique of the Literature.” Journal of Quantitative Criminology 29 (1): 5–43.

Corley, P. C. 2008. “The Supreme Court and Opinion Content: The Influence of Parties’ Briefs.” Political Research Quarterly 61 (3): 468–78.

Evans, M., W. McIntosh, J. Lin, and C. Cates. 2007. “Recounting the Courts? Applying Automated Content Analysis to Enhance Empirical Legal Research.” Journal of Empirical Legal Studies 4 (4): 1007–1039.

Hall, M. A., and R. F. Wright. 2008. “Systematic Content Analysis of Judicial Opinions.” California Law Review 96 (1): 63–122.

Holmes, O. W. 1897. “The Path of the Law.” Harvard Law Review 10 (8): 457–478.

Jones, S. E. 2013. The Emergence of the Digital Humanities. New York, NY: Routledge.

Klingenstein, S., T. Hitchcock, and S. DeDeo. 2014. “The Civilizing Process in London’s Old Bailey.” Proceedings of the National Academy of Sciences 111 (26): 9419–9424.

Law, D. S. 2016. “Constitutional Archetypes.” Texas Law Review 95 (2): 153–243.

Lee, T. R., and S. C. Mouritsen. 2017. “Judging Ordinary Meaning.” Yale Law Journal 127 (4): 788–879.

Macey, J., and J. Mitts. 2014. “Finding Order in the Morass: The Three Real Justifications for Piercing the Corporate Veil.” Cornell Law Review 100 (1): 99–156.

Moretti, F. 2013. Distant Reading. London, UK: Verso.

Mouritsen, S. C. 2010. “The Dictionary Is Not a Fortress: Definitional Fallacies and a Corpus-Based Approach to Plain Meaning.” BYU Law Review 2010 (5): 1915–1980.

Oldfather, C. M., J. P. Bockhorst, and B. P. Dimmer. 2012. “Triangulating Judicial Responsiveness: Automated Content Analysis, Judicial Opinions, and the Methodology of Legal Scholarship.” Florida Law Review 64 (5): 1189–1242.

Posner, R. A. 1996. “Pragmatic Adjudication.” Cardozo Law Review 18 (1): 1–20.

———. 2002. “Legal Scholarship Today.” Harvard Law Review 115 (5): 1314–1326.

Ramji-Nogales, J., A. I. Schoenholtz, and P. G. Schrag. 2007. “Refugee Roulette: Disparities in Asylum Adjudication.” Stanford Law Review 60 (2): 295–411.

Rauterberg, G., and E. Talley. 2017. “Contracting Out of the Fiduciary Duty of Loyalty: An Empirical Analysis of Corporate Opportunity Waivers.” Columbia Law Review 117 (5): 1075–1152.

Rice, D. 2014. “The Impact of Supreme Court Activity on the Judicial Agenda.” Law & Society Review 48:63–90.

Salton, G., A. Wong, and C.-S. Yang. 1975. “A Vector Space Model for Automatic Indexing.” Communications of the Association for Computing Machinery 18 (11): 613–620.

Shammas, C., M. Salmon, and M. Dahlin. 1987. Inheritance in America from Colonial Times to the Present. New Brunswick, NJ: Rutgers University Press.

Smith, J. L. 2014. “Law, Fact, and the Threat of Reversal from Above.” American Politics Research 42 (2): 226–256.

Spaeth, H. J., L. Epstein, A. D. Martin, J. A. Segal, T. J. Ruger, and S. C. Benesh. 2016. Supreme Court Database, Version 2016 Release 01.

Stiglitz, E. H. 2014. “Unaccountable Midnight Rulemaking? A Normatively Informative Assessment.” New York University Journal of Legislation & Public Policy 17 (1): 137–192.

BACK TO Law as Data