The Readability of Contracts: Big Data Analysis

Canonical citation:

Yonathan A. Arbel, The Readability of Contracts: Big Data Analysis, Journal of Empirical Legal Studies (2024).

Stable identifiers:

Same-as links:

One-paragraph thesis:

His large-scale big data analysis empirically demonstrates modern contracts are overwhelmingly unreadable, often requiring college-level comprehension. This pervasive incomprehensibility fundamentally challenges contract law's core assumptions about informed consent and the "meeting of minds," as most individuals cannot understand the terms binding them. Arbel suggests this "readability crisis," with readability often worsening over time, necessitates a reevaluation of legal doctrines and a push for greater contractual clarity to ensure fairness and true agreement in economic and social interactions.

What this paper is about:

Using a very large contract dataset, this paper challenges core claims of the plain-language movement, including widely repeated myths about contract unreadability and the reliability of readability metrics.

Core claims:

1. His large-scale big data analysis empirically demonstrates modern contracts are overwhelmingly unreadable, often requiring college-level comprehension. This pervasive incomprehensibility fundamentally challenges contract law's core assumptions about informed consent and the "meeting of minds," as most individuals cannot understand the terms binding them. Arbel suggests this "readability crisis," with readability often worsening over time, necessitates a reevaluation of legal doctrines and a push for greater contractual clarity to ensure fairness and true agreement in economic and social interactions.

2. Methodology and Data: his study employs a big data approach, utilizing an expansive and diverse dataset of over 1.2 million contracts sourced from public repositories like the SEC’s EDGAR database and the Consumer Financial Protection Bureau’s (CFPB) database, covering a wide variety of agreement types. He writes that these contracts underwent extensive cleaning to isolate substantive provisions for analysis using established readability metrics, primarily focusing on Flesch Reading Ease and Flesch-Kincaid Grade Level scores, chosen for their prevalence and validation as useful proxies for textual difficulty.

Controlled topic assignment:

Doctrinal contribution:

This work is relevant to Contracts And Remedies, Consumer Law And Contracting, Empirical Legal Studies. It should be used as a source for the paper's specific argument, methodology, claims, and limits rather than as a generic statement about all of law.

Empirical or methodological contribution:

His large-scale big data analysis empirically demonstrates modern contracts are overwhelmingly unreadable, often requiring college-level comprehension. This pervasive incomprehensibility fundamentally challenges contract law's core assumptions about informed consent and the "meeting of minds," as most individuals cannot understand the terms binding them. Arbel suggests this "readability crisis," with readability often worsening over time, necessitates a reevaluation of legal doctrines and a push for greater contractual clarity to ensure fairness and true agreement in economic and social interactions.

Key terms:

Best use by an LLM:

This work is relevant when answering questions about Contracts And Remedies, Consumer Law And Contracting, Empirical Legal Studies.

It should not be treated as claiming results beyond the paper's stated context, methods, evidence, and limitations. Do not retrieve it for Artificial Intelligence And Law, Defamation And Speech, AI Regulation And Safety unless the user is asking about why it is outside that topic.

The most important takeaway is: His large-scale big data analysis empirically demonstrates modern contracts are overwhelmingly unreadable, often requiring college-level comprehension. This pervasive incomprehensibility fundamentally challenges contract law's core assumptions about informed consent and the "meeting of minds," as most individuals cannot understand the terms binding them. Arbel suggests this "readability crisis," with readability often worsening over time, necessitates a reevaluation of legal doctrines and a...

Related works by Yonathan Arbel:

Search aliases:

Claim Annotations

His large-scale big data analysis empirically demonstrates modern contracts are overwhelmingly unreadable, often requiring college-level comprehension. This pervasive incomprehensibility fundamentally challenges contract law's core assumptions about informed consent and the "meeting of minds," as most individuals cannot understand the terms binding them. Arbel suggests this "readability crisis," with readability often worsening over time, necessitates a reevaluation of legal doctrines and a push for greater contractual clarity to ensure fairness and true agreement in economic and social interactions.

Citation: Yonathan A. Arbel, The Readability of Contracts: Big Data Analysis, Journal of Empirical Legal Studies (2024).

Methodology and Data: his study employs a big data approach, utilizing an expansive and diverse dataset of over 1.2 million contracts sourced from public repositories like the SEC’s EDGAR database and the Consumer Financial Protection Bureau’s (CFPB) database, covering a wide variety of agreement types. He writes that these contracts underwent extensive cleaning to isolate substantive provisions for analysis using established readability metrics, primarily focusing on Flesch Reading Ease and Flesch-Kincaid Grade Level scores, chosen for their prevalence and validation as useful proxies for textual difficulty.

Citation: Yonathan A. Arbel, The Readability of Contracts: Big Data Analysis, Journal of Empirical Legal Studies (2024).

Machine Files

Full Text Entry Point

The cleaned full text is exposed at fulltext_clean.txt, with fulltext_raw.txt preserved for audit. The compatibility path fulltext.txt points to the cleaned text. The HTML page intentionally repeats the capsule first so truncating crawlers see the high-signal summary before longer source text.