April 22nd, 2021

The Overruling Dataset: A Benchmark for Detecting Legal Decisions that Have Been Overruled

In conjunction with Stanford’s Regulation, Evaluation, and Governance Lab, we’re excited to present the Overruling Dataset: a benchmark corresponding to the task of determining when a sentence is overruling a prior decision. This is a binary classification task, where positive examples are overruling sentences and negative examples are non-overruling sentences extracted from legal opinions. In law, an overruling sentence is a statement that nullifies a previous case decision as a precedent, by a constitutionally valid statute or a decision by the same or higher ranking court which establishes a different rule on the point of law involved. The Overruling dataset consists of 2,400 sentences.

Casetext constructed this dataset by selecting positive overruling samples through manual annotation by attorneys and negative samples through random sampling sentences from the Casetext law corpus. This procedure has a low false positive rate for negative samples because the prevalence of overruling sentences in the whole law is low. Less than 1% of cases overrule another case and within those cases, usually only a single sentence contains overruling language. Casetext validates this procedure by estimating the rate of false positives on a subset of sentences randomly sampled from the law and extrapolating this rate for the whole set of random samples to determine the proportion of sampled sentences to be reviewed by human reviewers for quality assurance.

The Overruling task is important for lawyers because the process of verifying the authorities of cases are still valid and cases have not been overruled is critical to ensuring the validity of legal arguments. This need has led to the broad adoption of proprietary systems, such as Shepard’s (on Lexis Advance) and KeyCite (on Westlaw), as well as SmartCite (Casetext’s AI-based Citator system). High language model performance on the Overruling tasks could enable further automation of the identification of cases that are no longer good law.

You can download the Overruling dataset here. To learn more about how current models perform on the Overruling dataset, see this recent work.

The Overruling Dataset: A Benchmark for Detecting Legal Decisions that Have Been Overruled

How Products Liability Lawyers are Using Tech in Practice

Plaintiffs’ Lawyers Agree on the Importance of Law Firm Practice Modernization in Recent Survey, But Many are Unaware of Options

Featured posts

The world’s first generative AI legal assistant is a year old!

Techniques for writing effective legal AI prompts

What makes large language models tick?

The global race to regulate AI

AI innovation or AI regulation?

CoCounsel adds another new skill

AI’s biggest year (yet)

The estate planning practitioner’s guide to CoCounsel

Efficiently evaluating LLMs for legal tasks

Applying today’s legal ethics to today’s AI (part 2)

What does reliability mean when it comes to legal AI?

Applying today’s legal ethics to today’s AI (part 1)

Understanding the AI Executive Order

CoCounsel for criminal law practitioners

Preventing and preparing for law firm cybersecurity attacks is fundamental to success

Making the most of today’s AI takes a village

Legal AI is helping lawyers leave tedious contract work behind

Getting a jump on AI regulation

4 steps to acing your next deposition, using AI

4 steps to better Legal Research Memo results

With AI, you get what you give

Top 5 tips for better CoCounsel prompts

The right way to regulate AI

How CoCounsel sharpens everyday practice for litigators

4 steps to avoid becoming the next “ChatGPT lawyer”

Casetext to join Thomson Reuters, ushering in a new era of legal technology innovation

A 10-year overnight success

How to use AI and keep firm and client data safe

What it takes to build an AI legal assistant lawyers can rely on

Leading national law firm Dykema adopts CoCounsel to enhance services

GPT-4 alone is not a reliable legal solution—but it does enable one

CoCounsel

Why Casetext

WHO WE SERVE

RESOURCES

Legal Research

The Overruling Dataset: A Benchmark for Detecting Legal Decisions that Have Been Overruled

How Products Liability Lawyers are Using Tech in Practice

Plaintiffs’ Lawyers Agree on the Importance of Law Firm Practice Modernization in Recent Survey, But Many are Unaware of Options

Featured posts

CoCounsel

Why Casetext

WHO WE SERVE

RESOURCES

Legal Research

Business email

Draft Correspondence

Review Documents

Legal Research Memo

Prepare for a Deposition

Extract Contract Data

Contract Policy Compliance

Summarize

Search a Database

Skills