Crowdsource PACER liberation in tribute to Aaron Swartz

United States Courts logo

In late January, Aaron Greenspan announced “Operation Asymptote” to the influential Liberation Technologies mailing list:

In case anyone is interested, I’ve built a tool to crowdsource the downloading of PACER materials. You can find details here:

I looked into Operation Asymptote, and recommend it as an effective and poetic tribute to Aaron Swartz‘s memory. Here’s some background on how it works.

“PACER” stands for Public Access to Court Electronic Records. It’s a network of servers hosting case and docket information from federal district, bankruptcy, and appellate courts.

As far as open government history is concerned, PACER was ahead of its time, initially providing terminal access in libraries and office buildings as early as 1988, then moving to the web in 2001.

Its network architecture and system design have not kept pace with the times. Neither has its fee structure, which was increased to $0.10 per page in September 2011. Charges are even applied to search results, where a page is defined as 4,320 bytes. I suppose one could argue it makes sense that the Administrative Office of the United States Courts should charge a nominal fee for documents which are in the public domain if you consider the cost of running and securing the service, maybe even upgrading it now and then. But that’s not what the fees are exclusively used for. In fact, PACER makes a sizable profit and some of those funds are used in a slushy way by the U.S. Courts, enabling at least one court to purchase flat screen LCDs and audio speakers installed in court benches.

What other options are out there for accessing federal case law? Open government pioneer Carl Malamud says commercial ventures such as Lexis-Nexis, West Law, and Bloomberg Law compete for a $6.5 billion market built around extracting rents from this public commons:

Countless government lawyers, public interest lawyers, and solo practitioners are quick to point out that they are priced out of the market and cannot afford access to the tools they need for their job. For the rest of us, the law truly has been locked up behind a cash register, affordable only to those who can pay the enormous price. We are a nation of laws, but the laws are not publicly available. This is a fundamental issue for democracy, for if we are a nation of laws, we must be able to consult the cases and codes of our government.

This brings to mind something important Jacob Appelbaum said the other day:

The old phrase “Ignorance of the law is no excuse” really rings hollow in an era of secret law.

The PACER system excludes a segment of the public as well as law practitioners who cannot afford access to the case law, which enforces its own form of ignorance. When Aaron Swartz met Steve Schultze in 2008 and learned about the PACER system, it seems he recognized an injustice and decided to do something about it. And as seems emblematic of what I have learned of Aaron Swartz’s ways, he outsmarted an institution with the assistance of technology. Here’s Steve Schultze’s description of meeting Aaron Swartz, the idea for a “Thumb Drive Corps” to liberate PACER documents from 16 public libraries temporarily granted free access, and Aaron Swartz’s automation of that process so he could download 2.7 million files in two days.

Steve’s post also describes the provenance of the technology underlying Aaron Greenspan’s proposed Operation Asymptote, the RECAP Firefox plugin.

I called up one of the authors [of the paper “Government Data and the Invisible Hand”], Ed Felten, and he told me to come down to Princeton to give a talk about PACER. Afterwards, two graduate students, Harlan Yu and Tim Lee, came up to me and made an interesting suggestion. They proposed a Firefox extension that anyone using PACER could install. As users paid for documents, those documents would automatically be uploaded to a public archive. As users browsed dockets, if any documents were available for free, the system would notify them of that, so that the users could avoid charges. It was a beautiful quid-pro-quo, and a way to crowdsource the PACER liberation effort in a way that would build on the existing document set.

As a result, we have the RECAP collection at The Internet Archive which as of this writing consists of 851,083 items.

Here’s the RECAP website where you can install the plugin, or browse the archive.

And here’s the next piece of the puzzle:

The Judicial Conference of the United States approved a measure in March 2010 stating that you will not owe a [PACER] fee unless your account accrues more than $10.00 of usage in a given quarter. In September 2011, this amount was increased to $15.00. If you accrue less than $15.00, your fees are waived for that quarter and your billing statement will have a zero balance. This policy change will be effective for the July 2012 statement.

So that means that any individual using PACER can download 150 pages every quarter for free. If you use the RECAP plugin while you are doing it, those pages are automatically uploaded to the Internet Archive where they become true public records without having to do anything except click on a link. Here’s the PACER registration page, where you will need a credit card to set up an account but don’t necessarily have to be charged fees.

Don’t know what to download? That’s where Aaron Greenspan’s Operation Asymptote and his public access law website PlainSite can help. As he explains in his post announcing the project, Aaron Greenspan wanted to find out all about Assistant United States Attorney Stephen P. Heymann, who played a role in prosecuting Aaron Swartz’s case. And he did. Here’s all of Heymann’s cases.

Now he wants to make “every U.S. Attorney and [Assistant U.S. Attorney]’s full career as a prosecutor available to the public to examine in its entirety.” So those are the links queued up in Operation Asymptote. Register with PACER, start Firefox w/ RECAP installed, navigate to the Operation Asymptote site, and begin clicking links till you reach $15 in charges, which you won’t be charged for.

That’s what you might call poetic justice.

Rebecca MacKinnon: Consent of the Networked

Consent of the NetworkedDon’t miss the two-week discussion with Rebecca MacKinnon, author of Consent of the Networked, about global online civil liberties, led by EFF-Austin’s Jon Lebkowsky, currently in progress on the WELL.

Now, I assume everybody here is familiar with the recent fight to kill the Stop Online Piracy Act (SOPA), where we saw a joining of forces between American Internet companies and activists against the entertainment industry and other American companies that fall under the category of “the copyright lobby.” It is unfortunate that some American businesses want to corrode people’s freedom to connect in order to protect their outmoded business models, and fortunate that other American businesses are putting some serious cash and lobbying muscle into countering them. But congress wouldn’t have halted its trajectory if it hadn’t been for the grassroots activists like and many others, as well as nonprofits like Wikipedia who brought a moral force to the argument that tipped the scales and mobilized voters to call their representatives.

When it comes to legislation like the Cyber Intelligence Sharing and Protection Act (CISPA) which passed the House and is on its way to the Senate, the role of American Internet companies is a lot more troubling. Despite concerns that this legislation lacks safeguards that would protect Americans from unaccountable spying by the NSA and others, many American businesses continue to support it because they are concerned about the security of their networks and want something to be done. They have yet to be convinced that they should only support legislation that contains adequate civil liberties protections. Which brings me back to the original point- achieving freedom to connect is much easier than achieving freedom from illegitimate, unaccountable surveillance.

Finally, there is the issue of what I call the power exercised by Internet companies over people’s identities and their privacy. This has more to do with freedom from fear than freedom to connect. To make a long story short, American companies like Google and Facebook do a much better job at freedom to connect than they do at freedom from fear. For a taste of what my book says about the lands of Facebookistan and Googledom, see this adapted excerpt in Slate:

As for activists in the USA, people are doing a tremendous amount of good work fighting to keep our own Internet open and free, despite a lot of political and commercial forces pushing in the opposite direction. American activists working for Internet freedom elsewhere around are most effective, in my view, when they start from the premise that Internet freedom faces threats absolutely everywhere, and that the United States is a far cry from a perfect model particularly on issues of surveillance. Showing up with an attitude that basically says “Hi, I’m a white night from the land of the free riding in to save you” doesn’t tend to go down well. A more effective attitude is “Hi, I’m here in solidarity to support you in your part of the global struggle. How can I be most helpful?” A number of times I’ve seen people from Egypt, Syria and China get asked that question. Often the answer is: “sort out your own country’s contradictions so that our governments can have better models to follow.”

Next Meetup: Sandy Stone on “Online identity and the fight for cyberfreedom”

Anonymous, ZModem, and Whiskey
Anonymous, ZModem, and Whiskey
Image Credit: Jacob Dexe - "Hacktivismen som demokrativerktyg"

“How in Hell Did We Get Here?: Online identity and the fight for cyberfreedom in the age of the Military-Industropolitical Complex”
by Allucquére Rosanne (Sandy) Stone

A fast-forward, semifictional history of online identity, with particular attention to the present collisions of massive political power and individual and collective agency, including how the speaker was transformed into a cat and survived the Great Hurricane of ’39 to become complicit in a Mexican Revolutionary Movement; with Graphical Illustrations, Extremely Bright Lights, and the Sound of Explosions. Maybe.

DATE: Thursday October 6th 7-9pm
NEW LOCATION: B.D. Riley’s Irish Pub and Restaurant [ @BDRileysAustin ], 204 E. 6th Street, Austin, Texas 78701; between Brazos and San Jacinto. We’ll be meeting in a dedicated space towards the back.
RSVP: Plancast

Allucquére Rosanne (Sandy) Stone [ website, wikipedia, cyborg anthropology entry ] is an academic theorist, media theorist, author, performance artist, and general troublemaker. She is Professor Emerita in the College of Communication at the University of Texas at Austin and Founding Director of the Advanced Communication Technologies Laboratory (ACTLab) in the department of Radio, Television and Film. Concurrently she is Wolfgang Kohler Professor of Media and Performance at the European Graduate School (EGS) and Founding Director of the radical new Experimental Media program ACTLab@EGS, senior artist at the Banff Centre for the Arts, and Humanities Research Institute Fellow at the University of California, Irvine. Stone has worked in and written about film, music, experimental neurology, writing, engineering, and computer programming. She is transgender and is considered a founder of the academic discipline of transgender studies, is the author of numerous books, novels, and essays, has been profiled in ArtForum, Wired, Mondo 2000, and many other publications, and Jon Lebkowsky has referred to her as “a force of nature.” She loves chocolate, cats and, apparently, getting herself into hair-raisingly scary situations from which escape is nearly impossible. Nevertheless she finds time to be a loving wife, boon companion, caring mother, and exemplary grandmother, while still running the hell all over the world to perform at conferences in too many disciplines to mention.

B.D. Riley’s on 6th Downtown